How to interpret GSEA results and plot - simple explanation of ES, NES, leading edge and more!

Подписаться 6 тыс.

Просмотров 22 тыс.

50% 1

In this video, I will focus on how to interpret the results from Gene Set Enrichment Analysis (GSEA) and to interpret the plots.
Learn what are the main statistics given by GSEA and how to use them to make the most of your pathway enrichment analysis results, including how to interpret the Enrichment Score (ES), Normalised Enrichment Score (NES), p-values, FDR...
We will go through basic GSEA terms like the ranking metric, the leading edge subset and more!
Hope you like it!
--------------------------------------------------------------------------------------------------------------------
Watched it already?
If you liked this video or found it useful, please let me know! Your comments and feedback are very much appreciated😊
If you have questions, don't hesitate to leave me a comment down below, I will answer as soon as I can:)
--------------------------------------------------------------------------------------------------------------------
Are you into biostatistics and computational analysis?
For more biostatistics tools and resources, you can visit:
biostatsquid.com/
Follow me on Instagram at @biostatsquid:
/ biostatsquid
For more
• simple and clear explanations of biostatistics methods
• computational biology tools
• easy step-by-step tutorials in R and Python
to analyse and visualise your biological data!
Don’t forget to subscribe if you don’t want to miss another video from me! --------------------------------------------------------------------------------------------------------------------
Other interesting resources for GSEA:
Original publication: www.pnas.org/doi/10.1073/pnas...
You can conduct your own Gene Set Enrichment Analysis with GSEA Software:
www.gsea-msigdb.org/gsea/inde...
or if you want to program your way through it, I recommend the fgsea or clusterProfiler packages:
bioconductor.org/packages/rel...
bioconductor.org/packages/rel...

Опубликовано:

14 июл 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 50

@user-mp2qj3ip3d Год назад

Im so appreciated for this video that simplify the basic concept of enrichment analysis. Im look forward to topology-based method.

@biostatsquid Год назад

Thank you for your comment! I'm glad you liked the video. Some more coding tutorials coming up but I will definitely write down topology-based methods in my to-do list:)

@fadingawayyyyy Год назад

Thank you! Really helped in my understanding so much better compared to trying to read articles :') your hard work is much appreciated by us all here!

@aleon9166 Год назад

this is simply amazing, cant wait for new videos!

@mocabeentrill Год назад

BiostatSquidee!!! My enduring gratitude as always! You're the best.

@shreyarao7032 5 месяцев назад

Your videos are a lifesaver! Thank you for making these

@Andrew-oq3fs Год назад

Thank you for this video! Helped me out alot!

@nancychuttani5831 8 дней назад

Amazing work

@mihacerne7313 Год назад

I love mountains but i love the ones in your video even more!

@bllimbyeee Год назад

amazing teaching!! best gsea tutorial on RU-vid omg this helped me so much, thank u!

@bioinforbricker Год назад

This is very impressive video for better understanding the GSEA results, thank you for your effort

@Tearr 3 месяца назад

Thanks a bunch! Wonderful video describing how to interpret GSEA!

@HH-ew5pd 2 месяца назад

Thank you for the clear explanation!! Great help!! Looking forward to upcoming videos:)

@faezehrafiee6257 Год назад

Thank you! you are the best!

@thomaskizzar3403 2 месяца назад

This is such a great video thank you so much!!!

@constanzarodriguez3165 Год назад

thank u so much for this videos!! 😍

@Delios90 Год назад

Thanks a lot for this!

@denizkortik1716 Год назад

you are amazing. please keep doing what you do. ı am grateful.😍

@xelaldaero9339 Год назад

Amazing!! Thank you

@jgk9111 Год назад

This is the best video ever

@biostatsquid Год назад

Thank you! Glad you found it useful:)

@zazoudunet5756 Месяц назад

Thank you, very useful !

@efstratioskirtsios298 9 месяцев назад

Amazing!

@amrsalaheldinabdallahhammo663 Год назад

You are genius !!

@gerardhoeltzel4690 6 месяцев назад

elite explanation. ELITE I TELL YOU. thanks very much

@hossein37 Год назад

Great thanks

@Muuip 10 месяцев назад

Thanks! 🙂👍

@juliachristiaanse2985 8 месяцев назад

thank you!

@juhijaiswal4441 2 месяца назад

Amazing

@nikelElegance 4 месяца назад

thank you alot

@kayondofadhir8923 Год назад

Great. Have u ever tried to plot the p-value distribution just to get a relationship of the p-value with the corresponding FDR out put?

@danielgladish2502 2 месяца назад

2:14 I am not clear on what contributes to the magnitude of the increase/decrease of the running statistic (i.e. what number specifically is the input for the running statistic calculation). Is it the rank value? In the video you focus explicitly on fold change, but in the previous video you mentioned that rank is determined by both fold change AND significance. Great video by the way :)

@biostatsquid 2 месяца назад

Hey Daniel, thanks for your comment, great question! I tend to use -log10(pval)*sign(FC), to get a combination of both, but there's not a consensus in the community as far as I know. There's a few blogs/papers that discuss it: www.biostars.org/p/375584/

@danielgladish2502 2 месяца назад

@@biostatsquid ah makes sense! So it sounds like there are a number of different ways of doing this. Thanks for clarifying and the quick reply! I will have a look at the link.

@shizhaocheng4422 Год назад

I have a question? Is image with negative enrichment score wrongly placed? At 3 minutes and 41 seconds, the graph on the right side of the display is a list of genes with no specific distribution.

@biostatsquid Год назад

Hi Shizhao, thanks for your question! I am not sure which graph are you referring to. If you mean the orange graph in front of the ES graph (with the fish), perhaps I could have made it more obviously distributed towards the lower part (negative diff expression), yes!

@shizhaocheng4422 Год назад

@@biostatsquid Thank you for your reply. I have learned a lot from your video. cant wait for new videos! Thank you ！

@biostatsquid Год назад

@@shizhaocheng4422 I'm happy to hear that:) thanks for your question and feedback!

@jessehines4044 Год назад

Im confused at the end of the video. You said the q-value is the probability of the p-value for the test being wrong, ok but which p-value? The nominal or the adjusted one? Also, isnt the q-value just the adjusted p-value for multiple testing?

@biostatsquid Год назад

Hi Jesse, it is a great question, p-values, q-values and p-adjusted values can be confusing. Yes, as you say, the q-value is an adjusted p-value for multiple testing. So, in simple words: p-val = chance of a false positive (i.e., if you use a p-val cut off of 0.05, it means you are taking a chance that there are 5% of false positives --> calling something significant when it is actually not) Problem of multiple testing - the more tests, the more chance of observing at least one significant result, even if it is actually not significant. We need to correct for this - for which we can use different methods: p-adjusted values: p values corrected using the (most commonly) Bonferroni correction. Usually too stringent. q--values: p-values corrected based on the False Discovery Rate (FDR) - now we are not taking about 5% of all results being false positives, but 5% of SIGNIFICANT results being false positives. Hopefully that made sense. If you want a more exhaustive explanation you might want to check my video on multiple testing correction: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-LVKLyt1B35w.html&embeds_euri=https%3A%2F%2Fbiostatsquid.com%2F&source_ve_path=Mjg2NjY&feature=emb_logo or blog post: biostatsquid.com/multiple-testing-correction-fdr/

@jessehines4044 Год назад

@@biostatsquid Thank you for the indepth reply. Ok, if the q-value is the adjusted p-value then the only thing I don't get is why is there a column for the adjusted p-value and a column for the q-value in the chart near the end of the video? Furthermore, their values are different for each row? Ohhh wait I just accidently skipped over the adjusted p-value section of your reply, sorry. Ok I see so difference between the q and adjusted p-values is the method used to correct for multiple testing(adjusted p-value yielded from the Bonferroni correction and the q-value from FDR). Thanks, that really clears it up!

@biostatsquid Год назад

@@jessehines4044 Exactly! Glad it helped:)

@chusty93 7 месяцев назад

@@biostatsquidHold on. Now I'm even more confused. I thought that the adjusted p-value was a correction using Benjamini-Hochberg (as said in the video), not Bonferroni. Besides, I thought that Benjamini-Hochberg was a FDR-controlling method. So that means that both adjusted-p-val and q-val are FDR corrected? Please, help

@biostatsquid 7 месяцев назад

Hi @@chusty93 thanks for pointing that out! So p-adjusted values are just p-values corrected for multiple testing. This adjustment can be made using various methods: Bonferroni, BH.... q-values are adjusted p-values that control the False Discovery Rate. There are several FDR-controlling methods but the most common one is Benjamini Hochberg. Therefore, when you see "p-adjusted," it often implies FDR correction, just because BH is much more common than Bonferroni, but it's essential to check the specific method used. Same way, if you have q-values, you know they are FDR-corrected p-values, and chances are they will have been corrected using BH since it's the most commmon one, but you should always check. Hope this clarified things, let me know!