How to interpret GSEA results and plot - simple explanation of ES, NES, leading edge and more!

Поделиться
HTML-код
  • Опубликовано: 12 июл 2024
  • In this video, I will focus on how to interpret the results from Gene Set Enrichment Analysis (GSEA) and to interpret the plots.
    Learn what are the main statistics given by GSEA and how to use them to make the most of your pathway enrichment analysis results, including how to interpret the Enrichment Score (ES), Normalised Enrichment Score (NES), p-values, FDR...
    We will go through basic GSEA terms like the ranking metric, the leading edge subset and more!
    Hope you like it!
    --------------------------------------------------------------------------------------------------------------------
    Watched it already?
    If you liked this video or found it useful, please let me know! Your comments and feedback are very much appreciated😊
    If you have questions, don't hesitate to leave me a comment down below, I will answer as soon as I can:)
    --------------------------------------------------------------------------------------------------------------------
    Are you into biostatistics and computational analysis?
    For more biostatistics tools and resources, you can visit:
    biostatsquid.com/
    Follow me on Instagram at @biostatsquid:
    / biostatsquid
    For more
    • simple and clear explanations of biostatistics methods
    • computational biology tools
    • easy step-by-step tutorials in R and Python
    to analyse and visualise your biological data!
    Don’t forget to subscribe if you don’t want to miss another video from me! --------------------------------------------------------------------------------------------------------------------
    Other interesting resources for GSEA:
    Original publication: www.pnas.org/doi/10.1073/pnas...
    You can conduct your own Gene Set Enrichment Analysis with GSEA Software:
    www.gsea-msigdb.org/gsea/inde...
    or if you want to program your way through it, I recommend the fgsea or clusterProfiler packages:
    bioconductor.org/packages/rel...
    bioconductor.org/packages/rel...

Комментарии • 50

  • @user-mp2qj3ip3d
    @user-mp2qj3ip3d Год назад +6

    Im so appreciated for this video that simplify the basic concept of enrichment analysis. Im look forward to topology-based method.

    • @biostatsquid
      @biostatsquid  Год назад +1

      Thank you for your comment! I'm glad you liked the video. Some more coding tutorials coming up but I will definitely write down topology-based methods in my to-do list:)

  • @fadingawayyyyy
    @fadingawayyyyy Год назад +5

    Thank you! Really helped in my understanding so much better compared to trying to read articles :') your hard work is much appreciated by us all here!

  • @mocabeentrill
    @mocabeentrill Год назад +2

    BiostatSquidee!!! My enduring gratitude as always! You're the best.

  • @aleon9166
    @aleon9166 Год назад

    this is simply amazing, cant wait for new videos!

  • @shreyarao7032
    @shreyarao7032 5 месяцев назад +1

    Your videos are a lifesaver! Thank you for making these

  • @Andrew-oq3fs
    @Andrew-oq3fs Год назад +1

    Thank you for this video! Helped me out alot!

  • @nancychuttani5831
    @nancychuttani5831 6 дней назад

    Amazing work

  • @bllimbyeee
    @bllimbyeee Год назад

    amazing teaching!! best gsea tutorial on RUclips omg this helped me so much, thank u!

  • @Tearr
    @Tearr 3 месяца назад

    Thanks a bunch! Wonderful video describing how to interpret GSEA!

  • @HH-ew5pd
    @HH-ew5pd Месяц назад

    Thank you for the clear explanation!! Great help!! Looking forward to upcoming videos:)

  • @bioinforbricker
    @bioinforbricker Год назад +2

    This is very impressive video for better understanding the GSEA results, thank you for your effort

  • @mihacerne7313
    @mihacerne7313 Год назад +1

    I love mountains but i love the ones in your video even more!

  • @thomaskizzar3403
    @thomaskizzar3403 2 месяца назад

    This is such a great video thank you so much!!!

  • @faezehrafiee6257
    @faezehrafiee6257 Год назад

    Thank you! you are the best!

  • @constanzarodriguez3165
    @constanzarodriguez3165 Год назад

    thank u so much for this videos!! 😍

  • @zazoudunet5756
    @zazoudunet5756 Месяц назад

    Thank you, very useful !

  • @xelaldaero9339
    @xelaldaero9339 Год назад

    Amazing!! Thank you

  • @denizkortik1716
    @denizkortik1716 Год назад

    you are amazing. please keep doing what you do. ı am grateful.😍

  • @Delios90
    @Delios90 Год назад

    Thanks a lot for this!

  • @efstratioskirtsios298
    @efstratioskirtsios298 9 месяцев назад

    Amazing!

  • @juhijaiswal4441
    @juhijaiswal4441 2 месяца назад

    Amazing

  • @gerardhoeltzel4690
    @gerardhoeltzel4690 6 месяцев назад

    elite explanation. ELITE I TELL YOU. thanks very much

  • @amrsalaheldinabdallahhammo663
    @amrsalaheldinabdallahhammo663 Год назад +1

    You are genius !!

  • @jgk9111
    @jgk9111 Год назад +1

    This is the best video ever

    • @biostatsquid
      @biostatsquid  Год назад

      Thank you! Glad you found it useful:)

  • @hossein37
    @hossein37 Год назад +1

    Great thanks

  • @cintiapalu1929
    @cintiapalu1929 2 месяца назад

    Beautifully explained! Keep up the good work, I'm a fan and will be spread your tutorials :

  • @nikelElegance
    @nikelElegance 4 месяца назад

    thank you alot

  • @juliachristiaanse2985
    @juliachristiaanse2985 8 месяцев назад

    thank you!

  • @Muuip
    @Muuip 10 месяцев назад

    Thanks! 🙂👍

  • @kayondofadhir8923
    @kayondofadhir8923 Год назад

    Great. Have u ever tried to plot the p-value distribution just to get a relationship of the p-value with the corresponding FDR out put?

  • @danielgladish2502
    @danielgladish2502 2 месяца назад

    2:14 I am not clear on what contributes to the magnitude of the increase/decrease of the running statistic (i.e. what number specifically is the input for the running statistic calculation). Is it the rank value? In the video you focus explicitly on fold change, but in the previous video you mentioned that rank is determined by both fold change AND significance.
    Great video by the way :)

    • @biostatsquid
      @biostatsquid  2 месяца назад +2

      Hey Daniel, thanks for your comment, great question! I tend to use -log10(pval)*sign(FC), to get a combination of both, but there's not a consensus in the community as far as I know. There's a few blogs/papers that discuss it: www.biostars.org/p/375584/

    • @danielgladish2502
      @danielgladish2502 2 месяца назад

      @@biostatsquid ah makes sense! So it sounds like there are a number of different ways of doing this. Thanks for clarifying and the quick reply! I will have a look at the link.

  • @shizhaocheng4422
    @shizhaocheng4422 Год назад

    I have a question? Is image with negative enrichment score wrongly placed? At 3 minutes and 41 seconds, the graph on the right side of the display is a list of genes with no specific distribution.

    • @biostatsquid
      @biostatsquid  Год назад

      Hi Shizhao, thanks for your question! I am not sure which graph are you referring to. If you mean the orange graph in front of the ES graph (with the fish), perhaps I could have made it more obviously distributed towards the lower part (negative diff expression), yes!

    • @shizhaocheng4422
      @shizhaocheng4422 Год назад

      @@biostatsquid Thank you for your reply. I have learned a lot from your video. cant wait for new videos! Thank you !

    • @biostatsquid
      @biostatsquid  Год назад

      @@shizhaocheng4422 I'm happy to hear that:) thanks for your question and feedback!

  • @jessehines4044
    @jessehines4044 Год назад +1

    Im confused at the end of the video. You said the q-value is the probability of the p-value for the test being wrong, ok but which p-value? The nominal or the adjusted one? Also, isnt the q-value just the adjusted p-value for multiple testing?

    • @biostatsquid
      @biostatsquid  Год назад +3

      Hi Jesse, it is a great question, p-values, q-values and p-adjusted values can be confusing. Yes, as you say, the q-value is an adjusted p-value for multiple testing.
      So, in simple words:
      p-val = chance of a false positive (i.e., if you use a p-val cut off of 0.05, it means you are taking a chance that there are 5% of false positives --> calling something significant when it is actually not)
      Problem of multiple testing - the more tests, the more chance of observing at least one significant result, even if it is actually not significant. We need to correct for this - for which we can use different methods:
      p-adjusted values: p values corrected using the (most commonly) Bonferroni correction. Usually too stringent.
      q--values: p-values corrected based on the False Discovery Rate (FDR) - now we are not taking about 5% of all results being false positives, but 5% of SIGNIFICANT results being false positives.
      Hopefully that made sense. If you want a more exhaustive explanation you might want to check my video on multiple testing correction: ruclips.net/video/LVKLyt1B35w/видео.html&embeds_euri=https%3A%2F%2Fbiostatsquid.com%2F&source_ve_path=Mjg2NjY&feature=emb_logo
      or blog post: biostatsquid.com/multiple-testing-correction-fdr/

    • @jessehines4044
      @jessehines4044 Год назад +1

      @@biostatsquid Thank you for the indepth reply. Ok, if the q-value is the adjusted p-value then the only thing I don't get is why is there a column for the adjusted p-value and a column for the q-value in the chart near the end of the video? Furthermore, their values are different for each row? Ohhh wait I just accidently skipped over the adjusted p-value section of your reply, sorry. Ok I see so difference between the q and adjusted p-values is the method used to correct for multiple testing(adjusted p-value yielded from the Bonferroni correction and the q-value from FDR). Thanks, that really clears it up!

    • @biostatsquid
      @biostatsquid  Год назад +1

      @@jessehines4044 Exactly! Glad it helped:)

    • @chusty93
      @chusty93 7 месяцев назад

      @@biostatsquidHold on. Now I'm even more confused. I thought that the adjusted p-value was a correction using Benjamini-Hochberg (as said in the video), not Bonferroni. Besides, I thought that Benjamini-Hochberg was a FDR-controlling method. So that means that both adjusted-p-val and q-val are FDR corrected? Please, help

    • @biostatsquid
      @biostatsquid  7 месяцев назад

      Hi @@chusty93 thanks for pointing that out! So p-adjusted values are just p-values corrected for multiple testing. This adjustment can be made using various methods: Bonferroni, BH....
      q-values are adjusted p-values that control the False Discovery Rate. There are several FDR-controlling methods but the most common one is Benjamini Hochberg. Therefore, when you see "p-adjusted," it often implies FDR correction, just because BH is much more common than Bonferroni, but it's essential to check the specific method used. Same way, if you have q-values, you know they are FDR-corrected p-values, and chances are they will have been corrected using BH since it's the most commmon one, but you should always check. Hope this clarified things, let me know!