Gene Set Enrichment Analysis (+ R tutorial)

Поделиться
HTML-код
  • Опубликовано: 8 сен 2024

Комментарии • 57

  • @omairshariq6444
    @omairshariq6444 3 года назад +32

    I'm in the closing stages of my PhD and needed to get my head around GSEA and clusterprofiler in a short period of time, and this video was EXACTLY what I needed. Thank you!

    • @LiquidBrain
      @LiquidBrain  3 года назад +5

      Glad that it helps! All the best to your PhD :)

    • @AqleemAbbas
      @AqleemAbbas 2 года назад

      @@LiquidBrain Hello Great, lecture Give me your wechat id

    • @AqleemAbbas
      @AqleemAbbas 2 года назад

      Great What is enrichment? and Enrichment score? What are samples? You mean RNA from different plants spp or same spp

    • @AqleemAbbas
      @AqleemAbbas 2 года назад

      What is difference between false discovery rate and type errors

    • @marijohnson5212
      @marijohnson5212 Год назад +1

      I'm exactly the same haha, bless this vid

  • @ashleyhoped
    @ashleyhoped 2 месяца назад

    I have been trying to figure out this part of my analysis for so long and was struggling with where to even begin but this video was perfect! So easy to understand and straight to the point, you're amazing. Thank you!

  • @clutch3171
    @clutch3171 5 месяцев назад

    very kind of you to explain this sort of material. it could easily take a book probably for some to get through it.

  • @maltam3873
    @maltam3873 4 месяца назад

    Thumbs up to you guys, what a wonderful channel, looking forward to see more bioinformatic enthusiasts in Malaysia

  • @Unaimend
    @Unaimend 2 года назад

    Wow I watched your WGCNA Tutorial a few weeks ago and how I have to analyse my results and jsut saw you also have a GO video. Thanks

  • @c.p.8689
    @c.p.8689 Год назад

    very helpful. Thanks for sharing the knowledge!

  • @SuperMixedd
    @SuperMixedd Год назад +3

    having subtitles would be nice

  • @player90nurken85
    @player90nurken85 Год назад

    Thank you very much for making this video
    It was very clear and straight to the point

  • @MrRamaeri
    @MrRamaeri 3 года назад +1

    thank you so much for your presentation

  • @user-yh4tx6ol9e
    @user-yh4tx6ol9e 2 года назад

    Thank you very much. I like your presentation and explanation.

  • @mortezahadizadeh9637
    @mortezahadizadeh9637 3 года назад

    Thanks for your valuable tutorials.

  • @Gabrielowns321
    @Gabrielowns321 2 года назад +1

    So informative and helpful thank you ☺️

  • @whobugui
    @whobugui Год назад

    Thanks for the video! I think the table accompanying the hypergeometric testing is not accurate in a sense that the row and column names should be switched to match the definition though it is mathematically equivalent.

  • @abebemisganaw7377
    @abebemisganaw7377 7 месяцев назад

    great

  • @Tiagomed96
    @Tiagomed96 Год назад +1

    Thanks for these nice videos, your tutorials are a great resource!
    I was wondering, for the GSEA analysis, should you not filter out noise? Like genes that are not DEGs based on p value alone (not fold change)? Otherwise, your input will be full of genes that are just noise, but GSEA only takes into account the ranked list of logFC. If a gene has a high logFC but it is not reproducible in your samples, it won't have a low pvalue and would be filtered out as noise in a DEG analysis. But if you input just a ranked list of all genes without filtering for pvalues prior, you might end up with many genes which are just noise?
    Thanks in advance!

    • @LiquidBrain
      @LiquidBrain  Год назад +2

      Hi thanks for liking the video! Yes, for GSEA analysis, I usually filter and use only the genes with significant pval from their differential expression analysis output. -Lind

  • @setarehsohail5422
    @setarehsohail5422 2 года назад

    Perfect!

  • @martbull12
    @martbull12 2 года назад

    Very helpful!

  • @johirislam8174
    @johirislam8174 2 года назад +2

    Great vedio.But I would have to clear something suppose i analysed a data set by deseq2 and found DEG .So now i have to convert the DEG symbol to entrez id for functional analysis.So will i now download the file into .csv and then cut the gene symbol from the file and save into a new excel sheet and import the R studio for the entrez id?After getting the id i again download the file and pick the entrez id and paste it in the first download file and remove the symbol?After that import the file and generate that.That'll about you mean to say the lecture??Am I write?Another thing is that My gene symbol is about Homo sapiens but you should there mmusculus.So what will be the code for homosapiens??

    • @LiquidBrain
      @LiquidBrain  2 года назад +1

      Hi Johir, for converting the gene symbol to entrez ID, you can do it in R using biomaRt package:
      library(biomaRt)
      gene_id

    • @johirislam8174
      @johirislam8174 2 года назад

      @@LiquidBrain i am not clear about your data code.and gene id means my selected gene id?Yourdata

    • @stretch8390
      @stretch8390 2 года назад +1

      @@johirislam8174 See how in defining the object "gene_id" attributes was listed with 3 things: ensembl_gen_id, external_gene_name, and entrezgene_id.
      The c(2,3) you are querying is columns two and three in gene_id. That is because of the comma in the square brackets means the selection is on columns and not rows, see: gene_id[ ,c(2,3)]
      I believe you will understand the x and y by looking at the documentation for function merge() in r.
      Hope this helps!

  • @kdr9608
    @kdr9608 3 года назад

    Thanks!

  • @gianni_drm
    @gianni_drm Год назад

    What if I don't have DEGs but instead I have co-expression modules. I have a gene list with associated GO terms, but I don't have any LFC or values. Is it still possible to use clusterProfiler?

    • @LiquidBrain
      @LiquidBrain  Год назад

      Yes, you can use enrichplot package for the visualization :)

  • @jasonlee416
    @jasonlee416 Год назад +1

    Hi, thank you very much for the video. I am really new to R studio, and this maybe a stupid question, but could you kindly expand a bit more on the GOplot (circle_dat and GOChord) please? For example, I have an RNA-seq data (species: rat) with EMSEMBL IDs and log2fold changes, and would like to generate a circular plot as you did in this video. Thank you!

    • @LiquidBrain
      @LiquidBrain  Год назад

      Hi thanks for your comment! Too bad that I afraid may not have time to demo properly, but I found the GOplot creator, Wencke Walter, has a very nice tutorial about it wencke.github.io/ hope it is useful for you :)

  • @javierhernando5063
    @javierhernando5063 Год назад

    Is it possible to apply this approach to RT-qPCR analysis? Obtaining relative quantification values (Ct or cycle threshold) normalized by its endogenous controls? Or just RNA-Seq?

    • @LiquidBrain
      @LiquidBrain  Год назад

      Yes, it is the downstream analysis to look for the enrichment of a set of gene-of-interest, as long as you supply the correct gene IDs and/or the ranking values (log2FC in my example), hope this helps :) - Lind

  • @chrislee8408
    @chrislee8408 Год назад

    is it possible to do a gene set enrichment analysis without doing a DEG? In my lab, we have just started doing NGS and we are still setting up our QCs. What we have in mind right now for one of our QC is to see if we can guess which sample (there are 4 samples) came from which tissues (heart, liver, kidney, diluent) by doing a gene set enrichment analysis to see if we can identify overexpressed genes which may only be expressed in specific tissues. Do you think it's feasible? Thank you.

    • @LiquidBrain
      @LiquidBrain  Год назад

      Oh you need to perform DGE. Differential gene expression analysis will come out a list of genes that the expression is distinguished from other samples, then GSEA is to understand what are terms/pathways that enriched in the DEGs. Hope this helps - Lind

  • @user-yh4tx6ol9e
    @user-yh4tx6ol9e 2 года назад

    Hi, I wrote a question about the argument of gseGO used in the script, but I found it deleted...
    can you please answer me if the argument "padjMethod" is neccessary in the analysis or not?
    I found several papers skipped with the argument, on the other hand, they used BH correction
    I would appreciate it for your advice. Thanks

    • @LiquidBrain
      @LiquidBrain  Год назад +2

      Hi it is a statistical way to perform the multiple test correction, BH is commonly applied , you may change to other method or use the default one as most people do :)
      Here’s for better understanding:
      www.rdocumentation.org/packages/stats/versions/3.5.0/topics/p.adjust

  • @supriyaawasthi2944
    @supriyaawasthi2944 2 года назад

    Hi,
    Really an awesome presentation...thanks a lot...can you please make video on analysing somatic mutation data from exome data analysis...or else suggest some good tools for somatic mutation data analysis in form of network analysis

    • @LiquidBrain
      @LiquidBrain  2 года назад

      Hi may I know more what kind of somatic mutation analysis do you refer to?

  • @trishakrishna939
    @trishakrishna939 Год назад

    Really hard to catch up with what she is saying.

  • @amrsalaheldinabdallahhammo663
    @amrsalaheldinabdallahhammo663 2 года назад

    Thank you genius, can you please make mitch analysis on R :)

  • @nitinshukla6532
    @nitinshukla6532 2 года назад

    I am stuck how to do same thing for non model organism such labeo rohita which is present on NCBI.
    It doesn't have any ensemble id or anything Please help

    • @LiquidBrain
      @LiquidBrain  2 года назад +1

      Hi Nitin, yes, you need to create the gene annotation reference list yourself, here's the tutorial by the author GuangChuang Yu guangchuangyu.github.io/2015/05/use-clusterprofiler-as-an-universal-enrichment-analysis-tool/
      hope it helps. -Lind

  • @ntsopoul
    @ntsopoul 2 года назад

    before doing gse are you filtering for pvaladj from your list of differential expressed genes?

    • @LiquidBrain
      @LiquidBrain  2 года назад

      Yes, based on what I understood, we can filter with adjusted p value -Lind

  • @chinhhoang2375
    @chinhhoang2375 2 года назад

    Thank you for your useful video. I used to run clusterProfier smoothly, but recently it has informed that "fail to download KEGG data...
    Error in download.KEGG.Path(species) " even though I used the same script that worked before. I would be grateful if you show me how to fix it. Thanks!

    • @LiquidBrain
      @LiquidBrain  2 года назад +1

      Hi it’s hard to diagnose with little info…could you let me to have a look on the script ? You may send it to liquidbrain.r@gmail.com

  • @amtd3808
    @amtd3808 6 месяцев назад

    I dont understand...

  • @palettenpeter331
    @palettenpeter331 2 года назад

    Thanks for the great video! When performing gsea: should I only use up- or downregulated genes or all genes?
    Thanks a Lot! :)

    • @LiquidBrain
      @LiquidBrain  2 года назад

      For GSEA, you can decide based on the aim for the experiment, but I believe the default ranking tries to place upregulated genes on the left and down regulated genes on the right. So you can observe the effects of both in the enrichment analysis. -Brandon

    • @palettenpeter331
      @palettenpeter331 2 года назад

      @@LiquidBrain thank you so much! Really love this channel

  • @AqleemAbbas
    @AqleemAbbas 2 года назад

    What is enrichment? Definition

    • @LiquidBrain
      @LiquidBrain  2 года назад +3

      Hi what do you want to know about the enrichment? Perhaps Google is your good buddy?

  • @shadabali9476
    @shadabali9476 Год назад

    difficult to understand whether you are speaking chinese or english

  • @ncedilemankahla9758
    @ncedilemankahla9758 Год назад

    sadly your stepwise guidance was not as great here