Difference between RPKM/FPKM and TPM | RNA-Seq Normalization Methods | Bioinformatics 101

Поделиться
HTML-код
  • Опубликовано: 26 ноя 2024

Комментарии • 56

  • @Bioinformagician
    @Bioinformagician  Год назад +4

    NOTE: RPKM values at 11:23 were calculated using scaling factor 10. RPKM values at 14:40 were calculated using scaling factor 1 million. Apologies for not using the same table as 11:23 for consistency.

    • @devraj1989
      @devraj1989 10 месяцев назад +1

      Found your explanation very good. Can you clarify about the new scaling factor of 1 million ? I am not getting the value 6.66 even when using scaling factor 1 million for gene A Technical Replicate 1. Did you use the scaling factor of 1 million by considering the entire table and not only gene A and gene B ?

    • @OmaymaAlSaei
      @OmaymaAlSaei 10 месяцев назад

      Me too, I have the same question.@@devraj1989 @Bioinformagician

    • @FahimHoq
      @FahimHoq 6 месяцев назад

      She actually used scaling factor 10 here.

  • @hanyingwang2687
    @hanyingwang2687 2 года назад +10

    Very concise and easy to understand, especially for beginners. Thanks!

  • @zlj8435
    @zlj8435 2 года назад +4

    The best explanation that I ever have! Thanks a lot!!!

  • @wesleyeliasbheringbarrios8108
    @wesleyeliasbheringbarrios8108 2 года назад +4

    Very good explanation, perfect for begginers!

    • @snijesh
      @snijesh 2 года назад

      A perfect copy of another tutorial
      ruclips.net/video/TTUrtCY2k-w/видео.html

  • @Nguyenlinh-u2s
    @Nguyenlinh-u2s Год назад +2

    thank you for your video. Your explanation is very easy to follow.

  • @Moni-wq1yt
    @Moni-wq1yt 2 года назад +3

    Amazing! I just thought you missed explaining the concepts of sequencing depth. thanks

    • @Bioinformagician
      @Bioinformagician  2 года назад +1

      I shall note it down to explain it the next time I am covering any concept that involves sequencing depth. Thanks for bringing it to my notice :)

  • @d-gnome-wick7838
    @d-gnome-wick7838 2 года назад +1

    This is very illuminating. Thank you!

  • @trytry6569
    @trytry6569 2 года назад +4

    You're awesome please keep doing this work, I'll support you from my end.

    • @snijesh
      @snijesh 2 года назад

      A perfect copy of another tutorial
      ruclips.net/video/TTUrtCY2k-w/видео.html

  • @LAVANYAABSBTech_ME
    @LAVANYAABSBTech_ME Год назад

    have been searching this foe a soooooooo long time ...thankyouuu sooooooo much

  • @yu-shengwang7236
    @yu-shengwang7236 2 года назад +2

    Thank you so much! It's very easy to understand!!!

  • @BongTrotter0
    @BongTrotter0 11 месяцев назад

    fantastic .. Loved the explanation

  • @jaber_adam
    @jaber_adam 2 года назад +3

    Thank you for you videos. How can we do the RPKM for counts using R ?

  • @libiagomez2547
    @libiagomez2547 8 месяцев назад

    You are great at explaining! Thank you a lot!!!

  • @manuelsokolov
    @manuelsokolov Год назад +4

    Why can't you use TPM values for differential gene expression analysis?

  • @omarluciana
    @omarluciana 2 года назад +1

    Amazing explanation! Thanks

  • @divyarashmi3737
    @divyarashmi3737 2 месяца назад

    I had an query.. In the 2nd and 3rd lecture you have taken FPKM normalized data and demonstrated the gene expression between the samples and in this video lecture you are demonstrating that FPKM can't be used for Differential gene expression analysis.
    I am confused between the concepts.

  • @mocabeentrill
    @mocabeentrill 2 года назад +1

    Crystal clear! Thanks.

  • @harshasatuluri4540
    @harshasatuluri4540 2 года назад +1

    Thanks for making this video and Very very good explanation!

  • @PulseraenTumano
    @PulseraenTumano 2 года назад +2

    Thank you sooooo much!! Super helpful! :)

  • @gabrielac.k.7446
    @gabrielac.k.7446 Год назад +2

    Thank you very much for all your videos. It has helped me a lot to understand the analysis better since you explain it in a very didactic way. Please always continue with the channel.
    I would like to ask and clarify a doubt about normalization. I have RNAseq data made with selection by PolyA and other RNA-seq Total data. I would like to join this data in order to increase the sampling within some subtypes that I have few samples. Do you know any method or normalization process that makes this joining of RNA-seq PolyA and Total possible? I looked for this information in a lot of articles that work with multiple types of data, but they don´t detail how they did.
    Thank you very much

  • @ReySoraya
    @ReySoraya 2 месяца назад

    very clear, thank you !

  • @preciousoleh73
    @preciousoleh73 Год назад

    nice video, please how did u arrive at the final RPKM value because from your teaching we didn't get this value u used

  • @S_M_LA01
    @S_M_LA01 Год назад +1

    not able to determine FPKM in R, I have the mean fragment length, rawcount annotated with gene symbol and I have the gene length. Will you please help me with this? The problem is the same gene has multiple transcripts and each transcript has a separate mean length value.

  • @陈坤-f2l
    @陈坤-f2l 2 года назад

    thnaks for the explaination, very useful

  • @alicekao6305
    @alicekao6305 2 года назад +1

    What is count matrix data we put into the DESeq2? I'm confused for the term "raw counts". What are some common tools people used to get the raw count data mapping to the ref genome?

    • @Bioinformagician
      @Bioinformagician  2 года назад

      I have explained what raw counts are in this initial section of this video: ruclips.net/video/2RFYKTvCXHs/видео.html
      I have explained how to get raw counts from aligned reads here: ruclips.net/video/lG11JjovJHE/видео.html
      These videos should help clear your doubts.

  • @u.sangeethashenoy3988
    @u.sangeethashenoy3988 2 года назад +2

    Very nice explanation mam. thank you so much. Can you please clarify what's the difference between RPKM and FPKM. And is that possible to determine the fold changes from fpkm data and compare between control and test samples.

    • @Bioinformagician
      @Bioinformagician  2 года назад +1

      FPKM is analogous to RPKM and is used specifically in paired-end data.
      You can calculate fold changes from FPKM by taking a ratio of FPKM from test/FPKM from control.

  • @yijingwang7308
    @yijingwang7308 Год назад +1

    Thank you so much for your explanation! I wonder if I want to visualize a gene's expression across samples, which value should I use? In theory, TPM is the best, right? But since I only compare this gene, I do not need to consider about the gene length. The normalized counts from DESeq2 could also be used, right?

    • @alexandrebarroso4886
      @alexandrebarroso4886 Год назад

      I had the same question. To compare genes between samples, you need to use normalized raw counts, and not TPM. In your case, even if it is only one gene, I think you still need to use normalized raw counts, because the other genes are still influencing your data. TPM is ok only if you want to compare gene expression within one sample (Kallisto is a good mapping software which uses pseudoallignments to obtain TPM values).
      In my case, for comparation between samples, I converted TPM in log2CPM (counts per milion), filtered (removed genes with 0 expression) and normalized (trough the TMM method using edgeR). Then, I used those values for comparation by plotting a heatmap with the genes of interest.
      Hope I could help 🙂

  • @Harshraj19988
    @Harshraj19988 10 месяцев назад

    Thank you very much for such a nice and detailed information. I am looking for information on batch normalization of RNA sequencing data, I have observed bias in mapping rate.

  • @stemcell1167
    @stemcell1167 8 месяцев назад

    Hi
    I am supposed to perform a TPM normalisation of my counts Matrix. Can I use the steps explained here as it is or should I use any tool/ package for doing so

  • @davidmartins7104
    @davidmartins7104 5 месяцев назад

    Is there any problem applying TMP normalization in metagenomic paired-end sequencing data?

  • @tushardhyani3931
    @tushardhyani3931 2 года назад

    Thank you for this video !!

  • @Ali-on1eh
    @Ali-on1eh Год назад

    many tnx for your videos, have a question, if i want to use RNA seq data that is downloaded from TCGA for train the model can i use one of these three normalized method data if i can not please tell me what should i do?

  • @dr.niveditayadav8410
    @dr.niveditayadav8410 Год назад

    Could you please make a video to identify FPKM from DESeq2

  • @naveedkhan-fi6ux
    @naveedkhan-fi6ux 2 года назад

    thank you..... you are impressive

  • @abishek1143
    @abishek1143 8 месяцев назад

    Hi, 11:23 you show the RPKM values table with a gene length of 1.5kb, 2kb side by side, my question is a few seconds before you need to find out the 1kb value but why do you mention the gene length differently at the whole RPKM values 11:23

  • @juanete69
    @juanete69 5 месяцев назад

    Do I need to do alignment before counting?

  • @nandithasreeni
    @nandithasreeni 2 месяца назад

    how to get the gene length value for each gene? In this it is directly taken as 1.5 kb and 2 kb

  • @vahidgorganli8895
    @vahidgorganli8895 Год назад +1

    Thanks a lot🙂👍

  • @ahmedal-mammari9639
    @ahmedal-mammari9639 2 года назад +1

    thx a lot, plz can you make a video to teach us how to get gene lengths (width) b/c sometimes data don't have gene lengths (width) b/c i want to get cpm/rpkm,tpm

    • @Bioinformagician
      @Bioinformagician  2 года назад

      You can use biomart to get gene lengths. I have explained how to use biomart in one of my videos: ruclips.net/video/cWe359VnfaY/видео.html

    • @ahmedal-mammari9639
      @ahmedal-mammari9639 2 года назад

      @@Bioinformagician thank you so much

  • @lipeiqi3432
    @lipeiqi3432 2 года назад

    very god explanation for beginners, thank you!!! but why the RPKM table at 11:23 different from the one at 14:40?

    • @Bioinformagician
      @Bioinformagician  2 года назад

      Thank you for bringing it to my notice. RPKM values at 11:23 were calculated using scaling factor 10. RPKM values at 14:40 were calculated using scaling factor 1 million. I should have used the same table as 11:23 for consistency. My bad. I'll leave a note in the description.

  • @grsbiosciences
    @grsbiosciences 2 года назад

    Thank you madam

  • @swagatbaruah7839
    @swagatbaruah7839 Год назад

    if all the three are not suitable for gene expression in DEseq2 and edgeR, which method should i use

    • @Bioinformagician
      @Bioinformagician  Год назад +1

      They have their own normalization methods. Please refer to DESeq2 videos on my channel where I explain that in detail.

  • @Iman_1987
    @Iman_1987 2 года назад

    perfect, thanks