I can't believe it. I was actually crying and panicking when I clicked on the video because of all that stress caused by "I don't know, I don't understand" thoughts. And your intro just killed me XD
I've learned more from this 18-min long video, than my whole course in transcriptomics. You are so good at explaining these complex processes. Thank you!
I know the feeling. Transcriptomics is so complex that most people, including those working in the field, seem to get "Betriebsblind" and unable to explain what they are actually doing. Josh Starmer is awesome, I just whish we had instructors like that in every research facility.
@@statquest same for me, I've understood this concept really easily thanks to you ( better than what I've understood in a 2h genetic engineering class )
Hey Josh, I just started doing my RNA sequencing and your video was very useful for me to get the basic idea behind the RNA sequencing. I spent hours and hours reading papers and didn't understand a thing about it. I really appreciate what you have done. I think your channel will play a big role in my research. Hats off your efforts in coming up with useful informative videos like this.
Awesome! That's great news. Since I made that video, I updated my PCA video to be easier to understand, so if you're interested in that, check out: ruclips.net/video/FgakZw6K1QQ/видео.html
@Highlander Yeti same for me! Just found out we're going to do RNAseq, immediately went on RUclips search instead of the classical science databases, came out here! I was already looking into (often expensive) courses, but it is all online! This is the future of learning. I should have had stuff like this 12 years ago to study Biomedical Sciences. That's why I'm on a mission helping future/current Biomeds and life scientists discover channels like this. Already BIG thanks from my part, StatQuest! I've yet to check out the rest of your stuff, but I WILL!
First you got me through my statistics course. Then you pulled me through my machine learning courses. Now, you will save me in my graduate genetics lab. Thank you so much Josh.
8:48 Two semesters of Bioinformatics and all I wanted was the sequence "GATTACA" to appear on any one slide during a lecture. Thanks, StatQuest for making my wish finally come true!
Hey! I'm Brazilian, and I've been learning English and molecular bioinformatics. I'm grateful for this video. Thanks a lot, your pronunciation is easy to understand and you have good didactics too.
Call this a 'gentle introduction' if you want. It is really just a high level overview of the process. Such presentations are important to building students mental models of any subject, and honestly are frequently overlooked in many subjects in my experience. Great stuff.
So the "+" symbol used to be a section where individuals could ad more info about their sequence. This is back when we had far smaller output. Since then, data has grown so much that we have developed tools to go through the data more efficiently. In the end, the "+" is simply an artifact from the early days of these files.
I'm not so sure about that, I think it was like that at the origin of this method, but in the new type of document layout, the + symbol could stand for "+ strand", which is often more investigated than the "- strand" of the double helix. I could be wrong, for sure, but I remeber like that
I am seeing this video in December 2021 and I can't thank you enough for this incredible concept clearing video of RNA-seq. Very Concise and subtle explanation, Besides your ENGLISH is really understandable for other language speakers. I am gonna check another playlist too.
Hey. I'm a clinician trying to do a basic sciences post doc. I haven't touched my biology notebooks in 10 years... and your Video makes it look... like I could maybe, one day, perhaps... UNDERSTAND transcriptomics. (Also makes me feel like an idiot, but in a gentle and kind manner). Thanks
This is actually one of the easiest-to-understand videos I have ever watched about this subject. Thank you for making it so that I can actually understand the information being taught!
@@statquest same comment! patiently explained without assuming we're all experts, written text of what you are saying verbally, simple, pedagogical, and systematic. It's great teaching, and so happy it's free!
"+" sign in raw data indicates the forward strand. If paired-end sequencing was performed there will be files with"+" or "-" signs that indicate forward or reverse strands, respectively.
this makes sense… i think its easy for the machine to figure out in the beginning since probably the oligo adaptors, which bind to the flow cell floor, are probably colour-coded for forward and reverse adaptors, and the adaptors themselves, are also probably colour coded, and the camera can pick up on this at the start, which means the algorithm can interpret forward and reverse strand locations and write it to the file.
Oh wow. Thanks for the explanation! I didn't really understand this concept until now. I've watched this video 3 times now because it is so comprehensive. Thanks a lot.
Hey JOSH…First thanks for making such great videos over the years. I see that you even left your job as a professor and have devoted full time to make this stuff since 2017. Appreciate it as it is difficult to break your comfort zone. One thing I would say after reading books on mastery is that all elite masters have this skill to explain a tough concept in 2 minutes and they can drag the same concept for hours and hours. From thousands of comments on your videos, majority says you make things extremely simple which is a validation of mastery. The second aspect of elite masters are after working for about a decade and mastering a field then they bring innovation or a new thing (a new concept, theory, a formula, anything that was never explained). I’m sure this would be that phase and stat quest is part of that, and I hope many things would come along with your creativity which you have exploited to make this elite content. There are masters but very few have this skill to make things simpler. Many people might comment that you have mastery or talent in statistics or in music but I can say without any doubt that your talent lies is making complex things easier (the medium could either be through a video or through a book don’t matter), but that’s your core talent which you have used. So I wish you keep using this rare talent and bring more contribution to all the humans. I will certainly support statquest by purchasing the guides and encourage all who have benefited from this content. Thanks again and lastly is there any email if I could send you a message or share something of value. Will appreciate it.
Thank you for the clear explanations. This was the first time I learned how the machine reads the bases by taking pictures from the top with single base annealing intervals. It was very informative. I am heading to watch your other videos. Many thanks!
My God Lord this was heaven sent. Any you guys are hilarious! I don't think it's "excessive self promotion" at all, I'm going to watch your PCA one right now!
These are great, I’ve been inundating myself with media about genomics and bioinformatics, but a lot of the specific techniques and terminology were still going over my head. Thanks for the thorough, but no-nonsense explanations!
Normally I get bored very quickly from explanation but hey this didn't feel like 18 mins at all :D Very useful content and you explain the ideas in pretty well organized way. Thank you that was really helpful ✔
I'm just starting my internship for my bioinformatics studies. Just watched the intro and feel so blessed to have a Video like this. So... thanks a lot ^^
Dear Josh, can you please make a video on small RNA-seq, I think you are the best teacher to explain small RNA-seq in detail. Hope you consider. Thank You.
@@statquest I want to say a hesitant yes :D I remember reading about orientation representation in NGS somewhere: It is almost universally + because only the forward/reverse strand is kept as one of the final steps of sequencing.
According to the "+" in fastQ, I have heard once, that in this line it used to be a repeat of the sequence name without "@" at the begining, to ensure that quality regards to the sequence above, but to lower file weight it has been switched to the "+". That's what I have heard, but it would be worth to approve somehow this information, I have no sources to do that.
I was taught in my course in metagenome and microbiome analysis, that the + line exists so you can put comments there! Never seen one with comments though.
MrErPeeeg is correct; but also it signals the end of the sequence lines. FastQ spec: @
+[]
A requirement is that the following '+' is optional, but if it appears right after '+', it should be identical to the following '@'. maq.sourceforge.net/fastq.shtml
The reason is that FASTQ is a combination of FASTA and QUAL files. Each of those has identifier followed by data. FASTA has sequence, QUAL has quality scores. interleaving these two pieces of information (2 lines from each file "zippered in") gives us 4 lines where 2 of the lines are identical. Later on, when machines got smarter, the third line became obsolete (and expensive space-wise) but is retained in a stunted form for backward compatibility.
You're the first person to ask about that. I wish I could tell you I would cover it soon. However, I have a long over due promise to make a series on neural networks and another series on time series and mixed models... So it might be a while to get to this. :(
Love the song! Also, thank you for explaining the fragments part. I didn't know that I don't fully understand the rationale behind fragmentation until you explained it. It's an important side note to understand RNA-seq.
I loved this explanation!!!. It is the most didactic I have seen of all. Congratulations and many thanks. It was very useful. I really appreciate what you have done.
I thought I understood this process until I started looking at a condition where a gene is on a region that should be deleted but the gene is still producing counts so watching this is as a refresher.
@@statquest it did but I'm still baffled. I'm getting counts from a gene thats on a region that is very clearly deleted. Either Ive got the genomic range of the deletion wrong and the gene is right on the border but not deleted orrrrr something has gone terribly wrong lol
+ symbol on the 3rd row is in aim to distinguish the sequence in the 2nd row from the data score in the last row. I know it sounds stupid but it is what it is.
Thank you so much for this video. It is a great starting point for me, and I have a feeling that I'll be coming back to your channel. Keep up the great work! :)
I loved the video...Very well explained...Can you also make a video about the analysis of raw RNA-Seq data, that is getting from raw reads to FC, FDR and p values for each gene...
This video is great! Could you do a video someday going into details of working with the raw data? Ie accessing it (terminal and fastq files), converting it... which software to use etc? I think you might be the only one who'd be able to explain hahaha
Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/
I can't believe it. I was actually crying and panicking when I clicked on the video because of all that stress caused by "I don't know, I don't understand" thoughts. And your intro just killed me XD
Bam! :)
Me too!! I was breaking down internally from exam stress, then i watched the intro and it was like a breath of fresh air :,,,,)
I've learned more from this 18-min long video, than my whole course in transcriptomics. You are so good at explaining these complex processes. Thank you!
Hooray! :)
I know the feeling. Transcriptomics is so complex that most people, including those working in the field, seem to get "Betriebsblind" and unable to explain what they are actually doing. Josh Starmer is awesome, I just whish we had instructors like that in every research facility.
@@statquest same for me, I've understood this concept really easily thanks to you ( better than what I've understood in a 2h genetic engineering class )
Hey Josh, I just started doing my RNA sequencing and your video was very useful for me to get the basic idea behind the RNA sequencing. I spent hours and hours reading papers and didn't understand a thing about it. I really appreciate what you have done. I think your channel will play a big role in my research. Hats off your efforts in coming up with useful informative videos like this.
Awesome! That's great news. Since I made that video, I updated my PCA video to be easier to understand, so if you're interested in that, check out: ruclips.net/video/FgakZw6K1QQ/видео.html
@Highlander Yeti same for me! Just found out we're going to do RNAseq, immediately went on RUclips search instead of the classical science databases, came out here! I was already looking into (often expensive) courses, but it is all online! This is the future of learning. I should have had stuff like this 12 years ago to study Biomedical Sciences. That's why I'm on a mission helping future/current Biomeds and life scientists discover channels like this. Already BIG thanks from my part, StatQuest! I've yet to check out the rest of your stuff, but I WILL!
Amazing video! Better than any textbooks or even my professor😂
First you got me through my statistics course. Then you pulled me through my machine learning courses. Now, you will save me in my graduate genetics lab. Thank you so much Josh.
No way! That’s totally crazy. Good luck with your lab! :)
8:48 Two semesters of Bioinformatics and all I wanted was the sequence "GATTACA" to appear on any one slide during a lecture. Thanks, StatQuest for making my wish finally come true!
Yes!!!! :)
I think it's great how there's an effort to speak and pronounce clearly; It really helps when you're not a native English speaker!
Thanks!
Great explanation! I'am bachelor's degree from biology of Universitas Indonesia. It's very useful for me to start learn RNA-seq scope. Big thankyou!!
Glad it was helpful!
Hey! I'm Brazilian, and I've been learning English and molecular bioinformatics. I'm grateful for this video. Thanks a lot, your pronunciation is easy to understand and you have good didactics too.
Muito obrigado! :)
This is one of the best videos for explaining RNA-seq. You are fantastic. Thank you very much.
Thank you! :)
I'm writing my bachelor's thesis about RNA-seq and usage of this method in the viruses life cycle, u helped me a lot in understanding this. Thank you!
Call this a 'gentle introduction' if you want. It is really just a high level overview of the process. Such presentations are important to building students mental models of any subject, and honestly are frequently overlooked in many subjects in my experience.
Great stuff.
Thanks!
This is the clearest explanation in this sequencing videos. Usually explanation of such technology is hard, but they did a great job.
Wow, thanks!
As a scientist with no previous experience in big data (who will need to be tuned in for my new research position), all I can say is THANK YOU.
Thanks!
I LOVE gentle intros
double bam! :)
Josh Starmer I love you, man. I have been spending hours reading and have only added stress and headaches after
Thank you so much
Happy to help!
The intro song really fits me lol. I'm literally crying and watching StatQuest right now.
Bam!
You stop my worring , stress and anxiety! thank you God and bless Josh Starmer!
Thank you very much! :)
So the "+" symbol used to be a section where individuals could ad more info about their sequence. This is back when we had far smaller output. Since then, data has grown so much that we have developed tools to go through the data more efficiently. In the end, the "+" is simply an artifact from the early days of these files.
I'm not so sure about that, I think it was like that at the origin of this method, but in the new type of document layout, the + symbol could stand for "+ strand", which is often more investigated than the "- strand" of the double helix.
I could be wrong, for sure, but I remeber like that
Fantastic video. This is the kind of gentle explanation students (including myself) need. Many thanks from Sweden!
Glad you enjoyed it!
I am seeing this video in December 2021 and I can't thank you enough for this incredible concept clearing video of RNA-seq. Very Concise and subtle explanation, Besides your ENGLISH is really understandable for other language speakers. I am gonna check another playlist too.
Thank you!
thank you for making this video so accessible by having captions for everything
Thank you so much for the video, I have been having headache to grasp the RNA seq concept, now I am set for transcriptomics module.
bam! :)
Hey. I'm a clinician trying to do a basic sciences post doc. I haven't touched my biology notebooks in 10 years... and your Video makes it look... like I could maybe, one day, perhaps... UNDERSTAND transcriptomics. (Also makes me feel like an idiot, but in a gentle and kind manner). Thanks
Good luck! :)
Less than half way through, already learned more than an entire online course!
bam!
this 18-min video is way more concise than 2-years study at Uni.
Thanks!
This is actually one of the easiest-to-understand videos I have ever watched about this subject. Thank you for making it so that I can actually understand the information being taught!
Glad it was helpful!
@@statquest same comment! patiently explained without assuming we're all experts, written text of what you are saying verbally, simple, pedagogical, and systematic. It's great teaching, and so happy it's free!
@@daedrmr2dae Thank you!
"+" sign in raw data indicates the forward strand. If paired-end sequencing was performed there will be files with"+" or "-" signs that indicate forward or reverse strands, respectively.
Thanks!
this makes sense… i think its easy for the machine to figure out in the beginning since probably the oligo adaptors, which bind to the flow cell floor, are probably colour-coded for forward and reverse adaptors, and the adaptors themselves, are also probably colour coded, and the camera can pick up on this at the start, which means the algorithm can interpret forward and reverse strand locations and write it to the file.
Oh wow. Thanks for the explanation! I didn't really understand this concept until now. I've watched this video 3 times now because it is so comprehensive. Thanks a lot.
Glad it was helpful!
Simple, concise, and straight forward. Well worth the 18 minutes.
Thank you! :)
Thank you for this. My current molecular bio professor recommedned we see it before our first class, and it was well worth the 18 minutes!
That's awesome! I'm glad you liked the video. :)
Hey JOSH…First thanks for making such great videos over the years. I see that you even left your job as a professor and have devoted full time to make this stuff since 2017. Appreciate it as it is difficult to break your comfort zone.
One thing I would say after reading books on mastery is that all elite masters have this skill to explain a tough concept in 2 minutes and they can drag the same concept for hours and hours. From thousands of comments on your videos, majority says you make things extremely simple which is a validation of mastery. The second aspect of elite masters are after working for about a decade and mastering a field then they bring innovation or a new thing (a new concept, theory, a formula, anything that was never explained). I’m sure this would be that phase and stat quest is part of that, and I hope many things would come along with your creativity which you have exploited to make this elite content.
There are masters but very few have this skill to make things simpler. Many people might comment that you have mastery or talent in statistics or in music but I can say without any doubt that your talent lies is making complex things easier (the medium could either be through a video or through a book don’t matter), but that’s your core talent which you have used. So I wish you keep using this rare talent and bring more contribution to all the humans.
I will certainly support statquest by purchasing the guides and encourage all who have benefited from this content. Thanks again and lastly is there any email if I could send you a message or share something of value. Will appreciate it.
Wow! Thank you very much!!! BAM! :)
how grateful i am to the universe to find this 20hours before my interview
Good luck! :)
I am also revising before my interview :D
@@aleksandrat.5058 good luck!
Very impressed how you explain pseudo-alignment! So clear and so concise! Amazing!!
I'm working on a new video that will do a whole analysis pipeline from start to finish and will include pseudo-alignment.
Thank you for this video! It help me a lot, I didn't find anywhere a clear explanation of this content.
Glad it was helpful!
My report on RNA-Seq is due in 74 hours and I’m just starting this. Fingers crossed.
Good luck!
Thank you for the clear explanations. This was the first time I learned how the machine reads the bases by taking pictures from the top with single base annealing intervals. It was very informative. I am heading to watch your other videos. Many thanks!
Glad it was helpful!
Wow this is such a god send for my presentation tomorrow
Good luck!
Bro, super love the various title songs you used for each videos, that attract me to the statistic world! Thank you!
Hooray!!! :)
My God Lord this was heaven sent. Any you guys are hilarious! I don't think it's "excessive self promotion" at all, I'm going to watch your PCA one right now!
Great description. Thank you.
Additional information/description about the sequence may be put in the third line that starts with the +
Thank you very much! :)
Thanks for this tutorial. It's much better than others I've viewed.
Thanks! :)
you are literally saving my entire life rn
Happy to help! :)
1:27 I love that you did this
These are great, I’ve been inundating myself with media about genomics and bioinformatics, but a lot of the specific techniques and terminology were still going over my head.
Thanks for the thorough, but no-nonsense explanations!
this is the clearest explanation on the internet about rna seq, everywhere else makes it sounds like gobbledygook
Hooray! I'm glad you like my video. :)
someone give this man a medal please
Thanks! :)
Normally I get bored very quickly from explanation but hey this didn't feel like 18 mins at all :D Very useful content and you explain the ideas in pretty well organized way. Thank you that was really helpful ✔
Thank you very much! :)
This is an amazing video! I have learnt a lot. We would like to see more videos on bioinformatics.
I'm just starting my internship for my bioinformatics studies. Just watched the intro and feel so blessed to have a Video like this. So... thanks a lot ^^
Hooray!!! Good luck with your internship. :)
Really helpful video, and extremely easy to understand for beginners, many thanks!
Glad it was helpful!
wonderful i always wondered how the dot like thing tell you about sequence ,lovely you cleared my basic doubt
bam!
This technique finally makes sense! Thank you so much, this really helped a lot. Very clearly explained and easy to follow.
Dear Josh, can you please make a video on small RNA-seq, I think you are the best teacher to explain small RNA-seq in detail.
Hope you consider.
Thank You.
I'll keep that in mind.
This is honestly amazing... the best explanation on RUclips for RNA-seq!!
This is such a good video. I felt lost until now. Thank you
bam! :)
im so glad I found ur channel to potentially save my course
Hooray and good luck! :)
For 7:30, the + symbol is the orientation I believe. That is, the direction from 3' to 5' or vice versa in which the sequence is read. Great video.
So is it sometimes a '-'?
@@statquest I want to say a hesitant yes :D I remember reading about orientation representation in NGS somewhere: It is almost universally + because only the forward/reverse strand is kept as one of the final steps of sequencing.
The intro song gets me every time ha. But honestly thanks for breaking down the concepts for a newbie like me, keep up the good work
Perfect for the lay-person!! Thanks much
You're welcome!
This is really well-explained! You got a suscriber! Hope you continue making videos
Thank you very much! :)
This is an incredibly brilliant playlist. Thank you sp much for your videos!
Glad you like them!
i love you. cozy stat vids. this is good for the soul
Thanks! :)
You are totally amaizing! Thank you so so much for so complete explanations . Some details are diff to find or figure out.
Thank you! :)
According to the "+" in fastQ, I have heard once, that in this line it used to be a repeat of the sequence name without "@" at the begining, to ensure that quality regards to the sequence above, but to lower file weight it has been switched to the "+". That's what I have heard, but it would be worth to approve somehow this information, I have no sources to do that.
I was taught in my course in metagenome and microbiome analysis, that the + line exists so you can put comments there! Never seen one with comments though.
MrErPeeeg is correct; but also it signals the end of the sequence lines. FastQ spec: @
+[]
A requirement is that the following '+' is optional, but if it appears right after '+', it should be identical to the following '@'. maq.sourceforge.net/fastq.shtml
The reason is that FASTQ is a combination of FASTA and QUAL files. Each of those has identifier followed by data. FASTA has sequence, QUAL has quality scores. interleaving these two pieces of information (2 lines from each file "zippered in") gives us 4 lines where 2 of the lines are identical. Later on, when machines got smarter, the third line became obsolete (and expensive space-wise) but is retained in a stunted form for backward compatibility.
Too many ads
Thank you for sharing, it is very useful to increase my knowledge of RNAseq
Glad it was helpful!
Your videos help me immensely! Thank you!
Thanks!
Clear, concise, well done! Loved it!
Thank you very much! :)
Thank you for explaining all the concept in a simple way for easy understanding. Could you please also post a video on pseudo-alignment in RNA-Seq
You're the first person to ask about that. I wish I could tell you I would cover it soon. However, I have a long over due promise to make a series on neural networks and another series on time series and mixed models... So it might be a while to get to this. :(
@@statquest Thank you so much for the reply. It is a well worth the wait.
Love the song! Also, thank you for explaining the fragments part. I didn't know that I don't fully understand the rationale behind fragmentation until you explained it. It's an important side note to understand RNA-seq.
Glad you liked it!
Thank you so much for this very gentle intro to RNA-seq. It helped me with my defense!
Hooray!!! Congratulations on your defense! I hope it went well. :)
Excellent course! Thank you for making these stuff so clear! It's much easier to understand!
Hooray! I'm glad the video was helpful. :)
I finally understand like ten things. Thank you!
Hooray! :)
I loved this explanation!!!. It is the most didactic I have seen of all. Congratulations and many thanks. It was very useful. I really appreciate what you have done.
Thank you so much!!! :)
I thought I understood this process until I started looking at a condition where a gene is on a region that should be deleted but the gene is still producing counts so watching this is as a refresher.
I hope it helped!
@@statquest it did but I'm still baffled. I'm getting counts from a gene thats on a region that is very clearly deleted. Either Ive got the genomic range of the deletion wrong and the gene is right on the border but not deleted orrrrr something has gone terribly wrong lol
@@stretch8390 Dang! Maybe you can use qPRC to verify the reads that you are seeing with RNA-seq
@@statquest Not only is your channel a wealth of knowledge your responses are insightful too! Thank you
Your channel is a blessing i might clear an interview after watching all this xD
Good luck! Let me know how it goes. :)
Thanks so very much for the distinguished way of your teaching.
Would you please make a video on "Multiple hypothesis testing" on gene expression data
How about this one: ruclips.net/video/NEaUSP4YerM/видео.html
Hey Josh, amazing content, incredible way of teaching.
P.S. 'excessive self promotion' killed me! Thanks for being ausum.
Thank you! :)
Hey, really good video!
This was very educational, straight to the point and very much coherent.
Thanks
Thank you!
Ya I even can watch Statquest when I am bored ... Great video
Nice!!! :)
thank you so much for the awesome simple explaining
Thanks!
Very nice. I'll use this in my genetics class
Awesome! :)
I just loved it! You made it so simple! Thank u so much!
Thank you! :)
Thank you! This is the best video I have ever seen! ❤️
Awesome!
best explanation for this on youtube.... Thanks man. You're the best.
+ symbol on the 3rd row is in aim to distinguish the sequence in the 2nd row from the data score in the last row. I know it sounds stupid but it is what it is.
Nice!
Amazing content ! Easy to understand and complete as possible for YT! Keep it up! Cheers !
Thanks Josh, great overview!!
Thanks!
good explanation and graphics keep up the good work
Thank you! :)
Thank you very much for your videos! They are indeed clearly explained.
You’re welcome!
Fantastic explanation! Thank you!
Thanks!
This is so clear! I'm so happy I checked this. Thank you!
You’re welcome! :)
I LOVE statquest
Thank you!
Thank you so much for this video. It is a great starting point for me, and I have a feeling that I'll be coming back to your channel. Keep up the great work! :)
I loved the video...Very well explained...Can you also make a video about the analysis of raw RNA-Seq data, that is getting from raw reads to FC, FDR and p values for each gene...
This video is great! Could you do a video someday going into details of working with the raw data? Ie accessing it (terminal and fastq files), converting it... which software to use etc? I think you might be the only one who'd be able to explain hahaha
I'll keep that in mind.
Thank you, good easy to understand explanation
Thanks! :)
Super helpful and clear! Thanks!
Glad it was helpful!
Thank you so. Well explained.
Glad it was helpful!
Nice explanation. Thank you sir!
Thanks!