Why Dividing By N Underestimates the Variance

StatQuest with Josh Starmer

133K views

Published: Jan 6, 2025

Comments • 638

@statquest 4 years ago ⁺¹¹
Corrections:
3:23 I should have said "To understand why dividing by n underestimates the variation around the population mean".
3:40 The estimated mean was switched with the population mean.
Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/
@Viralvlogvideos 4 years ago
BAM BUM hahah
@mayurihazarika6550 1 year ago ⁺²
Please make a video on degrees of freedom, please 🙇
@m3c4nyku43 1 year ago ⁺²
At around 8:35, you should've used asterisk '*' character instead of 'x' character for multiplication. I was a bit confused and thought you wrote 2*(x-v)*x-1 instead of 2*(x-v)*(-1). Great video by the way!
@statquest 1 year ago ⁺³
@@m3c4nyku43 noted
@paulpaschert6215 5 years ago ⁺²⁸⁶
is there some sort of award we can give this guy? please?!
@statquest 5 years ago ⁺²¹
:)
@jacobmoore8734 5 years ago ⁺²⁷
I think we're encouraged to purchase a double bam t-shirt or sweatshirt, which is more of a financial incentive than an award but who doesn't like getting paid to be awesome? I'll probably pick one up this weekend
@statquest 5 years ago ⁺¹⁰
@@jacobmoore8734 Thanks! :)
@paulpaschert6215 5 years ago ⁺¹⁹
@@jacobmoore8734 award + t-shirt = double bam! just ordered my own shirt. gonna wear it to my statistics test in 2 weeks
@karankartik1327 4 years ago
Really, your way is so unique. One of the best.
@arun5351 4 years ago ⁺⁶⁸
Amazing Josh!!
I can't imagine how much hard work goes into simplifying the complex statistics concepts and coming up with these amazing videos. And on top of that your ingenious ideas of adding humor and musical creativity, taking the content to another level.
If there was an Oscar for tutoring you'd be the undisputed winner.
BAMMM !!- simply the best educator on YouTube....
@statquest 4 years ago ⁺⁹
BAM! :)
@ksrajavel 4 years ago ⁺¹⁵
Came to this video for "Why Dividing By N Underestimates the Variance" but got to know why absolute values are not used in Variance calculation. Literally cried, Prof. Josh.. Kudos to you. You are supporting me to understand the topics in statistics. I will support you regularly after I get a job soon. And I'm sure your teachings are required for many of the upcoming students in the coming decades. In India we have a concept called "Guru Kulam", and I see you as my guru (Not the term commonly known in the western world, this is more about respect)
@statquest 4 years ago ⁺⁴
Thank you so much!!! It means a lot to me.
@namanjain8939 5 years ago ⁺⁴¹
I searched for this on a number of online resources, some mentioned "n" while others "n-1", leaving me confused. This is the best possible explanation to the problem you made it really easy for us to understand. Thanks a lot !!! Bammmm subscribed and shared with friends.
@statquest 5 years ago ⁺⁵
Awesome!!! Thank you very much for subscribing and sharing my videos with your friends. :)
@ramkotha4726 4 years ago ⁺¹³
Josh, this is total hypnotism you did with BAMs, echoes, and other sounds. You mastered the art of making us stick there. I've been searching for statistics and machine learning videos that have a kind of roadmap and simple explanations for complex topics, and this is it. You saved my life for sure, my donation is on its way, I know anything is small for what goes into making these. Hats off to you, you are a LEGEND. We owe you...
@statquest 4 years ago ⁺²
Wow, thank you!
@arbanafal 5 years ago ⁺¹⁸
I have nothing but admiration; this is the clearest explanation that I've seen so far that does not shy away from the underlying math, yet still keeping it understandable for those with minimal math background. I feel like a bit of a fool when I see the contrast between my own attempts to explain this correction factor and your explanation.
@statquest 5 years ago ⁺¹
I'm glad you like the video so much. Thanks! :)
@nursahidassafaat6283 4 years ago ⁺⁸
I've spent 2 years asking how to plot variance and why sample variance (also sd) is divided by n-1. And this is the best explanation I've ever had.
@statquest 4 years ago ⁺¹
Awesome! :)
@dimiw5435 4 years ago ⁺¹
the best accessible explanation I can find in the whole internet for this mystery. then just as I was about to say "aha! you missed out something!" towards the end of the video, you seemed to have read my mind and "p.s. if you are wondering why n-1 and not 0.5 or 2 .... " you are so so spot-on!
@statquest 4 years ago
Thank you very much! :)
@naysannaderi5135 4 years ago ⁺¹
@@statquest I agree - best explanation i have found and i'm sharing this video with all my students. THANK YOU! So.... any chance that next video is coming out soon? (or has come out already?)
@statquest 4 years ago ⁺¹
@@naysannaderi5135 I hope the next video will come out soon. Possibly in the next 4 months or so. I hope!
@punktdotcom 5 years ago ⁺⁵
I'd rather get a clear and understandable explanation with "BAMS" like I'm five than a 50-page explanation with words like "trivial" and abbreviations (q.e.d.) that just leaves me depressed and clueless. And another very important thing: only if you *really* understand the topic can you explain it in easy words. Very well done, Josh! Thank you very much!
@statquest 5 years ago ⁺²
Thank you very much!!!! :)
@wowZhenek 3 years ago ⁺²
Yet another video from this channel that leaves me speechless. I've never really understood this concept until I've watched your video. Thank you very much, again.
@statquest 3 years ago
Wow, thank you!
@achannel9598 3 years ago ⁺²
Came from the video on calculating the mean, variance and SD. Did not expect a proof for why variance = x-bar. This is one of the best in-depth statistics videos I've ever watched. Thank you very much.
@statquest 3 years ago
bam!
@ireneylhsiao 5 years ago ⁺⁴
Bam!!! I've watched lots of your videos after I discovered the one explaining the standard error. You make me understand stats concepts more clearly. Please continue making these awesome videos (machine learning too)!
5 dollars donated!
@statquest 5 years ago
Thank you very, very much. I really appreciate it. :)
@keej7146 1 year ago ⁺³
Thank you for this!! The first time I saw the formula for the sample variance I wondered why the n-1 was there, this is a great explanation.
@statquest 1 year ago ⁺¹
Thanks!
@Calypso-rt5tf 1 year ago
hello, keej
i hate u mate
@davidh1876 5 years ago ⁺¹
Big thanks from Taiwan. I have been asking why we don't divide by n since high school... but all I got from my teacher was "a rule of thumb". Now I know the reason behind it, thanks to StatQuest. BAM!!
@statquest 5 years ago
Hooray! :)
@GbUnLimiteD 5 years ago ⁺¹⁴
Yes! I had already feared that the n-? question won't be explained. Glad to hear that you will explain this unsolved mystery in the next video!
@statquest 5 years ago ⁺¹⁴
Unfortunately, it will be a while before I get to it. I've got covariance and correlation coming up next, then a few machine learning videos, but then I'll loop back to expected values. It's a topic that I've wanted to work on for quite some time.
@2oqp577 5 years ago
My uneducated guess about n-x is that the bigger the magnitude diff. between the population and your sample size, the larger x would be. Because as this magnitude get smaller and smaller, the need for x to have any significant value, disappears. My biggest question is why would x lead to this unitary value when your sample size is little. But we'll see what Josh explains about that.
@nickp7526 5 years ago ⁺⁴
Intuitively: the number you're dividing by stands for the degrees of freedom you have. In other words: how many data points are allowed to vary freely. The reason that this is 1 less here is, as the video hinted at, because of the sample mean. If someone shared with you n-1 data points of their sample distribution of n points, and you know what the sample mean is, then you can easily calculate what the last data point is. I.e. that last data point doesn't have any freedom to vary, just because it was crucial in defining the sample mean. This doesn't matter if you know what the population mean is, precisely because the sample distribution didn't decide its value. Therefore all n values in a sample distribution with known population mean can be used to make an unbiased estimator, while only n-1 degrees of freedom can be used to have an unbiased estimator when all you know is the sample mean.
Mathematically: en.m.wikipedia.org/wiki/Bias_of_an_estimator
The first example (in the examples tab) shows why it should be n-1, and not n, or n-whatever.
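A minimal sketch of the point above, not taken from the video. It assumes a hypothetical population (one fair six-sided die) and enumerates every equally likely sample of size 3 drawn with replacement, so the averages are exact rather than simulated. The names `population`, `average_estimate`, and the choice of n = 3 are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product

# Hypothetical population (an assumption for illustration): one fair six-sided die.
population = [1, 2, 3, 4, 5, 6]
mu = Fraction(sum(population), len(population))                     # population mean
sigma2 = sum((x - mu) ** 2 for x in population) / len(population)  # population variance


def average_estimate(n, center, divisor):
    """Average sum((x - c)^2) / divisor over every equally likely sample of size n.

    center="sample" uses each sample's own mean for c; any other value uses
    the population mean mu.
    """
    samples = list(product(population, repeat=n))  # sampling with replacement
    total = Fraction(0)
    for s in samples:
        c = Fraction(sum(s), n) if center == "sample" else mu
        total += sum((x - c) ** 2 for x in s) / divisor
    return total / len(samples)


n = 3
around_sample_mean_n = average_estimate(n, "sample", n)              # divide by n
around_sample_mean_n_minus_1 = average_estimate(n, "sample", n - 1)  # divide by n - 1
around_population_mean_n = average_estimate(n, "population", n)      # known mu, divide by n

print("population variance:          ", sigma2)
print("sample mean, divide by n:     ", around_sample_mean_n)
print("sample mean, divide by n - 1: ", around_sample_mean_n_minus_1)
print("population mean, divide by n: ", around_population_mean_n)
```

In this toy setup, dividing by n around the sample mean averages to (n-1)/n times the population variance, while dividing by n-1, or dividing by n around a known population mean, averages to the population variance itself.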
@jsc3417 5 years ago ⁺⁴
Thank you, 10 years of confusion made clear by this 15 mins of video.
@statquest 5 years ago ⁺¹
Hooray! I'm glad the video was helpful. :)
@Ana-wx8jm 4 years ago ⁺²
I click the like button before I watch it because I'm always sure I'll love it! Thanks so much for making this series. You'll never know how helpful it has been in my life
@statquest 4 years ago ⁺¹
Hooray!!! Thank you very much! :)
@chathurijayaweera1590 2 years ago ⁺¹
Thank you for this explanation. When I was learning stat in university, I did not understand well, why we divide by (n-1) instead of n to estimate sample variance. You explained it so clearly in a way that I will never forget what I learnt. Thank you Josh !!!
@statquest 2 years ago
Hooray! :)
@mugssyy 4 years ago ⁺²⁴
Michael Scott: Why don't you explain this to me like I'm five?
Josh Starmer: Bammm!!
and understood ...
thank you : ) !
@statquest 4 years ago
BAM! :)
@Deepak-uv8du 3 years ago
@@statquest
Can you provide the slides for all the statistics videos you used to explain the concepts?
@statquest 3 years ago ⁺¹
@@Deepak-uv8du I have PDF study guides for some of my videos here: statquest.org/studyguides/
@cristianleoni6852 4 years ago ⁺²
Amazing explanation of why we use the square of the errors instead of the absolute value! I always asked myself that and all the teachers said it was just to give a bigger weight to the errors! We need the statquest on expected value!
@statquest 4 years ago
Thanks! I'm working on the expected value, but it still might be a few months before it's ready.
@theblinkingbrownie4654 11 months ago
I think for even n there wouldn't even be a minimum point, rather a flat line between the 2 middle samples
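A small sketch of the flat-minimum observation above, using a hypothetical four-point sample (the values in `data` are an assumption for illustration). It compares the sum of absolute differences with the sum of squared differences at several candidate centers.

```python
data = [1, 2, 4, 7]  # hypothetical sample with an even number of points


def sum_abs(v):
    """Sum of absolute differences between the data and a candidate center v."""
    return sum(abs(x - v) for x in data)


def sum_sq(v):
    """Sum of squared differences between the data and a candidate center v."""
    return sum((x - v) ** 2 for x in data)


mean = sum(data) / len(data)

# Print both criteria for candidate centers below, between, and above the
# two middle samples (2 and 4).
for v in [1, 2, 2.5, 3, mean, 4, 5]:
    print(v, sum_abs(v), sum_sq(v))
```

For this sample the absolute-value criterion takes the same value everywhere between the two middle points, so it has no single minimizer, whereas the squared criterion has one lowest point at the sample mean.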
@yildizkoca8878 8 months ago ⁺¹
This video is such a gem! Thanks for explaining the root of this concept which is not easy to find even in statistics books.
@statquest 8 months ago
Glad it was helpful!
@killua9369 3 years ago ⁺²
I have always hated statistics but I just today found this channel and this guy explains everything elegantly! ❤😊
@statquest 3 years ago ⁺¹
Wow, thank you!
@taotaotan5671 3 years ago ⁺¹
I just read wiki and found that even divided by n-1, we still underestimate the standard deviation (although we don't underestimate the variance anymore). I feel that's somewhat mind-blowing, since calculating sample std is such an ordinary job for statisticians, and it is surprisingly BIASED (and I am sure the standard error formula is also biased)...
@statquest 3 years ago
interesting
@taotaotan5671 3 years ago ⁺²
@@statquest Yeah. This is the wiki page.
en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
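A minimal sketch of the claim in this exchange, under stated assumptions: a hypothetical population of one fair six-sided die, and every equally likely sample of size 2 drawn with replacement. It averages the n-1 variance estimate and its square root over all samples and compares them with the population values.

```python
from itertools import product
from math import sqrt

population = [1, 2, 3, 4, 5, 6]  # hypothetical population: one fair die
mu = sum(population) / len(population)
sigma2 = sum((x - mu) ** 2 for x in population) / len(population)
sigma = sqrt(sigma2)

n = 2
samples = list(product(population, repeat=n))  # all equally likely samples


def unbiased_variance(sample):
    """Sample variance with the n - 1 divisor."""
    xbar = sum(sample) / len(sample)
    return sum((x - xbar) ** 2 for x in sample) / (len(sample) - 1)


# Average each estimate over every possible sample.
mean_variance = sum(unbiased_variance(s) for s in samples) / len(samples)
mean_std = sum(sqrt(unbiased_variance(s)) for s in samples) / len(samples)

print("population variance:", sigma2, " average estimate:", mean_variance)
print("population std:     ", sigma, " average estimate:", mean_std)
```

The average variance estimate matches the population variance (up to floating-point rounding), but the average of the square roots falls below the population standard deviation, because the square root is a concave function.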
@libertarianPinoy 5 years ago ⁺¹²
Kids today are so lucky they can review their stats online like this with great teachers.
@statquest 5 years ago
:)
@anujlahoty8022 5 years ago ⁺⁷
Awesome and the best video with the most simplified explanation.
@statquest 5 years ago
Thank you! :)
@tumul1474 5 years ago ⁺²
Statquest, JBstatistics and Khan Academy.....You guys are just amazing !!.....Thank you for all you have done for us
@statquest 5 years ago ⁺¹
Thank you! :)
@izebit 5 years ago
Thank you, I didn't know about these channels.
@Michael-zn4oq 4 years ago ⁺²
Thank you so much for the clear and simple explanation. This is an example for when showing the proof is better than only trying to give an intuition.
@statquest 4 years ago
Thanks
@anishchhabra5313 2 years ago ⁺¹
This is epic, never got a better or clearer explanation for this particular problem. Hats off!🙌
@statquest 2 years ago ⁺¹
Thanks a ton!
@rajarshibasak347 5 months ago ⁺¹
Aah! Finally, the end. What excellent work by you!! StatQuest rocks ❤.. Thank you sir. You helped a lot in my career ❤.
@statquest 5 months ago
Thanks!
@nizarch22 3 years ago ⁺¹
I don't even remember what I was confused about in particular, but I remember feeling very happy to see this video. Will revisit this in the following days. Psst, you're a gem ;)
@statquest 3 years ago
Thank you very much! :)
@PraveenKumar-yv5zn 4 years ago ⁺¹
This is the best explanation that I've come across for this. And I really liked that you gave a proof for general set of observations. Thanks a lot.
@statquest 4 years ago ⁺¹
Awesome, thank you!
@scuti7073 3 years ago ⁺¹
Man, I always thought that statistics doesn’t make any sense at all and that people should just blindly chug into weird formulas without questioning, but this was absolutely mind opening. Not even khan academy could explain the proof!
@statquest 3 years ago ⁺¹
Thanks!
@ryanmckenna2047 1 year ago ⁺¹
This channel is just incredible, well done!
@statquest 1 year ago
Thank you very much! :)
@DamosyTheFreckle 16 days ago ⁺¹
Josh, thank you so so so much for this amazing video!!!! It's really really helpful and is certainly a moment of enlightenment for me. But please please pleaseeeeeee, make the video about why specifically divide by one 😢😢😢, pleaseeeee
@statquest 15 days ago
One day I'll do it! I promise!
@ARM26878 3 years ago
BAM! I have not seen this concept explained better anywhere else ever. Have you gotten around to making the follow-up video on 'expected values' ? Can't thank you enough for your channel
@statquest 3 years ago
I've got the video on expected values ruclips.net/video/KLs_7b7SKi4/видео.html and ruclips.net/video/OSPr6G6Ka-U/видео.html , but there are still a few steps to go after that... :(
@haoqichen7610 2 years ago ⁺¹
The last point about absolute value explains a lot! I was always wondering why squaring data is so much more common than taking absolute values!
@statquest 2 years ago
bam! :)
@tippyandfriend 5 years ago ⁺⁴
This is excellent, I am looking forward to the next one.
@lyrachang950 1 year ago
im currently learning data analytics and trying to figure out ab testing and bam! here i am! thank you so much for making statistics fun and easy to understand! double bam!
@statquest 1 year ago
Happy to help!
@christopherchen4920 3 years ago ⁺¹
The most impressive explanation I've ever seen.
@statquest 3 years ago
Thanks!
@emmaning992 4 years ago ⁺²
I admire this explanation... Amazing. I really look forward to the expected values video!
@statquest 4 years ago ⁺¹
Thank you. I started working on the expected value video, but it will still be a while before I finish since I have many other projects to work on.
@edward8064 4 years ago ⁺¹
Mind = Blown.
Thank you from Indonesia.
@statquest 4 years ago
Thanks!
@alexandermedina4950 3 years ago ⁺¹
I can only have love for these videos, thank you Josh and all the team if you have any.
@statquest 3 years ago
Thank you! It's just me doing all this.
@brucewayne6744 5 years ago ⁺⁴
Great explanation!!
I'm loving every second of your videos!!! Cheers!!
@statquest 5 years ago ⁺¹
Thank you! :)
@morenomartinovic4385 5 years ago ⁺¹
I'm eagerly awaiting the expected values quest! Thank you so much for making these videos, I love watching them before sleep.
@statquest 5 years ago
Awesome! It's on the to-do list, but it might not be done for a while. :(
@morenomartinovic4385 5 years ago ⁺¹
@@statquest That's cool, take your time to keep making awesome videos. I still have loads of your videos on my to-watch list!
@OdysseusKingofIthaca-o4n 6 months ago ⁺¹
Thank you St Josh for this illuminating explanation :)
@statquest 6 months ago
My pleasure!
@magtazeum4071 3 years ago ⁺¹
8:22 the way he said "Whaat" is so cute.. I'm in love
@statquest 3 years ago ⁺¹
:)
@Drugio24 5 years ago ⁺⁴
this is literally what I was trying to get a clear understanding on in the last few days? what are the chances? no seriously what are the chances?
@statquest 5 years ago ⁺¹
That's awesome! :)
@lelamakharadze727 5 years ago ⁺²⁶
"Future is nooow, BAM " - #LOL #respect #welldone #thanks
@statquest 5 years ago
Thank you! :)
@Igor-vb1hv 4 years ago ⁺⁶
Thanks for explanation!
I understand that differences between the SAMPLE data and the sample mean are smaller than the differences between the SAMPLE data and the population mean. BUT! We are not interested in the difference between the SAMPLE data and the population mean, rather we are looking for the difference between the TRUE POPULATION data and the population mean (the population variance). And it's not clear why this value would be larger.
I mean sample data is centered around sample mean the same way population data is centered around population mean. Comparing sample data with population mean feels to be misleading.
@statquest 4 years ago
The best estimate we can do is the estimate of the variance around the sample mean, which is probably an underestimate, but not always. So this is the best we can do.
@HamidNourashraf 2 years ago ⁺¹
I love the way you explain these topics, great work!
@statquest 2 years ago
Thanks!
@marinasha2949 1 year ago ⁺¹
Good job Josh!! Waiting for StatQuest on Expected Values! I am the one wondering why not dividing by 'n-0.5' or 'n-2'
@statquest 1 year ago
Thanks!
@nidhiarora4739 4 years ago ⁺¹
I have been SO stressed out about a project I'm working on, and 3:15 made me laugh so hard!!! I didn't even realize how stressed out I was until I caught myself laughing for the first time in weeks. Thank you Josh!!! **sob**
@statquest 4 years ago
Hooray!!! Good luck with your project. I hope it goes well. :)
@ps_v.2.3.20 11 months ago
16:38, it's not resolved in the expected values video. How can I find that out, Prof.?
@statquest 11 months ago
Unfortunately I haven't had time to do the follow up video. The best I can do is give you this link for now: online.stat.psu.edu/stat415/lesson/1/1.3
@ps_v.2.3.20 11 months ago ⁺¹
@@statquest thanks for immediate response prof.😊
@timothymattnew 3 years ago ⁺²
I really want to understand why we use n-1 instead of substituting any other number instead of 1. I'm guessing it has something to do with the way we approximate the mean and the variance. I think it's related to properties the normal distribution has and such. I think that to truly understand that analytically I'd have to integrate over all possible outcomes while taking into account all the probabilities and then calculating the average. It really excites me, but I don't know where I can find the information needed to understand the subject in more depth. Can you give me some advice on what textbooks I should read, please? I'd really really appreciate that!
@statquest 3 years ago ⁺¹
See: online.stat.psu.edu/stat415/book/export/html/886
@timothymattnew 3 years ago ⁺²
@@statquest thank you, I will definitely read that!
@mleon12 2 years ago ⁺¹
Thanks!
@statquest 2 years ago
Wow!!! Thank you so much for supporting StatQuest!!! :)
@mukhtarbimurat5106 1 year ago ⁺¹
Greatest explanation so far!
@statquest 1 year ago
Thank you! :)
@thegamingannex5752 2 years ago ⁺¹
Your work is impeccable. BAM!
@statquest 2 years ago
Thank you!
@mansoorbaig9232 4 years ago ⁺¹
This is awesome explanation. Waiting for quest on 'Expected Values'....BAM!
@statquest 4 years ago ⁺¹
Me too. Hopefully I can get to it soon.
@Ujjwalchhabra1 4 years ago ⁺³
You left in a cliff hanger of expected values :((
Love your videos tho, thanks for these!
@statquest 4 years ago ⁺²
I'm working on it, but everything I do takes longer than I would like. :)
@chiragsomani101 3 years ago ⁺¹
ASTOUNDING EFFECTS & EXPLANATIONS!
SUBSCRIBED TRIPLE BAM!!!
@statquest 3 years ago
bam!
@sunaxes 1 year ago
Waited the whole video to know why it was n-1 and not n-2 etc... "that mystery will be resolved in the next episode"
Felt like watching an overstretched TV series in some way haha.
I understand it's crucial to show first that the sample variance underestimates the true variance, but you could mention earlier that you'll not focus on "why" it is n-1. :p
Thank you though, wonderful content!
@statquest 1 year ago
I tried to be careful with the title of this video with "Why dividing by N underestimates the variance" instead of "Why n-1 gives us an unbiased estimate".
That being said, I really wanted to explain exactly why n-1 works, but the proof is relatively advanced.
@yufeizhan726 4 years ago ⁺¹
I finally know why n-1 is used. Thank you so much!
@statquest 4 years ago
Bam!
@coldbrewed8308 10 months ago ⁺³
Oh no... I'm falling deeper and deeper into this rabbit hole
@statquest 10 months ago
:)
@ROTOBAfilms 1 year ago ⁺¹
You are a very great teacher, i like your coaching style, keep going on!
@statquest 1 year ago ⁺¹
Thank you! 😃
@radosawszostak6104 1 year ago
Great video! We clearly see that estimated variation is smaller than desired so we have to make it bigger. We can make it by dividing by n-1, but also by n-2 or n-1.5 or n-100. Why n-1?
@statquest 1 year ago
One day I'll make that video, for now, see: online.stat.psu.edu/stat415/lesson/1/1.3
@shashankupadhyay821 4 years ago ⁺¹
I usually hit like after the first BAMMM. This is some super great stuff Josh.
@statquest 4 years ago ⁺¹
Thank you very much! :)
@keysky_1622 5 years ago ⁺⁶
wow that n-1 has something to do with E(X)? Im waiting for it!
@dver7349 1 year ago ⁺¹
Super interesting! Thanks for your work!
@statquest 1 year ago
Thanks!
@samarthpatil2599 3 years ago ⁺¹
Loved the video. But didn't understand something clearly. The variance is the least around the calculated mean. But that is only when the data x remains the same right? How can you compare it with the population variance which has a lot more data points and the summation is therefore different?
@statquest 3 years ago ⁺¹
We are not comparing it to the population variance. We are simply comparing the variance of the data calculated around the sample mean to the variance of the data calculated around the population mean.
@Kornackifs 11 months ago
1:47
Draw it on the graph how
Are you talking about that curvy line around the histogram?
@statquest 11 months ago
The red line with arrows on each end represents the population standard deviation.
@Kornackifs 11 months ago ⁺¹
@@statquest yeah thanks a lot that's what i meant
@anandrathi871 4 years ago
I do understand the way you calculate the minimum variance
But at 14:54, how did you conclude that "Thus values around sample mean is always less than population mean"?
@statquest 4 years ago
Because the sample mean is the value that minimizes the variance, any other value will give you a larger variance.
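The reasoning in this reply can be written out as a short derivation. This is a sketch in standard notation rather than a transcription of the video's slides: $x_1, \dots, x_n$ are the sample values, $v$ is any candidate center, $\bar{x}$ is the sample mean, and $\mu$ is the population mean.

```latex
\begin{align*}
f(v) &= \sum_{i=1}^{n} (x_i - v)^2 \\
\frac{d f}{d v} &= \sum_{i=1}^{n} 2\,(x_i - v)\cdot(-1) = -2\sum_{i=1}^{n} x_i + 2 n v \\
\frac{d f}{d v} = 0 \;&\Longrightarrow\; v = \frac{1}{n}\sum_{i=1}^{n} x_i = \bar{x} \\
\frac{d^2 f}{d v^2} &= 2n > 0 \quad \text{(so $v = \bar{x}$ is a minimum)} \\
\sum_{i=1}^{n} (x_i - \mu)^2 &= \sum_{i=1}^{n} (x_i - \bar{x})^2 + n\,(\bar{x} - \mu)^2 \;\ge\; \sum_{i=1}^{n} (x_i - \bar{x})^2
\end{align*}
```

The last line makes the gap explicit: the two sums differ by $n(\bar{x} - \mu)^2$, which is zero only when the sample mean equals the population mean.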
@ginopeduto4264 4 years ago ⁺¹
THX!!! Looking forward for the STATQUEST on expected Values ;))))
@statquest 4 years ago ⁺¹
Me too!
@exoticcoder5365 1 year ago ⁺¹
15:46 it’s the god moment 👏🏻👏🏻👏🏻👏🏻
@statquest 1 year ago
bam!
@shouryanand456 4 years ago ⁺¹
I wish you were my stats teacher!! Amazing job!!!
@statquest 4 years ago
Thank you! :)
@shouryanand456 4 years ago
@@statquest really waiting for the expected value video to get explanation of n-1. When can we expect it?
@statquest 4 years ago
@@shouryanand456 Unfortunately, it might be a while. I've got a full plate until after the summer.
@Marius-vw9hp 5 years ago ⁺²
I wanted to know why we deduct exactly 1, but I guess that only takes 20 additional minutes to explain. Hooraay!
Thanks for the videos :)
@statquest 5 years ago
It's true. We have to dive into expected values and that is a whole new topic.
@vkvkvkvk 5 years ago ⁺³⁸
baaammm! subscribed.
@statquest 5 years ago ⁺⁴
Awesome! :)
@nursahidassafaat6283 4 years ago
BAAM! me too
@ujjwal2912 4 years ago ⁺²
Although you make everything look so simple, your teaching pedagogy requires a lot of hard work (to make the slides particularly). I hope that every teacher puts in the same kind of hard work and assumes their students to be in 5th grade; that way every class will be a pleasurable experience of life.
@statquest 4 years ago
Wow, thank you!
@waddragon 3 years ago
it is because some teachers don't know how to teach. They learn from textbook's concept. Memorize them, then give those back to students. I am not being rude but it is the reality. In order to be able to explain well to new learners, teachers must be able to understand the concepts well. Teaching is a hard skill to master. Nowadays, lot of taught concepts are assumed true or left blank during teaching . That's why if those students become teachers, they won't be able to explain.
@NuclearSpinach 3 years ago ⁺⁴
"The future is now" I'm dying
@statquest 3 years ago ⁺¹
BAM! :)
@rajkumarguptafx3907 6 months ago ⁺¹
Your Voice is magical 🌹🌹🌹
@statquest 6 months ago
Thank you!
@ipmankus 4 years ago ⁺¹
Very nice explanation, god bless you josh!
@statquest 4 years ago
Thank you! :)
@MrBlissTube 5 years ago ⁺³
Great video!
Where is the one about Expected Values?
I cannot wait with such a cliffhanger! GoT finale can wait...
@statquest 5 years ago ⁺¹
Very funny! Yes, I have my work to do. I hope to get to expected values before too long.
@MrBlissTube 5 years ago ⁺¹
@@statquest Thanks a lot for responding! ... and sorry, as I noticed after reading more comments, that you had already answered this question many times. Quest on!
@thepahadiboi 4 years ago ⁺¹
What an explanation.
I don't have money, else I'd have contributed.
The least I could do is share, which I already did.
BAM !
@statquest 4 years ago
BAM! :)
@DesertHash 4 years ago
I thought that the sample variance [ (Σ(Xi - X̄)^2)/n ] is trying to estimate the population variance [ (Σ(X - μ)^2)/N ], so at 16:04, why do we care about how the sample standard deviation estimates the variation in the *Data* around the population mean, as opposed to how it estimates the actual population variance (which is what it's set out to estimate)?
@statquest 4 years ago
Because the estimation of the variation in the data around the population mean is unbiased.
@sephirothjc 2 years ago ⁺¹
This is the best explanation ever
@statquest 2 years ago ⁺¹
Thank you!
@haugstve 5 years ago ⁺³
Nice! PS. Small typo at 8:42, you say -1 but write x-1.
@statquest 5 years ago ⁺²
That 'x' is a "times" symbol. So it's "times -1", not "x - 1"
@ginopeduto4264 4 years ago
@@statquest thx I was confused too - by the way - your videos among the very best one can find!!! Thank you so much!!!
@Lsazeh 2 years ago ⁺¹
Thanks so much for the explanation, super clear as always
@statquest 2 years ago
Glad it was helpful!
@MirrorNeuron 1 year ago
Hi Josh, where did you study this? Is it from Bessel's correction or Karl Pearson? I am interested in reading a bit about the history behind it. Can you please suggest a book or paper where the original discovery was made? Thanks in advance.
@statquest 1 year ago
The idea for this came from Bessel's correction.
@ThalesBrunoM 4 years ago ⁺⁴
8:21 -> I will watch a thousand times and I will laugh out loud a thousand times 😂
@statquest 4 years ago
Hooray! :)
@shubhamtalks9718 4 years ago ⁺¹
Man, you are great. From where did you learn these concepts? Keep making videos and enlighten us. Thank you.
@statquest 4 years ago ⁺¹
Thanks! :)
@shubhamtalks9718 4 years ago ⁺¹
@@statquest When I try to learn these concepts they seem complicated to me. From where did you learn these concepts?
@statquest 4 years ago ⁺³
@@shubhamtalks9718 The concepts seem complicated because people that do not really understand them try to teach them.
How did I learn them? Years of really hard work. I read everything I can about a subject, then I re-read it. Then I re-read it again. Then I make a program based on my ideas and see what happens. Then I re-read everything over again. And sooner or later I figure it out. But it takes a lot of time and a lot of work. Sometimes I worry I will not succeed, and sometimes I fail, but I keep trying anyway.
@shubhamtalks9718 4 years ago ⁺¹
@@statquest Thanks😁
@koreanbroadcastarchive306 3 years ago ⁺¹
Excellent. Thank you for a great explanation.
@statquest 3 years ago
Glad you enjoyed it!
@iAmTheSquidThing 5 years ago ⁺³
This intuitively makes more sense to me now. If I take a sample, the sample mean may end up being larger or smaller than the population mean. But the sample variance can never be larger than the population variance, it might be equal to it, but most probably it will be smaller.
@statquest 5 years ago
That's exactly right. :)
@wobwobvoid420 1 year ago
I think you have to be a little bit careful with what you mean by "sample variance" and "population variance". The claim holds as long as you're comparing an estimated population variance using the sample data and actual population mean vs an estimated population variance using the sample data and sample mean. But comparing the estimated population variance using the sample data and sample mean vs the actual population variance (all data and actual population mean) doesn't come with the guarantee that the sample variance will be lower than the population variance.
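A small sketch of the distinction drawn above, using made-up numbers: the population (one fair die) and the deliberately spread-out sample `[1, 6, 6]` are assumptions chosen for illustration, and both spreads divide by n.

```python
population = [1, 2, 3, 4, 5, 6]  # hypothetical population: one fair die
mu = sum(population) / len(population)
population_variance = sum((x - mu) ** 2 for x in population) / len(population)

sample = [1, 6, 6]  # a hypothetical, unusually spread-out sample
xbar = sum(sample) / len(sample)

# Same sample data, two different centers.
around_sample_mean = sum((x - xbar) ** 2 for x in sample) / len(sample)
around_population_mean = sum((x - mu) ** 2 for x in sample) / len(sample)

print("sample data around sample mean:    ", around_sample_mean)
print("sample data around population mean:", around_population_mean)
print("actual population variance:        ", population_variance)
```

For the same sample data, the spread around the sample mean is never larger than the spread around the population mean. In this example it is still larger than the actual population variance, so a single sample carries no such guarantee against the population variance itself.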
@a950721 4 years ago
Sorry I am still getting confused. At 5:29, in the inequality, both left hand side and right hand side are using the same n, which is the number of samples.
You argued that the right hand side is greater so that we need to make the left hand side larger by dividing n-1.
However, the right hand side is not the actual population variance. The actual population variance should be using a much larger n to calculate.
What we are doing here is to estimate the population variance but not the right hand side.
Thinking to this point, all the linkage seems broken. How can I relate the right hand side to the population variance?
It is true that the inequality holds. But it does not mean also the population variance is always greater than the left hand side.
Thanks for your videos. They inspire me and teach me a lot.
@statquest 4 years ago
Sometimes we know the population mean, but don't know the variance, so we still have to estimate it. That is what is going on on the right side of the equation.
@hafidhrendyanto2690 3 years ago ⁺¹
Amazing video!
I think that you should teach another subject. Maybe MathQuest? That would be amazing!
@statquest 3 years ago ⁺¹
Maybe one day!
@Patrick_Bentolila 1 year ago ⁺¹
Clarity brings understanding
@statquest 1 year ago ⁺¹
Bam! :)
@Lets_MakeItSimple 5 years ago ⁺¹
Hey josh, yet another cool explanation.
@statquest 5 years ago
Thank you! :)
@nth_prime 2 years ago
The end of this video suggests that the video on Expected Values will elaborate on why subtracting 1 is not arbitrary. The Expected Values video says that it is the first step towards getting to this video. Does anyone have the appropriate chronological order of the videos? Or if I'm stuck in a time loop, let me know!
@statquest 2 years ago
The expected value is a step in the direction of understanding why subtracting 1 is not arbitrary, but it doesn't get you all the way. Unfortunately I haven't made videos for the rest of the steps yet. If you're in a hurry, check out: online.stat.psu.edu/stat415/lesson/1/1.3
@nth_prime 2 years ago ⁺¹
Thank you! I will look forward to that and check this out for now.
@rishikeshpillay2732 3 years ago
I think there is a little problem at 8:47 where we use the chain rule: where does that x-1 come from? I know -1 is the derivative of -v, but why is there an x, and where does it go later?
By the way, nice explanation.
@statquest 3 years ago ⁺¹
The little 'x' means "times" and the big "X" is a variable. Sorry for the confusion.
@rishikeshpillay2732 3 years ago ⁺¹
@@statquest Thank you so much for the explanation. I get it now.
@tiekauntan3264 4 years ago
From 2:38 to 2:40, the symbol for estimated mean was switched to population mean. Great video anyway
@statquest 4 years ago
Ooops. That's a typo. Thanks for pointing that out. I've updated the pinned comment with it.
@Issacashish 3 years ago
Super content in the video. But I have a doubt. It's been mentioned that the sample mean will always be less than the population mean (14:58). But I think that this is a completely relative term. What if the sampled data has more samples with values higher than the population mean? I think the sample mean will be greater than the population mean. Kindly correct me if wrong.
@statquest 3 years ago
I believe you are misunderstanding what the video is saying. The equation shows the sample variance is always less than the population variance and the text says that that relation will always be true unless the sample mean is exactly the same as the population mean.
