R-squared, Clearly Explained!!!

Поделиться
HTML-код
  • Опубликовано: 29 июн 2024
  • R-squared is one of the most useful metrics in statistics. It can give you a sense of how good your model is.
    For a complete index of all the StatQuest videos, check out:
    statquest.org/video-index/
    If you'd like to support StatQuest, please consider...
    Buying The StatQuest Illustrated Guide to Machine Learning!!!
    PDF - statquest.gumroad.com/l/wvtmc
    Paperback - www.amazon.com/dp/B09ZCKR4H6
    Kindle eBook - www.amazon.com/dp/B09ZG79HXC
    Patreon: / statquest
    ...or...
    RUclips Membership: / @statquest
    ...a cool StatQuest t-shirt or sweatshirt:
    shop.spreadshirt.com/statques...
    ...buying one or two of my songs (or go large and get a whole album!)
    joshuastarmer.bandcamp.com/
    ...or just donating to StatQuest!
    www.paypal.me/statquest
    Lastly, if you want to keep up with me as I research and create new StatQuests, follow me on twitter:
    / joshuastarmer
    #statquest #statistics #rsquared

Комментарии • 708

  • @statquest
    @statquest  4 года назад +133

    NOTE: When I first made this video, I was thinking about how R-squared relates to Linear Regression, which will not fit a line worse than the mean of the y-axis values. This is because if the values along the x-axis are truly useless in terms of predicting y-axis values, then the slope of the line used to make predictions will be 0, and the intercept will equal the mean. However, it is possible to simply draw a line that fits the data worse than the mean and get a negative R^2.
    Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/

    • @mattkilgore7323
      @mattkilgore7323 4 года назад +1

      With enough variables in the data set, it would be easy to create a set of r-squared values so that the cumulative percent "explained by" the different variables goes over 100%. That's why I was never a fan of that terminology. Students think it implies causation when it doesn't. Otherwise, great video.

    • @statquest
      @statquest  4 года назад +28

      @@mattkilgore7323 Maybe I should have made it more clear, but if you have a large model with a lot of variables, then you don't add together a bunch of individual R-squared values to find the total R-squared. You calculate a single r-squared value fro the entire model. In other words r-squared refers to the models, not the individual variables.

    • @huiwang2854
      @huiwang2854 4 года назад

      StatQuest with Josh Starmer If you only consider all unbiased lines, (mean of predicted ys equal mean of real ys), then no negative R^2.

    • @sunilkumarsamji8507
      @sunilkumarsamji8507 4 года назад +1

      @@mattkilgore7323 Hi Matt can you explain the point you trying to make in a bit more detailed manner

    • @mattkilgore7323
      @mattkilgore7323 4 года назад +2

      The phrase "explained by" can be deceptive, as students often think it means "caused by." But this is not what it means in the context of r-squared. Does that help?

  • @dannysnee4945
    @dannysnee4945 4 года назад +230

    So glad this channel exists. It's rare that RUclips videos on stats are this well done

    • @statquest
      @statquest  4 года назад +6

      Thanks!

    • @shashankkhare1023
      @shashankkhare1023 4 года назад +4

      @@statquest You are a lifesaver. I am surprised you dont have more subsciptions. I would recommend your channel to my colleagues, thank you so much :)

    • @statquest
      @statquest  4 года назад +7

      @@shashankkhare1023 Thank you very much!!! Recommending my channel to your colleagues is the best complement you can give me. :)

    • @VinodKumar-nn7go
      @VinodKumar-nn7go 2 года назад

      you can watch and learn from Dr. Ami Gates. Her videos are great..

  • @ramprakash7872
    @ramprakash7872 5 лет назад +12

    You have explained the concept so neatly,clearly ( most importantly in an easier manner ) so that one could get deeper understanding of the concept, a fact that lot many text books / videos / articles failed to do. Keep making such videos !

  • @joenah5651
    @joenah5651 3 года назад +39

    Thank you so much for making this sooooooo clear, I've struggled to understand the meaning of R2 for a week and you just made it clear to me in 10 min.

  • @alecvan7143
    @alecvan7143 4 года назад +66

    I can't believe the simple relationship between R^2 and R was never made clear to me! Amazing as always!

    • @statquest
      @statquest  4 года назад +5

      Awesome!!!! Thank you very much.

    • @pablo_brianese
      @pablo_brianese 3 года назад

      I also appreciated his comments on the subject, and him sharing his opinions and intuitions.

    • @bleakmess
      @bleakmess Год назад

      Just a quick question?

  • @muffinman1
    @muffinman1 4 года назад +29

    "sniff/weight relationship" debunked by StatQuest. Give this man a Nobel. :)

  • @DD-hh4tz
    @DD-hh4tz 5 лет назад +7

    Your videos are so easy to understand, and also explains the intuition behind. I really love the way you start the video, unlike other bouring lectures.

    • @statquest
      @statquest  5 лет назад +1

      Thank you so much! :)

  • @dikshyapattanaik3528
    @dikshyapattanaik3528 3 года назад +6

    Your channel is blessing in disguise. Visual aids and the explanations are so smooth and easy to understand. Thank you very much.

    • @statquest
      @statquest  3 года назад

      Thank you very much! :)

  • @cartulinito
    @cartulinito Год назад +1

    I started following you 4 months ago, now I'm starting over from the very first video, I'll watch them all and understand everything.
    Thank you very much for this content.

  • @sanketbadhe3572
    @sanketbadhe3572 4 года назад +6

    I read a lot on R square from different books and articles but this was the really different and very intuitive approach. Visualization is the best way to understand statistics and I think most books lack there.

  • @kinvert
    @kinvert 4 года назад +265

    So all this time I spent sniffing rocks to grow bigger was for nothing???

    • @statquest
      @statquest  4 года назад +21

      Ha! You made me laugh. :)

    • @PeterXLuo
      @PeterXLuo 3 года назад +3

      haha

    • @antonbagaev1771
      @antonbagaev1771 2 года назад +1

      only if you are mouse

    • @andresrossi9
      @andresrossi9 2 года назад +1

      I love you hahahaha

    • @muslimmukhtarkhanov8194
      @muslimmukhtarkhanov8194 Год назад +1

      You should have made a powder out of rocks, that would speed up your growing. Especially if your powder of white color🤣🤣🤣

  • @gri189
    @gri189 Год назад +1

    Just recently found your channel. These are by FAR the most straight forward explanations I found so far. You sir are a godsend.

  • @hewhomustnotbenamed5912
    @hewhomustnotbenamed5912 4 года назад +4

    Added this to my useful tutorials and math playlists.
    Thanks StatQuests.

  • @amirmohamed2428
    @amirmohamed2428 3 года назад +6

    I had stats exam coming up and didn't know this particularly well, Thanks for making it much more simpler!

    • @statquest
      @statquest  3 года назад

      Good luck on the exam! :)

  • @vamanieperumal5262
    @vamanieperumal5262 4 года назад +6

    this is the best channel ever that can exist about statistics :D wonderful explanation and illustrations and the music! :) am glad I found this at the right time !

    • @statquest
      @statquest  4 года назад

      Thank you so much 😀

    • @baxtables
      @baxtables 4 года назад

      Are u glad u found it or are u asking us if u are glad??😆😆

  • @thisis7734
    @thisis7734 4 года назад +5

    Amazing explanation!!Made it very simply for me to understand!! :)
    I went through so much content for this..thank you

    • @statquest
      @statquest  4 года назад +2

      Hooray! I'm glad the video was helpful.

  • @yashshah1936
    @yashshah1936 4 года назад +6

    Josh you are the best!!! Your every video has been helpful to god knows how many times in my studies. Much much love

    • @statquest
      @statquest  4 года назад

      Thank you very much! :)

  • @ivandeetlefs
    @ivandeetlefs 2 года назад +2

    I would rather name this video VERY CLEARLY EXPLAINED. Thank you.

  • @samuelliaw951
    @samuelliaw951 2 года назад +2

    cool! you have cleared all the fogs around r2 in my head once for all. appreciate your explanation!

  • @srikanth9450
    @srikanth9450 3 года назад +4

    I have found a great channel for stats and trix.... BAM! it covers all the areas I want to learn.. Double BAM!! It's indeed clearly explained... Triple BAM!!!

  • @AdilKhan-sh9fv
    @AdilKhan-sh9fv 3 года назад +3

    This channel has become my go-to resource for anything stat related.

    • @statquest
      @statquest  3 года назад +1

      Bam! :)

    • @khanhdovanit
      @khanhdovanit 3 года назад +1

      @@statquest Love your Bam and your singing

  • @tanmaymhatre6370
    @tanmaymhatre6370 3 года назад +2

    love the way you explain things in casual manner

  • @prateeksachdeva1611
    @prateeksachdeva1611 Год назад +1

    Could not have even imagined such intuitive explanation of this topic before watching this video. Thanks Josh!

  • @ashokcgl
    @ashokcgl 2 года назад +1

    Just impeccable. I don't think any other better illustration exists other than this. Thank you

  • @sanjaykrish8719
    @sanjaykrish8719 6 лет назад +8

    Very beautifully explained. Many thanks to the folks of Genetics Department at the University of North Carolina at Chapel Hill.

  • @Love4ever1223
    @Love4ever1223 4 года назад +1

    god bless, i have been searching high and low for this kind of video. Thank you!!!!

  • @Steve-go3zp
    @Steve-go3zp 3 года назад +1

    I have been looking at a variety of stats videos and these are clearly the best. I am so impressed with StatQuest that I renamed my four dogs, "StatQuest," "StatQuest," "StatQuest," and "John Stamos" because, of course...

  • @lilmoesk899
    @lilmoesk899 7 лет назад

    Super useful as always. Please continue with the videos (for example, prediction interval vs. confidence interval or maybe p-values vs. randomization tests or logistic regression...)! I liked your explanation because it never occurred to me that R^2 was basically the same as calculating percent change (diff/original)x100.

  • @ailsasun9308
    @ailsasun9308 Год назад +1

    The introductions are the cutest thing I have ever seen - the videos are also super duper helpful!

  • @wayne02058
    @wayne02058 3 года назад +3

    a simple concept explained simply. thank you for the straight forward explanation

    • @statquest
      @statquest  3 года назад

      Thank you very much! :)

  • @rajsingh9869
    @rajsingh9869 Год назад +1

    really boom...I was confused from past 3 days to understand regression value ...now I understand. Thanks

  • @kowsergazi
    @kowsergazi 5 лет назад +1

    No one could make me understand R Squared in such easy way. Watched many videos. All made it complicated. Thanks.

    • @statquest
      @statquest  5 лет назад

      Hooray!!! I'm glad to hear the video was helpful! :)

  • @fredyrojas7884
    @fredyrojas7884 2 года назад +1

    These videos are pretty cool. I can always come back and refresh concepts.

  • @firasal-nasir1909
    @firasal-nasir1909 3 месяца назад +1

    watching this again, thank you very much. I jumped into more advanced stuff because of your videos. 🙏🙏

  • @minilana470
    @minilana470 4 года назад +8

    You are a rarity ❤️ really love how you explain statistics! Please tell us more 🙏🏻♥️♥️♥️

  • @mohdbasryt
    @mohdbasryt 3 года назад +3

    Great teachers make everything interesting! Thanks Josh

  • @thyang3999
    @thyang3999 4 года назад +2

    Thank you very much, your video is very easy way to understand, makes me want to go through the Statistics course again.

  • @mohammadhassanjafari481
    @mohammadhassanjafari481 2 года назад +1

    I did not see anyone explain the statistics better than you
    God bless you ...

  • @rabbisheryl9613
    @rabbisheryl9613 4 года назад +4

    aha! we meet again, and I thank you again!!! wow, I wish you were teaching my class!!

    • @statquest
      @statquest  4 года назад

      Ha! I'm glad my videos are so helpful. :)

  • @goodester6924
    @goodester6924 4 года назад +2

    Was struggling to understand this concept but this video explained everything!

  • @nithyashreevenkataraman3
    @nithyashreevenkataraman3 3 года назад +1

    Thank you so much, I've learned so much from you in the past week! Very grateful

    • @statquest
      @statquest  3 года назад

      Thank you very much! :)

  • @manikyar7115
    @manikyar7115 2 года назад +3

    Really enjoying your videos. Moreover everything is crystal clear and I am able to understand them. Double BAM

    • @statquest
      @statquest  2 года назад

      Hooray!!! That's great news. BAM! :)

  • @shinhyelee3169
    @shinhyelee3169 5 лет назад +4

    Ah! this is the best video explaining R squared! Thank a lot!

  • @paololara3115
    @paololara3115 3 года назад +1

    Thank you, this was a life-giver! Josh Starmer, you just might have become a part of something which will be big

  • @MrClevermind
    @MrClevermind 4 года назад +1

    Thank you, you're helping us with these videos.

    • @statquest
      @statquest  4 года назад

      Thank you very much! :)

  • @AydinCGur
    @AydinCGur Год назад +1

    Wonderful explanation again. I easily understood the concept. I'm grateful.

  • @stevenpatterson8119
    @stevenpatterson8119 Год назад +1

    This helped me clearly understand R^2. Trying to grasp this from reading a textbook was impossible for me.

  • @nicholesutter80
    @nicholesutter80 3 года назад +1

    Working on my MPA stats final and this video has been so helpful

  • @sebastianc09
    @sebastianc09 Год назад +1

    I'm truly grateful for your videos!

  • @guthrie_the_wizard
    @guthrie_the_wizard 2 года назад +1

    Thanks very much! Your videos rock. Great pacing, excellent points.

    • @statquest
      @statquest  2 года назад

      Thank you very much! :)

  • @pablo_brianese
    @pablo_brianese 3 года назад +2

    Thank you for this precious material

    • @statquest
      @statquest  3 года назад +1

      I'm glad you like it!

  • @mohitsrivastava5880
    @mohitsrivastava5880 4 года назад +1

    Thanks for the simple explanation. Much appreciated.

  • @sweetheart.nikkilee430
    @sweetheart.nikkilee430 4 года назад +2

    wow this is the best explanation ive ever seen, thank u!!!!!

    • @statquest
      @statquest  4 года назад

      Thank you very much! :)

  • @dearwriter9659
    @dearwriter9659 2 года назад +1

    Now I understand the R squared much better! Thank goodness for this video!

  • @mmhamed1
    @mmhamed1 2 года назад +1

    You know what i decided to start watching your videos from the beginnimg .. baaam .. thanks

  • @AmosFolarin
    @AmosFolarin 5 лет назад +12

    Beautiful explanation :)

  • @kumarransing8489
    @kumarransing8489 Год назад +1

    Really good explanation as to why r squared is significant in describing variation in data. Thank you!

  • @footballistaedit25
    @footballistaedit25 2 года назад +1

    Thanks for sharing, Sir. It helps me a lot

    • @statquest
      @statquest  2 года назад +1

      Glad to hear that!

  • @senwang8468
    @senwang8468 4 года назад +5

    这样的创作者请给我来一百个!thank you for your videos!I really appreciate what you have done, and look forward to seeing more of them~

    • @statquest
      @statquest  4 года назад

      Thank you very much! :)

  • @elfmas
    @elfmas 3 года назад +1

    Thanks, every question/doubt that I had instantly got answered about 10 seconds later.

  • @sudhitiwaridwivedi3096
    @sudhitiwaridwivedi3096 4 года назад +2

    Undoubtedly d best and to the point explanation. Thanks a lot

  • @anacarolinamartinelli9065
    @anacarolinamartinelli9065 2 года назад +2

    This video is absoluted amazing! R^2 and R finaly understood!

  • @Gattomorto12
    @Gattomorto12 7 лет назад

    Nice video! I'll recommend it to family and friends.

  • @samuelgrubb2976
    @samuelgrubb2976 Год назад +1

    I never comment, but today is not that day. Thank you so much for this!!!! I am in graduate school and am still struggling to understand these concepts, you're a life saver

  • @mosama22
    @mosama22 2 года назад +1

    Thank you Josh, this was really a beautiful explanation :-)

  • @emmanuelskywalker1581
    @emmanuelskywalker1581 5 лет назад +1

    AWESOME! thanks for the explanation.

    • @statquest
      @statquest  5 лет назад

      Hooray! I'm glad you like the video! :)

  • @KIKI-NJ
    @KIKI-NJ 3 года назад +1

    So proud of me because I'm watching these videos. Very very goood job thanks 😊 👍 👏

  • @jonathanlee8755
    @jonathanlee8755 4 года назад +2

    Amazing video! Thank you very much!

  • @ricardoagnelo2995
    @ricardoagnelo2995 5 лет назад +5

    Good explanation and your videos really are funny... good job

    • @statquest
      @statquest  5 лет назад +1

      Hooray! I'm glad you like the videos and my silly jokes. ;)

  • @KARAB1NAS
    @KARAB1NAS 5 лет назад

    Nice explanation. To the point.

  • @abdullahattia2491
    @abdullahattia2491 Год назад +1

    my dude I understood and I am happy
    8-year-old video is this good
    liked, subbed and thank you!

    • @statquest
      @statquest  Год назад +1

      My dude! Thank you very much! :)

  • @anilsarode6164
    @anilsarode6164 4 года назад +4

    hats off and thanks a lot you will make me cry. thanks once again.

  • @vivekmenon4118
    @vivekmenon4118 4 года назад +1

    Well explained. Thank you!!!

  • @PeterXLuo
    @PeterXLuo 3 года назад +1

    best video explaining R and R squared ever!

  • @kwanpakshing
    @kwanpakshing 3 года назад +1

    The best explaination I can find

  • @carbon273
    @carbon273 4 года назад +1

    Zedstatistics coupled with statQuest is just absolutely magnifique

  • @funnyclipsutd
    @funnyclipsutd 4 года назад +2

    After watching your videos, I aced my stats module!

    • @statquest
      @statquest  4 года назад +1

      TRIPLE BAM!!! Congratulations!!! :)

  • @toast34
    @toast34 9 месяцев назад

    Came here from the Pearson's correlation video. Thank you so much for this
    I just wish that you could show in the video:
    • how (Var(mean)-Var(line)) / Var(mean) is equal to [Covar(x,y) / (Var(x)^-2)(Var(y)^-2)]^2
    • whether (Var(mean)-Var(line)) / Var(mean) using mean and differences from the x-axis also yields the same value
    Again, thank you for the video

    • @statquest
      @statquest  9 месяцев назад +1

      I'll keep that in mind.

  • @koonsickgreen6272
    @koonsickgreen6272 Год назад +1

    Dude..this friggin rocks.. THAKS YOU!!!!!

  • @junepark9591
    @junepark9591 Год назад +1

    Beutifully explained. Thank you so much.

  • @mishtimaithli
    @mishtimaithli 2 года назад +1

    phew...!!! finally this concept is clear in my head :) thank you sooo much

  • @rashminaik6242
    @rashminaik6242 4 года назад +1

    Very well explained!! Thanks

  • @sheldonsebastian7232
    @sheldonsebastian7232 4 года назад +2

    B - E - A - utiful explanation. Thanks

  • @bachxuanquang2837
    @bachxuanquang2837 2 года назад +1

    Very clear explanation!

  • @adnanshahnawaz6808
    @adnanshahnawaz6808 Год назад +1

    "time spent sniffing a rock"! had me cracking😂.... btw thanks josh for putting such great content up... this channel is the my primary source of building my statistics foundations....

  • @m.c.4458
    @m.c.4458 3 года назад +1

    One of best videos, thanks

  • @gbchrs
    @gbchrs 2 года назад +1

    woah thanks for this one too Josh! finally gets R2

  • @tymothylim6550
    @tymothylim6550 2 года назад +1

    Great video! Thanks a lot!

  • @donfeto7636
    @donfeto7636 4 года назад +3

    Thank You Statquest your video and my knowledge of R^2 have a R^2 of 99.99

  • @isa..333
    @isa..333 Год назад +1

    this is bizarelly useful for my exam tomorrow

  • @amj.composer
    @amj.composer 2 года назад +1

    Gosh, this was SO helpful.

  • @vijaypalmanit
    @vijaypalmanit 7 лет назад +2

    one of the best explanation, thanks a lot for making this video..

  • @aamuz1cool
    @aamuz1cool 6 лет назад +9

    Adding to my previous comment , R2 value can be negative when the variance explained by the line is lesser than the variance explained by mean.
    For example var(mean) = 30 and var(line) = 40
    Then R2 = -0.3
    There exists such models , perhaps that could be worst models.

    • @statquest
      @statquest  6 лет назад +8

      This is technically correct, but practically speaking, R-squared is always positive because it is used to compare the least squares residuals for the best fitting model to the least squares residuals for the mean, and the best fitting model can't have larger residuals than the mean, otherwise the best fitting model would be the mean. Does that make sense?

    • @aamuz1cool
      @aamuz1cool 6 лет назад +1

      Completely agree with you in terms of practicality. It doesn't make sense at all. At the end of day you want a model which performs better than the base model. My point was it can be negative. Nevertheless i really like your videos. That comment of mine was just to clarify my understanding and to reach out to you.

    • @statquest
      @statquest  6 лет назад +4

      I was thinking more about the negative R-squared and how it could be used in practice. I mean, like you said, even if your model is terrible, worse than the mean, it still might be nice quantify how terrible it is - and that's where the negative R-squared could come in handy. It still has the same meaning, except now you're quantifying how much worse your model is than the mean. Interestingly, it still works out even if var(terrible model) is so bad that the R-squared is less than -1. For example, if var(mean) = 50 and var(terrible model) = 100, then R-squared = (50 - 100) / 50 = -1, so "terrible model" is 100% worse than the mean. If var(terrible model) = 150, then R-squared = (50 - 150) / 50 = -2, and now terrible model is 200% worse.

    • @aamuz1cool
      @aamuz1cool 6 лет назад +4

      Right , That's my point. From my own experience , I used to train multiple models on a sample dataset and compute their respected R-squared value to choose the best among those models. There I encountered some models returning negative R-squared value. Those models are practically useless and if you agree that happens when your training data is so huge and the algorithm you are using is so insignificant, like using a multi variant regression for a heavily skewed target variable.That was the motivation behind my comment. I appreciate your time to reply back to my comments. I am glad that it grabbed your attention Mr. Josh.

    • @spacedustpi
      @spacedustpi 4 года назад

      @@statquest I asked a question about this too and I assumed you meant the best fitting line (even though it was not explicitly stated in the video), or at least one that performed better than the mean line.

  • @yasinzamani9467
    @yasinzamani9467 5 лет назад +4

    Thank you for this easy to understand video :-)
    I have two suggestion!
    - Time 0:50 -- instead of `strongly related` it is better to say `strongly linear related`! We know that `R` can't explain nonlinear relationships (e.x. Y = X^2)!
    - Time 10:00 -- instead of `0.7^2 = 0.5` it is better to say `0.7^2 \approx (is approximately equal to) 0.5` ;-)

    • @statquest
      @statquest  5 лет назад +3

      Interestingly, and little known, but R^2 can be calculated for equations like y = a + b*x^2. That equation makes a curve, which is not linear, but the equation is _linear in its parameters_ (the parameters are 'a' and 'b', not 'x^2'), and that is what makes a "linear model" linear. A linear model doesn't have result in a straight line, but it must be linear in its parameters. That means you can calculate R^2 for y = a + b*x^2 or even y = a + b*sin(x). Not many people know this though since they don't understand what the "linear" in "linear models" actually refers to.

    • @yasinzamani9467
      @yasinzamani9467 5 лет назад +2

      Yes, and in y = a + b*x^2 or y = a + b*sin(x) it is better to say `y` has a linear relationship with `x^2` or `sin(x)`, not `x`!

  • @DiegoMachida
    @DiegoMachida 2 года назад +1

    You goddamm beautiful man, Im eating your videos like candy nowadays, Im finishing an electrical and comms engineering degree and working with some computer science and I usually get hammered with statistical questions when I finish presenting my models, thanks to your uploads i've held my own against some nasty expert old timers, thank you for this.

    • @statquest
      @statquest  2 года назад

      Awesome! TRIPLE BAM! :)

  • @intfxdx
    @intfxdx 2 года назад +1

    Awesome explanation

  • @beenasfarastodecidetouseve6733
    @beenasfarastodecidetouseve6733 5 лет назад +2

    Mr StatQuest I love you

  • @xiaochengjin6478
    @xiaochengjin6478 5 лет назад +1

    love your videos!

  • @RAJIBLOCHANDAS
    @RAJIBLOCHANDAS 2 года назад +1

    Nice explanation!

  • @shivamdwivedi7723
    @shivamdwivedi7723 2 года назад +1

    Really awesome lecture