R-squared or coefficient of determination | Regression | Probability and Statistics | Khan Academy

Поделиться
HTML-код
  • Опубликовано: 27 янв 2025

Комментарии • 147

  • @alleyygal
    @alleyygal 11 лет назад +113

    I think I've fallen in love with Khan. We've been through everything together; Statistics, Chemistry, you name it.

  • @oscarbergman7173
    @oscarbergman7173 9 лет назад +396

    Khan "Let me do this in an other color" Academy

    • @Cash4Fruit
      @Cash4Fruit 8 лет назад +2

      XD

    • @daksh_sagar
      @daksh_sagar 7 лет назад +2

      This is EPIC !

    • @youtubeaccount0x073
      @youtubeaccount0x073 5 лет назад +4

      Oscar Bergman another*

    • @NM-vw6xq
      @NM-vw6xq 4 года назад +1

      Lol all joking aside, the colors help visualize it a lot better since there's usually a ton of stuff up on the "board" by the end of the video.

  • @VibrationAlignment
    @VibrationAlignment 12 лет назад +109

    and after all of the money that I spent on education... I just learned more without any pants on at home.

    • @10thpass93
      @10thpass93 4 года назад +1

      hy i am from india,i also maths teacher

    • @ayushsoni786
      @ayushsoni786 4 года назад +7

      @@10thpass93 to?? kya??

    • @bennyblanco3322
      @bennyblanco3322 3 года назад

      You watched video after your wedding night

  • @NM-vw6xq
    @NM-vw6xq 4 года назад +28

    Thank you Sal! This was the best explanation of the R squared error I've seen ... makes a LOT more sense than just machine learning courses saying "its a percentage of how much it explains the variability of the data". This shows why that is, brilliant. Such a simple concept when broken down and explained visually step-by-step. You and 3B1B (also a former Khan Academy educator) are amongst the best instructors on YT.

    • @satishchandramedi9729
      @satishchandramedi9729 3 года назад

      Had you watched Applied AI course video on R squared error and came here to understand it in depth?

  • @angelbythewings
    @angelbythewings 3 года назад +3

    Sal makes no presumptions about students coming to watch his lectures, and that's what has been helping me out. I have never met an educator better than him in my entire life, PERIOD

  • @benjamcg
    @benjamcg 2 года назад +3

    1:03 What defines the error?
    1:20 What defines total square error?
    1:37 How y = (mx+b) relates to the error.
    2:55 "What % of the variation in y describes the variation in x?
    6:30 What defines the squared error of the mean of y? (end)
    6:59 What is not described by the variation in x?
    12:38 What is the coefficient of determination, aka R²? (end)
    10:23 What does R² say about the regression line?

  • @mevlutsoylu7481
    @mevlutsoylu7481 8 лет назад +53

    This video "literally" saved my butt. Thank you very much.

  • @chocolatebubbles91
    @chocolatebubbles91 12 лет назад +2

    thank god for him because my own math teacher has absolutely no idea what he's teaching. he's not a statistician, it's his first year teaching stats, so i don't blame him.. but i'd be lost without sal & his vids. thanks again!

    • @nocturnalvisionmusic
      @nocturnalvisionmusic 2 года назад

      Don't think I've ever heard the term *statistician* before. Thanks :D

  • @georgedestino4788
    @georgedestino4788 3 года назад +2

    9:58 r squared is the Coefficient of determination

  • @coconutjoy
    @coconutjoy 12 лет назад +2

    Best explanation ever, imagine how many students around the world you have enriched!

  • @jerilphilip4835
    @jerilphilip4835 4 года назад +4

    Sal the saviour of pilgrims seeking conceptual clarity, as always.

  • @RajvimalCrystal
    @RajvimalCrystal 6 лет назад +1

    Khan you are prodigy!!! only prodigy can teach as clear as crystal

  • @yusufnzm
    @yusufnzm 2 года назад +2

    Why sesubline/sesuby is "What % of variation is not described by the Regression line " instead of "described by"?

    • @19_088SHOVONSHARMA
      @19_088SHOVONSHARMA Год назад

      buddy,i had this problem too. but think it like this.when we talk about the "proportion of unexplained variation," we mean the fraction of the total variability that is not captured or explained by the regression model which means the error or SSE(more fancy term XD). hope this helps u.

  • @fridaylovebird
    @fridaylovebird 12 лет назад +3

    Note potential confusion: "SE" in this video is the (sum of the) Squared Errors. However, usually "s.e." is standard error. Since, technically, we NEVER observe errors (because they are a function of the true regression line--we can estimate residuals, though), what is called "SE" here is actually the Sum of the Squared Residuals (SSR in some texts). I am torn whether to recommend this video to my stats students because of this likely confusion.

    • @numericalmethodsguy
      @numericalmethodsguy 6 лет назад

      It is the sum of the squares of the residuals, not errors.

  • @skulqerX
    @skulqerX 3 года назад +2

    Mr khan , youve been with me since i was at secondary school . And now im trying to understand how to do my papers and statistics, suprised to still see there is so much i can learn from you . Thank you for all of ur lectures.

    • @shubhanshuagarwal901
      @shubhanshuagarwal901 3 года назад

      Can u explain what is the meaning of line " % not explained by regression line" as that ratio regression/mean error

  • @dychui
    @dychui 12 лет назад +1

    Khan you are my statistics hero!

  • @rushikeshkalkar5162
    @rushikeshkalkar5162 2 года назад

    Sal Sir = LEGEND🙌

  • @andattam
    @andattam 6 лет назад +29

    When you say " described by the variation in X," you should just write "described by the line," rather than X. It's really confusing for the rest of the video for those who didn't catch that.

    • @henryksarat
      @henryksarat 5 лет назад +2

      An Luu this comment helped a lot thank you

    • @10thpass93
      @10thpass93 4 года назад +1

      i am maths teacher at india. i want to help students in maths subject

  • @goodnesskabelo
    @goodnesskabelo 5 лет назад +1

    I wish everyone else would like and comment after enjoying a video, these likes and comments motivate us when we are in panic to quickly click into the video and cram before exam

  • @dychui
    @dychui 11 лет назад +8

    Great video. I really appreciate it.

  • @Drew0Ranger
    @Drew0Ranger 2 года назад

    I am glad I watched this, I am glad...glad..I am glad I watched this.

  • @totalinsight9888
    @totalinsight9888 7 лет назад +4

    Wonderful Mr. Khan. You have made this problematic concept seem so simple and 'intuitive'. Keep up this GOOD WORK.

  • @NirmalKumar-sc5tk
    @NirmalKumar-sc5tk 5 лет назад +2

    Thank you Khan Academy for clearing my doubts in so many topics.

  • @SandeepAcademy9
    @SandeepAcademy9 11 лет назад +1

    Can anyone tell me, How SEline (total square error) gives the total variation not described by regression line?
    Also, How does the regression line describe the variation in x?

  • @xgutierrezc
    @xgutierrezc 5 лет назад +2

    I loved the explanation. Came here trying to understand the r2_score function in Python.

  • @shehlamalik5392
    @shehlamalik5392 3 года назад

    Your voice is so sweet.

  • @ibrahimbarro2814
    @ibrahimbarro2814 4 года назад

    Man this saved my life. Thanks so much

  • @anesdj4282
    @anesdj4282 10 лет назад +3

    There is no easier way to explain regression as simple as that. thanks

  • @bhavindhedhi
    @bhavindhedhi 6 лет назад

    The error that you are referring to [ 0:58] is really residuals, right? just clearing things up.

  • @stephenaloia6695
    @stephenaloia6695 3 года назад

    Brilliantly simple and straightforward. So good.

  • @innovation2440
    @innovation2440 8 лет назад +14

    What is "variation described by the line" mean? Can someone help explain please..

    • @李映潔-l1w
      @李映潔-l1w 8 лет назад +1

      Inno Vation needing this also

    • @spacedustpi
      @spacedustpi 7 лет назад +1

      He is referring to the diagonal line. Often called the regression line. Represented by mx+b. The variation of the line is equal to the sum of all the (y-(mx + b))^2 equations he wrote up there in the video.

    • @Luiferhoyos
      @Luiferhoyos 7 лет назад +23

      If all of the points were exactly on the regression line, then the regression line would explain all of the variation. In other words, SE(line) would be zero. Anywhere that the points do NOT lie on the line is variation that is undescribed by the regression.
      For example, a regression line might show that for each hour a student studies, he gets 10% higher on a test. 5 hours earns a 50, 6 hours earns a 60, 7 hours earns a 70. If a student studied for 8 hours and only got a 75, that variation between 75 and the expected 80 would not be described by the regression line and would add to the SE(line).
      Taken from this comment: www.khanacademy.org/video/r-squared-or-coefficient-of-determination?qa_expand_key=kaencrypted_c228fca3982c98caf8466db5f21d3293_97ad4ce3e39b15af2b59bffb1589cc545a0a13b897657bf6c9694237018a45585fa77017167815c3fb5868e6cd97f5a15499b0ead13d4f8bdae6f9823d7f95f6f9c067fbf53f053b72e5221cd3341c5f842cbd96c5140b0868e8e68a46855aa5af32a4cc1fe19ffd136df5b5a492e8a83fd3a28e20965323f00c242a29d98de4c9bf954e71ca493b1e661f7d719a521c5a74b8710cac5e6b8b9f268a9170fad433006065db869f0da574bb11fdd142868bff9ffd8014d6433eebdcfc59aec0e9

    • @razorblake393
      @razorblake393 4 года назад +2

      How far the point are from the line

  • @PetStuBa
    @PetStuBa 6 лет назад +3

    I think it's n-1 ... not n ... there are n-1 degrees of freedom .. (5:57)

    • @mindloop4070
      @mindloop4070 2 года назад

      he is not calculating variance here just to compare with population..just a avearage Squared error so n makes sense

    • @PetStuBa
      @PetStuBa 2 года назад

      @@mindloop4070 yeah good point

  • @sayak
    @sayak 4 года назад

    thanks this was clear explanation

  • @boredomgotmehere
    @boredomgotmehere Год назад

    Always enjoy the explanations of Khan Academy. Precise, succinct and. Intuitively explained.

  • @TriNguyen-xi8ji
    @TriNguyen-xi8ji 11 лет назад +2

    I love you. you saved my life, keep up the good work :D

  • @pradeepkumar9586
    @pradeepkumar9586 6 лет назад

    khan acadamy rocks..

  • @bryanporinski5308
    @bryanporinski5308 4 года назад

    Brilliant teaching!

  • @UZPvNUCaaQdF
    @UZPvNUCaaQdF Год назад

    I love you, man.

  • @ansariarbaz3374
    @ansariarbaz3374 4 года назад

    just one word, awesome!!

  • @keinsent
    @keinsent 9 месяцев назад

    THANKYOU

  • @GThinks
    @GThinks 9 лет назад

    what do "the variation of y" and "the variation of y described by the variation of x" mean again.. ?

  • @NamalIndrajith-MCP
    @NamalIndrajith-MCP 4 года назад

    superb explanation, understood well

  • @ChaceBonanno
    @ChaceBonanno 4 года назад

    Gotta get those Ms and Bs

  • @familycohen4758
    @familycohen4758 2 года назад

    Wonderful explanation. Thank you so much!

  • @cajantomato6497
    @cajantomato6497 11 лет назад +1

    thanks a lot. this really helped

  • @studyselection2881
    @studyselection2881 2 года назад

    Good channel.

  • @Chris-ng9zi
    @Chris-ng9zi 2 года назад +3

    Excellent explanation! But why is the Coefficient of Determination equal to r _squared vs some other quantity? Why not r- cubed or ( r_squared -1)?

  • @AsgharKhan-fd1ul
    @AsgharKhan-fd1ul 12 лет назад

    KHAN IS KING

  • @atuntalib9474
    @atuntalib9474 8 лет назад +3

    i dont hv basic in statistic but need to understand this for managerial economic. what do you mean by mean-of-y? is that the average for all the Y?

  • @razorblake393
    @razorblake393 4 года назад +1

    What do you mean by "not described by the variation in x"

  • @nglazgow
    @nglazgow 5 лет назад

    Thanks!

  • @MeghaShankarArt
    @MeghaShankarArt 5 лет назад +1

    Thankyou so much for this video! Wonderfully explained!

  • @rodaf2
    @rodaf2 10 лет назад +1

    Minor correction: it should be 'fraction' instead of 'percentage' at several parts of the video.

  • @JuSpeJule
    @JuSpeJule 9 лет назад +2

    Oh my god I finally got this! Thank you so much. x

  • @andrerossa8553
    @andrerossa8553 5 лет назад

    Great video. Tks

  • @jaskaransingh0304
    @jaskaransingh0304 Год назад

    Just awesome

  • @serenalizinnqui8474
    @serenalizinnqui8474 4 года назад

    How do we know that the amount of variation in y not caused by variation in x is SE_line -SE_ymean? What does the difference in errors between the two lines have to do with anything?

  • @chris7toronto
    @chris7toronto 14 лет назад

    You are the best!

  • @produdeyay
    @produdeyay 4 года назад

    I don't understand the need to interchange "describe by variation in X" and "describe by the line", are they the same thing?

  • @yangzhou981
    @yangzhou981 10 лет назад

    great video,many thanks

  • @riderblack6401
    @riderblack6401 7 лет назад

    Great video! Thanks sal!

  • @pdwivedi1990
    @pdwivedi1990 10 лет назад

    Great tutorial..

  • @SimAlgyaxSaTun
    @SimAlgyaxSaTun 3 года назад

    Sal is the GOAT

  • @changw_w9413
    @changw_w9413 7 месяцев назад

    Why "how much of total variation is not described by the regression line" = SE(line)? Isn't SE(line) describes the total variation for the regression line? I'm confused by this sentence? Could anyone help to enlighten me? Thanks :)

  • @stefanobourscheid8255
    @stefanobourscheid8255 7 лет назад +2

    I read somewhere that Rˆ2 may be negative, I don't understand how since it is essentially a %

    • @FoxxMulderr
      @FoxxMulderr 7 лет назад +2

      that's an adjusted or predicted R2, because you can't have negative with regular R2.

  • @annashchukina3768
    @annashchukina3768 5 лет назад +1

    Thank you for video, it was really helpful. I have a question.
    What if we have data, which we approximated well with a line L, but L is almost horizontal and close to y=mean(y). The ratio of SE(line) to SE(y) is supposed to be close to 1, thus r^2 is close to 0, although L is a good fit.

  • @ernestamoore4385
    @ernestamoore4385 8 лет назад +1

    How do you estimate the significance of the betas?

  • @mdnguyen287
    @mdnguyen287 13 лет назад

    thank you,it is extremely easy to understand what you are saying, absolutely different from my lecture notes. why can't you be my professor? :)

  • @aayishatabassum9174
    @aayishatabassum9174 10 месяцев назад

    what does variation in x mean exactly ?

  • @jgeorgiou94
    @jgeorgiou94 9 лет назад +87

    this guy must have repetitive strain injury by now repetitive strain injury by now. by now.

    • @junehurt4897
      @junehurt4897 9 лет назад +3

      +Jonathan Georgiou Makes it harder and harder for me to watch Khan Academy math videos. They are good at explaining, but the lecturer's repetitive phrases drive me nuts!

    • @carlos_f
      @carlos_f 8 лет назад +46

      maybe there's people watching these videos that aren't as fluent in english as you are mate so by repeating phrases those peolpe might have another chance to catch the meaning of the phrase.. my 2c

    • @riderblack6401
      @riderblack6401 7 лет назад +3

      It's not his problem, sometimes it is that you don't understand the math.

    • @jazzhopkid
      @jazzhopkid 7 лет назад +5

      Repetition is key to memory.

  • @sera_venus
    @sera_venus 6 лет назад

    maybe i should have studied for my midterm this week

  • @mawais2560
    @mawais2560 4 года назад

    what are possible interpretations and justifications for low r square values in management sceince?

  • @shannonmccosky6258
    @shannonmccosky6258 8 лет назад

    What program is used to take these notes?

  • @sierraclark6129
    @sierraclark6129 3 года назад +1

    “If you declare with your mouth “Jesus is Lord,” and believe in your heart that God raised Him from the dead, you will be saved” (Romans 10:9). Now is the time to accept Jesus as your personal Lord and Savior. Obey His commands and repent of your sins because Jesus is coming back soon. Tomorrow isn’t promised.

  • @rutger5000
    @rutger5000 5 лет назад

    Just checking you can't get postive R^2 values higher than 1 right?

  • @willsonbasyal7883
    @willsonbasyal7883 2 года назад +1

    khaaaaaaaaaannnnnn

  • @mishuga
    @mishuga 14 лет назад

    I'm with v2prc. This is the first statement in the playlist that has me confused. Help would be much appreciated.

  • @Lucyferandtheson003
    @Lucyferandtheson003 2 года назад

    How did he estimate the regression line?? Did he just guess? Or he used some technique?

  • @adam2073
    @adam2073 6 лет назад +1

    Why does he refer to the "variation of X" and the "the line" as the same thing ?

  • @santiagorozo2530
    @santiagorozo2530 5 лет назад +2

    If you play this a 1.5x speed he sounds like Jeff Goldblum

  • @user-zr8vc8nw7e
    @user-zr8vc8nw7e Год назад

    What is it meant by "describe"? "percentage of variation described by percentage of error"?" DESCRIBED"?! what's that mean?

  • @harshitjain7428
    @harshitjain7428 4 года назад

    that R is pearson's coefficient right? don't we denote that with 'r' instead of 'R'?

  • @hectichive889
    @hectichive889 4 года назад

    I still don’t understand why in the world you would use the mean of y instead of just the residual from the regression line?

  • @bhoomeendra
    @bhoomeendra 6 лет назад +1

    GREAT EXPLANATION !! the concept is very clean in head now

  • @t2hmn
    @t2hmn 13 лет назад

    @hapolian He is probably writing with a tool called graphic tablet.

  • @kn0wled9e
    @kn0wled9e 13 лет назад

    thank u so much sir....u r awsome

    • @unhealthyavacado
      @unhealthyavacado 3 года назад

      dude it's been 10 years since you wrote this. are you still alive :D

  • @fusionplaysgames
    @fusionplaysgames 11 месяцев назад

    Me learning after 13 years

  • @vana-t7u
    @vana-t7u 2 года назад

    love you

  • @bunnybunny1988
    @bunnybunny1988 7 лет назад

    how to write on the computer screen like this?

  • @jianzhao8986
    @jianzhao8986 8 лет назад

    What's the relationship btw r and r squared?

  • @wencerich4550
    @wencerich4550 4 года назад

    I don't know where 30% obtain and 1 in the formula for sure am not understand

  • @MrMr-en8fo
    @MrMr-en8fo 5 лет назад

    lecture Urdu ma dayn smjh n ari

  • @ldpy33
    @ldpy33 4 года назад

    So, they both tell us how strong the relationship is between two variables?!

    • @ldpy33
      @ldpy33 4 года назад

      Here's the question: IS THE ANSWER E?
      Which of the following tells us how strong the relationship is between two variables?
      a) the slope of a line
      b) the intercept of a line
      c) the coefficient of determination
      d) the coefficient of correlation
      e) both C and D are correct

  • @ansonlai9153
    @ansonlai9153 5 лет назад

    why are you so good in explaining like the how the fk...

  • @sunilguptamaths
    @sunilguptamaths 4 года назад

    Hy

  • @Himashree_333
    @Himashree_333 3 года назад

    That's not error that's residual

  • @markchristophergemzon1052
    @markchristophergemzon1052 4 года назад

    Who is here because of machine learning?

  • @ajufsd
    @ajufsd 10 лет назад

    Thank you.. but I badly need the notes just made :p

  • @kanishka.b8550
    @kanishka.b8550 3 года назад

    730K views and 2.3K likes? 0.o wut?

  • @chocolatebubbles91
    @chocolatebubbles91 12 лет назад

    he's married. so yeah.. i guess he does.