R-squared or coefficient of determination | Regression | Probability and Statistics | Khan Academy

Поделиться
HTML-код
  • Опубликовано: 23 окт 2024

Комментарии • 148

  • @alleyygal
    @alleyygal 11 лет назад +102

    I think I've fallen in love with Khan. We've been through everything together; Statistics, Chemistry, you name it.

  • @MrKevlax
    @MrKevlax 11 лет назад +101

    and after all of the money that I spent on education... I just learned more without any pants on at home.

    • @10thpass93
      @10thpass93 4 года назад +1

      hy i am from india,i also maths teacher

    • @ayushsoni786
      @ayushsoni786 4 года назад +7

      @@10thpass93 to?? kya??

    • @bennyblanco3322
      @bennyblanco3322 2 года назад

      You watched video after your wedding night

  • @oscarbergman7173
    @oscarbergman7173 8 лет назад +392

    Khan "Let me do this in an other color" Academy

    • @Cash4Fruit
      @Cash4Fruit 7 лет назад +2

      XD

    • @dakshsagar2839
      @dakshsagar2839 7 лет назад +2

      This is EPIC !

    • @youtubeaccount0x073
      @youtubeaccount0x073 4 года назад +3

      Oscar Bergman another*

    • @NM-vw6xq
      @NM-vw6xq 3 года назад +1

      Lol all joking aside, the colors help visualize it a lot better since there's usually a ton of stuff up on the "board" by the end of the video.

  • @NM-vw6xq
    @NM-vw6xq 3 года назад +25

    Thank you Sal! This was the best explanation of the R squared error I've seen ... makes a LOT more sense than just machine learning courses saying "its a percentage of how much it explains the variability of the data". This shows why that is, brilliant. Such a simple concept when broken down and explained visually step-by-step. You and 3B1B (also a former Khan Academy educator) are amongst the best instructors on YT.

    • @satishchandramedi9729
      @satishchandramedi9729 3 года назад

      Had you watched Applied AI course video on R squared error and came here to understand it in depth?

  • @mevlutsoylu7481
    @mevlutsoylu7481 8 лет назад +52

    This video "literally" saved my butt. Thank you very much.

  • @angelbythewings
    @angelbythewings 3 года назад +3

    Sal makes no presumptions about students coming to watch his lectures, and that's what has been helping me out. I have never met an educator better than him in my entire life, PERIOD

  • @benjamcg
    @benjamcg 2 года назад +2

    1:03 What defines the error?
    1:20 What defines total square error?
    1:37 How y = (mx+b) relates to the error.
    2:55 "What % of the variation in y describes the variation in x?
    6:30 What defines the squared error of the mean of y? (end)
    6:59 What is not described by the variation in x?
    12:38 What is the coefficient of determination, aka R²? (end)
    10:23 What does R² say about the regression line?

  • @chocolatebubbles91
    @chocolatebubbles91 12 лет назад +2

    thank god for him because my own math teacher has absolutely no idea what he's teaching. he's not a statistician, it's his first year teaching stats, so i don't blame him.. but i'd be lost without sal & his vids. thanks again!

    • @nocturnalvisionmusic
      @nocturnalvisionmusic 2 года назад

      Don't think I've ever heard the term *statistician* before. Thanks :D

  • @skulqerX
    @skulqerX 3 года назад +2

    Mr khan , youve been with me since i was at secondary school . And now im trying to understand how to do my papers and statistics, suprised to still see there is so much i can learn from you . Thank you for all of ur lectures.

    • @shubhanshuagarwal901
      @shubhanshuagarwal901 3 года назад

      Can u explain what is the meaning of line " % not explained by regression line" as that ratio regression/mean error

  • @RajvimalCrystal
    @RajvimalCrystal 6 лет назад +1

    Khan you are prodigy!!! only prodigy can teach as clear as crystal

  • @coconutjoy
    @coconutjoy 11 лет назад +1

    Best explanation ever, imagine how many students around the world you have enriched!

  • @NirmalKumar-sc5tk
    @NirmalKumar-sc5tk 5 лет назад +2

    Thank you Khan Academy for clearing my doubts in so many topics.

  • @boredomgotmehere
    @boredomgotmehere Год назад

    Always enjoy the explanations of Khan Academy. Precise, succinct and. Intuitively explained.

  • @Drew0Ranger
    @Drew0Ranger 2 года назад

    I am glad I watched this, I am glad...glad..I am glad I watched this.

  • @fridaylovebird
    @fridaylovebird 11 лет назад +3

    Note potential confusion: "SE" in this video is the (sum of the) Squared Errors. However, usually "s.e." is standard error. Since, technically, we NEVER observe errors (because they are a function of the true regression line--we can estimate residuals, though), what is called "SE" here is actually the Sum of the Squared Residuals (SSR in some texts). I am torn whether to recommend this video to my stats students because of this likely confusion.

    • @numericalmethodsguy
      @numericalmethodsguy 6 лет назад

      It is the sum of the squares of the residuals, not errors.

  • @totalinsight9888
    @totalinsight9888 7 лет назад +4

    Wonderful Mr. Khan. You have made this problematic concept seem so simple and 'intuitive'. Keep up this GOOD WORK.

  • @georgedestino4788
    @georgedestino4788 3 года назад +2

    9:58 r squared is the Coefficient of determination

  • @stephenaloia6695
    @stephenaloia6695 3 года назад

    Brilliantly simple and straightforward. So good.

  • @dychui
    @dychui 11 лет назад +1

    Khan you are my statistics hero!

  • @familycohen4758
    @familycohen4758 2 года назад

    Wonderful explanation. Thank you so much!

  • @jerilphilip4835
    @jerilphilip4835 4 года назад +4

    Sal the saviour of pilgrims seeking conceptual clarity, as always.

  • @sayak
    @sayak 3 года назад

    thanks this was clear explanation

  • @dychui
    @dychui 10 лет назад +8

    Great video. I really appreciate it.

  • @ibrahimbarro2814
    @ibrahimbarro2814 4 года назад

    Man this saved my life. Thanks so much

  • @MeghaShankarArt
    @MeghaShankarArt 5 лет назад +1

    Thankyou so much for this video! Wonderfully explained!

  • @xgutierrezc
    @xgutierrezc 4 года назад +2

    I loved the explanation. Came here trying to understand the r2_score function in Python.

  • @goodnesskabelo
    @goodnesskabelo 5 лет назад +1

    I wish everyone else would like and comment after enjoying a video, these likes and comments motivate us when we are in panic to quickly click into the video and cram before exam

  • @andattam
    @andattam 6 лет назад +29

    When you say " described by the variation in X," you should just write "described by the line," rather than X. It's really confusing for the rest of the video for those who didn't catch that.

    • @henryksarat
      @henryksarat 5 лет назад +2

      An Luu this comment helped a lot thank you

    • @10thpass93
      @10thpass93 4 года назад +1

      i am maths teacher at india. i want to help students in maths subject

  • @NamalIndrajith-MCP
    @NamalIndrajith-MCP 4 года назад

    superb explanation, understood well

  • @keinsent
    @keinsent 6 месяцев назад

    THANKYOU

  • @bryanporinski5308
    @bryanporinski5308 4 года назад

    Brilliant teaching!

  • @Chris-ng9zi
    @Chris-ng9zi Год назад +3

    Excellent explanation! But why is the Coefficient of Determination equal to r _squared vs some other quantity? Why not r- cubed or ( r_squared -1)?

  • @shehlamalik5392
    @shehlamalik5392 3 года назад

    Your voice is so sweet.

  • @ansariarbaz3374
    @ansariarbaz3374 3 года назад

    just one word, awesome!!

  • @yusufnzm
    @yusufnzm 2 года назад +2

    Why sesubline/sesuby is "What % of variation is not described by the Regression line " instead of "described by"?

    • @19_088SHOVONSHARMA
      @19_088SHOVONSHARMA 10 месяцев назад

      buddy,i had this problem too. but think it like this.when we talk about the "proportion of unexplained variation," we mean the fraction of the total variability that is not captured or explained by the regression model which means the error or SSE(more fancy term XD). hope this helps u.

  • @andrerossa8553
    @andrerossa8553 5 лет назад

    Great video. Tks

  • @anesdj4282
    @anesdj4282 9 лет назад +3

    There is no easier way to explain regression as simple as that. thanks

  • @yileiwu4040
    @yileiwu4040 6 лет назад +2

    Thanks,I really appreciate it.

  • @jaskaransingh0304
    @jaskaransingh0304 Год назад

    Just awesome

  • @kennguyen1066
    @kennguyen1066 5 лет назад

    Very nicely explained!

  • @nglazgow
    @nglazgow 5 лет назад

    Thanks!

  • @studyselection2881
    @studyselection2881 2 года назад

    Good channel.

  • @yangzhou981
    @yangzhou981 10 лет назад

    great video,many thanks

  • @UZPvNUCaaQdF
    @UZPvNUCaaQdF Год назад

    I love you, man.

  • @annashchukina3768
    @annashchukina3768 5 лет назад +1

    Thank you for video, it was really helpful. I have a question.
    What if we have data, which we approximated well with a line L, but L is almost horizontal and close to y=mean(y). The ratio of SE(line) to SE(y) is supposed to be close to 1, thus r^2 is close to 0, although L is a good fit.

  • @rushikeshkalkar5162
    @rushikeshkalkar5162 Год назад

    Sal Sir = LEGEND🙌

  • @bhoomeendra
    @bhoomeendra 6 лет назад +1

    GREAT EXPLANATION !! the concept is very clean in head now

  • @serenalizinnqui8474
    @serenalizinnqui8474 3 года назад

    How do we know that the amount of variation in y not caused by variation in x is SE_line -SE_ymean? What does the difference in errors between the two lines have to do with anything?

  • @innovation2440
    @innovation2440 8 лет назад +14

    What is "variation described by the line" mean? Can someone help explain please..

    • @李映潔-l1w
      @李映潔-l1w 8 лет назад +1

      Inno Vation needing this also

    • @spacedustpi
      @spacedustpi 6 лет назад +1

      He is referring to the diagonal line. Often called the regression line. Represented by mx+b. The variation of the line is equal to the sum of all the (y-(mx + b))^2 equations he wrote up there in the video.

    • @Luiferhoyos
      @Luiferhoyos 6 лет назад +23

      If all of the points were exactly on the regression line, then the regression line would explain all of the variation. In other words, SE(line) would be zero. Anywhere that the points do NOT lie on the line is variation that is undescribed by the regression.
      For example, a regression line might show that for each hour a student studies, he gets 10% higher on a test. 5 hours earns a 50, 6 hours earns a 60, 7 hours earns a 70. If a student studied for 8 hours and only got a 75, that variation between 75 and the expected 80 would not be described by the regression line and would add to the SE(line).
      Taken from this comment: www.khanacademy.org/video/r-squared-or-coefficient-of-determination?qa_expand_key=kaencrypted_c228fca3982c98caf8466db5f21d3293_97ad4ce3e39b15af2b59bffb1589cc545a0a13b897657bf6c9694237018a45585fa77017167815c3fb5868e6cd97f5a15499b0ead13d4f8bdae6f9823d7f95f6f9c067fbf53f053b72e5221cd3341c5f842cbd96c5140b0868e8e68a46855aa5af32a4cc1fe19ffd136df5b5a492e8a83fd3a28e20965323f00c242a29d98de4c9bf954e71ca493b1e661f7d719a521c5a74b8710cac5e6b8b9f268a9170fad433006065db869f0da574bb11fdd142868bff9ffd8014d6433eebdcfc59aec0e9

    • @razorblake393
      @razorblake393 4 года назад +2

      How far the point are from the line

  • @chris7toronto
    @chris7toronto 14 лет назад

    You are the best!

  • @riderblack6401
    @riderblack6401 6 лет назад

    Great video! Thanks sal!

  • @pdwivedi1990
    @pdwivedi1990 9 лет назад

    Great tutorial..

  • @razorblake393
    @razorblake393 4 года назад +1

    What do you mean by "not described by the variation in x"

  • @pradeepkumar9586
    @pradeepkumar9586 6 лет назад

    khan acadamy rocks..

  • @JuSpeJule
    @JuSpeJule 8 лет назад +2

    Oh my god I finally got this! Thank you so much. x

  • @atuntalib9474
    @atuntalib9474 8 лет назад +3

    i dont hv basic in statistic but need to understand this for managerial economic. what do you mean by mean-of-y? is that the average for all the Y?

  • @produdeyay
    @produdeyay 4 года назад

    I don't understand the need to interchange "describe by variation in X" and "describe by the line", are they the same thing?

  • @bhavindhedhi
    @bhavindhedhi 6 лет назад

    The error that you are referring to [ 0:58] is really residuals, right? just clearing things up.

  • @changw_w9413
    @changw_w9413 4 месяца назад

    Why "how much of total variation is not described by the regression line" = SE(line)? Isn't SE(line) describes the total variation for the regression line? I'm confused by this sentence? Could anyone help to enlighten me? Thanks :)

  • @stefanobourscheid8255
    @stefanobourscheid8255 7 лет назад +2

    I read somewhere that Rˆ2 may be negative, I don't understand how since it is essentially a %

    • @FoxxMulderr
      @FoxxMulderr 7 лет назад +2

      that's an adjusted or predicted R2, because you can't have negative with regular R2.

  • @rodaf2
    @rodaf2 9 лет назад +1

    Minor correction: it should be 'fraction' instead of 'percentage' at several parts of the video.

  • @AsgharKhan-fd1ul
    @AsgharKhan-fd1ul 12 лет назад

    KHAN IS KING

  • @ChaceBonanno
    @ChaceBonanno 4 года назад

    Gotta get those Ms and Bs

  • @rutger5000
    @rutger5000 5 лет назад

    Just checking you can't get postive R^2 values higher than 1 right?

  • @TriNguyen-xi8ji
    @TriNguyen-xi8ji 11 лет назад +2

    I love you. you saved my life, keep up the good work :D

  • @Lucyferandtheson003
    @Lucyferandtheson003 Год назад

    How did he estimate the regression line?? Did he just guess? Or he used some technique?

  • @SandeepAcademy9
    @SandeepAcademy9 11 лет назад +1

    Can anyone tell me, How SEline (total square error) gives the total variation not described by regression line?
    Also, How does the regression line describe the variation in x?

  • @GThinks
    @GThinks 9 лет назад

    what do "the variation of y" and "the variation of y described by the variation of x" mean again.. ?

  • @aayishatabassum9174
    @aayishatabassum9174 7 месяцев назад

    what does variation in x mean exactly ?

  • @mawais2560
    @mawais2560 4 года назад

    what are possible interpretations and justifications for low r square values in management sceince?

  • @ernestamoore4385
    @ernestamoore4385 8 лет назад +1

    How do you estimate the significance of the betas?

  • @sera_venus
    @sera_venus 5 лет назад

    maybe i should have studied for my midterm this week

  • @mdnguyen287
    @mdnguyen287 13 лет назад

    thank you,it is extremely easy to understand what you are saying, absolutely different from my lecture notes. why can't you be my professor? :)

  • @kn0wled9e
    @kn0wled9e 13 лет назад

    thank u so much sir....u r awsome

    • @unhealthyavacado
      @unhealthyavacado 2 года назад

      dude it's been 10 years since you wrote this. are you still alive :D

  • @user-zr8vc8nw7e
    @user-zr8vc8nw7e Год назад

    What is it meant by "describe"? "percentage of variation described by percentage of error"?" DESCRIBED"?! what's that mean?

  • @jgeorgiou94
    @jgeorgiou94 9 лет назад +87

    this guy must have repetitive strain injury by now repetitive strain injury by now. by now.

    • @junehurt4897
      @junehurt4897 8 лет назад +3

      +Jonathan Georgiou Makes it harder and harder for me to watch Khan Academy math videos. They are good at explaining, but the lecturer's repetitive phrases drive me nuts!

    • @carlos_f
      @carlos_f 8 лет назад +46

      maybe there's people watching these videos that aren't as fluent in english as you are mate so by repeating phrases those peolpe might have another chance to catch the meaning of the phrase.. my 2c

    • @riderblack6401
      @riderblack6401 6 лет назад +3

      It's not his problem, sometimes it is that you don't understand the math.

    • @jazzhopkid
      @jazzhopkid 6 лет назад +5

      Repetition is key to memory.

  • @adam2073
    @adam2073 5 лет назад +1

    Why does he refer to the "variation of X" and the "the line" as the same thing ?

  • @harshitjain7428
    @harshitjain7428 4 года назад

    that R is pearson's coefficient right? don't we denote that with 'r' instead of 'R'?

  • @hectichive889
    @hectichive889 4 года назад

    I still don’t understand why in the world you would use the mean of y instead of just the residual from the regression line?

  • @shannonmccosky6258
    @shannonmccosky6258 8 лет назад

    What program is used to take these notes?

  • @mishuga
    @mishuga 14 лет назад

    I'm with v2prc. This is the first statement in the playlist that has me confused. Help would be much appreciated.

  • @PetStuBa
    @PetStuBa 6 лет назад +3

    I think it's n-1 ... not n ... there are n-1 degrees of freedom .. (5:57)

    • @mindloop4070
      @mindloop4070 2 года назад

      he is not calculating variance here just to compare with population..just a avearage Squared error so n makes sense

    • @PetStuBa
      @PetStuBa 2 года назад

      @@mindloop4070 yeah good point

  • @jianzhao8986
    @jianzhao8986 8 лет назад

    What's the relationship btw r and r squared?

  • @t2hmn
    @t2hmn 13 лет назад

    @hapolian He is probably writing with a tool called graphic tablet.

  • @vana-t7u
    @vana-t7u 2 года назад

    love you

  • @SimAlgyaxSaTun
    @SimAlgyaxSaTun 3 года назад

    Sal is the GOAT

  • @willsonbasyal7883
    @willsonbasyal7883 2 года назад +1

    khaaaaaaaaaannnnnn

  • @bunnybunny1988
    @bunnybunny1988 6 лет назад

    how to write on the computer screen like this?

  • @ldpy33
    @ldpy33 4 года назад

    So, they both tell us how strong the relationship is between two variables?!

    • @ldpy33
      @ldpy33 4 года назад

      Here's the question: IS THE ANSWER E?
      Which of the following tells us how strong the relationship is between two variables?
      a) the slope of a line
      b) the intercept of a line
      c) the coefficient of determination
      d) the coefficient of correlation
      e) both C and D are correct

  • @ajufsd
    @ajufsd 9 лет назад

    Thank you.. but I badly need the notes just made :p

  • @santiagorozo2530
    @santiagorozo2530 5 лет назад +2

    If you play this a 1.5x speed he sounds like Jeff Goldblum

  • @ansonlai9153
    @ansonlai9153 5 лет назад

    why are you so good in explaining like the how the fk...

  • @wencerich4550
    @wencerich4550 3 года назад

    I don't know where 30% obtain and 1 in the formula for sure am not understand

  • @fusionplaysgames
    @fusionplaysgames 8 месяцев назад

    Me learning after 13 years

  • @Himashree_333
    @Himashree_333 2 года назад

    That's not error that's residual

  • @sunilguptamaths
    @sunilguptamaths 4 года назад

    Hy

  • @MrMr-en8fo
    @MrMr-en8fo 5 лет назад

    lecture Urdu ma dayn smjh n ari

  • @kanishka.b8550
    @kanishka.b8550 3 года назад

    730K views and 2.3K likes? 0.o wut?

  • @sierraclark6129
    @sierraclark6129 3 года назад +1

    “If you declare with your mouth “Jesus is Lord,” and believe in your heart that God raised Him from the dead, you will be saved” (Romans 10:9). Now is the time to accept Jesus as your personal Lord and Savior. Obey His commands and repent of your sins because Jesus is coming back soon. Tomorrow isn’t promised.

  • @markchristophergemzon1052
    @markchristophergemzon1052 4 года назад

    Who is here because of machine learning?