Covariance vs Correlation with simple data | Covariance vs Correlation Coefficient

  • Published: 27 Jul 2024
  • Covariance vs Correlation with simple data | Covariance vs Correlation Coefficient
    #CovarianceVSCorrelation #UnfoldDataScience
    Hello,
    My name is Aman and I am a Data Scientist.
    About this video:
    In this video, I explain covariance and correlation. This is an important statistics concept, so I explain the difference between correlation and covariance using a simple data set. The topics below are covered in this video:
    1. Covariance vs Correlation with simple data
    2. Covariance vs Correlation Coefficient
    3. What is difference between correlation and covariance
    4. Understanding Correlation vs Covariance
    5. How is Covariance different from correlation
    About Unfold Data Science: This channel helps people understand the basics of data science through simple examples in an easy way. Anybody without prior knowledge of computer programming, statistics, machine learning, or artificial intelligence can get a high-level understanding of data science through this channel. The videos are not very technical in nature, so viewers from different backgrounds can grasp them easily.
    If you need data science training from scratch, please fill in this form (please note: training is chargeable):
    docs.google.com/forms/d/1Acua...
    Book recommendation for Data Science:
    Category 1 - Must Read For Every Data Scientist:
    The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
    Python Data Science Handbook - amzn.to/31UCScm
    Business Statistics By Ken Black - amzn.to/2LObAA5
    Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
    Category 2 - Overall Data Science:
    The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
    Predictive Analytics By Eric Siegel - amzn.to/3nsQftV
    Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
    Category 3 - Statistics and Mathematics:
    Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
    Practical Statistics for Data Scientists By Peter Bruce - amzn.to/37wL9Y5
    Category 4 - Machine Learning:
    Introduction to Machine Learning with Python by Andreas C. Müller - amzn.to/3oZ3X7T
    The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
    Category 5 - Programming:
    The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
    Clean Code by Robert C. Martin - amzn.to/3oYOdlt
    My Studio Setup:
    My Camera : amzn.to/3mwXI9I
    My Mic : amzn.to/34phfD0
    My Tripod : amzn.to/3r4HeJA
    My Ring Light : amzn.to/3gZz00F
    Join Facebook group :
    groups/41022...
    Follow on medium : / amanrai77
    Follow on quora: www.quora.com/profile/Aman-Ku...
    Follow on twitter : @unfoldds
    Get connected on LinkedIn : / aman-kumar-b4881440
    Follow on Instagram : unfolddatascience
    Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
    Watch python for data science playlist here:
    • Python Basics For Data...
    Watch statistics and mathematics playlist here :
    • Measures of Central Te...
    Watch End to End Implementation of a simple machine learning model in Python here:
    • How Does Machine Learn...
    Learn Ensemble Model, Bagging and Boosting here:
    • Introduction to Ensemb...
    Build Career in Data Science Playlist:
    • Channel updates - Unfo...
    Artificial Neural Network and Deep Learning Playlist:
    • Intuition behind neura...
    Natural Language Processing playlist:
    • Natural Language Proce...
    Understanding and building recommendation system:
    • Recommendation System ...
    Access all my codes here:
    drive.google.com/drive/folder...
    Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
    My Music: www.bensound.com/royalty-free...

Comments • 186

  • @saikiranreddymekala1346 1 year ago +5

    In the formula for cov(x, y), the denominator should be N-1. Can you please correct this, sir?

    • @UnfoldDataScience 1 year ago

      I will pin this on top of the video.

    • @pkavenger9990 1 year ago +10

      Actually, if you are applying this formula to a sample, the denominator is N-1; if you are applying it to the whole population, it is N.

    • @vaibhavpandey7398 1 year ago +1

      @pkavenger9990 thanks
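
A minimal sketch of the distinction discussed in this thread: the sample and population formulas differ only in the denominator. The function name `covariance`, its `sample` flag, and the data are illustrative assumptions, not the numbers used in the video.

```python
from statistics import mean


def covariance(x, y, sample=True):
    """Covariance of two equal-length sequences.

    sample=True divides by N-1 (estimate from a sample);
    sample=False divides by N (whole population).
    """
    n = len(x)
    mx, my = mean(x), mean(y)
    # Sum of products of deviations from the respective means.
    total = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return total / (n - 1 if sample else n)


# Hypothetical data, chosen only to keep the arithmetic easy to follow.
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]

print(covariance(x, y, sample=True))   # denominator N-1
print(covariance(x, y, sample=False))  # denominator N
```

With five points the two results differ by a factor of 4/5; the numerator is identical in both cases.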

  • @prasadgpa6813 3 years ago +5

    You nail it in every one of your videos. You sell simplified knowledge. Keep it up and may God bless you. Can't wait for more such videos :)

  • @user-pb6pt4rw1l 1 year ago

    Amazing video! Such simple explanation, you earned a loyal subscriber today :)

  • @SomnathGupta-gu2rm 8 months ago +1

    You are a Genius Sir. Thank You So Much for making these Concepts Simple and Lucid. May God Bless You 🙏🙏💐💐

  • @gaytriray7019 3 years ago +10

    You are amazing at what you do! Your passion and dedication are beyond words! Thank you so much, sir.

    • @UnfoldDataScience 3 years ago +1

      Thanks Gayatri for motivating me through your comment.

  • @andresgrtz 1 year ago

    Thank you! Great teacher!

  • @niii5715 1 year ago

    how calm u r while talking man ,excellent explaination! Hats off 🙇‍♂

  • @karthikaanilkumar 1 year ago

    IT'S REALLY AN EASY CLASS AND THE WAY OF YOUR PRESENTATION IS GREAT.

  • @siddheshbhalerao3013 1 year ago

    Thank you sir for clean explanation.

  • @omdodmani3205 11 months ago

    Loved it 😊! you made it very easy to understand thanks !

  • @ayushagarwal7284 1 year ago

    Awesome video sir...Keep shining😀

  • @sexycurse 2 years ago

    Amazing explanation brother..Good Job..👍👍👍

  • @ajaykumarsahoo1404 3 years ago +6

    Hey, I have a doubt. When you change the value of a y variable from 32 to 48, will the mean remain the same? If the mean changes, how can you subtract the previous mean from the new value of y?

    • @UnfoldDataScience 3 years ago

      Let me check once.

    • @simplicityandchaos976 2 years ago +4

      Changing a Y value from 32 to 48 changes the overall mean of Y, so the previous mean can no longer be subtracted from the new value.
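
A small sketch of why the old mean cannot be reused. The y values are made up (the video's data is not reproduced here); only the change from 32 to 48 is taken from the comments.

```python
from statistics import mean

# Hypothetical y values; the last one stands in for the "32" from the video.
y_old = [10, 16, 22, 28, 32]
y_new = y_old[:-1] + [48]  # change the last observation from 32 to 48

mean_old = mean(y_old)
mean_new = mean(y_new)

# Every deviation (y_i - mean) shifts, not only the one for the changed point.
dev_old = [v - mean_old for v in y_old]
dev_new = [v - mean_new for v in y_new]

print(mean_old, mean_new)
print(dev_old)
print(dev_new)
```

Because every deviation changes, the whole covariance numerator has to be recomputed with the new mean.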

  • @UnfoldDataScience 2 years ago

    Access Hindi, English courses here- www.unfolddatascience.com/s/store
    Plz register on the website

  • @yoharihernandez 2 years ago

    Thank you so much! it was hard for me to understand this concept until I found this video. Please keep doing more videos!

  • @ketakishitut2713 1 year ago

    Very well explained, thanks

  • @anaskhan4841 2 years ago

    Thank you Aman❤

  • @kavyanagesh8304 1 year ago

    You are the best teacher! Got goosebumps while listening to your lecture. Thank you so much!

    • @UnfoldDataScience 1 year ago

      Thanks Kavya. Pls share with friends as well. Keep learning

  • @best_movies3736 2 years ago

    You are simply the best! The MasterBlaster in Data Science

  • @champabanerjee1208 2 years ago

    You made it so clear....thank you so much 🙏❤️

  • @animetalks2129 1 year ago

    Sir, can you suggest a Tableau course for me? I am confused about where to learn from.

  • @vidyaanvekar 2 years ago +1

    Thanks for the wonderful explanation. You made my understanding very concrete.

  • @sameerpandey5561 3 years ago +1

    Beautifully explained!!...Thanks for such content

  • @sanjeevkmr5749 3 years ago

    Amazing explanation. Please keep doing the great work. This channel deserves more!!!
    I have heard that before training a ML model, it is advised to remove highly correlated features, Can you explain why?

    • @UnfoldDataScience 3 years ago +1

      Hi Sanjeev, I created a detailed video for this question, Watch it on my channel today 7pm IST. :)

    • @sanjeevkmr5749 3 years ago

      @UnfoldDataScience Thanks a lot!

  • @shabiyaahlam3217 2 years ago

    Helped me a lot!

  • @soheilaahmadi4807 2 years ago +1

    Was amazing. Happy to find your tutorials on YouTube.

  • @pokejishnu 3 years ago +1

    Amazing explanation Aman bhai ... Made it sound so simple. Thanks this helps.

  • @adityapatki151 3 years ago +4

    Hey, I am a data scientist too and really like your content. Can you make a video about how to statistically select the control group size for a marketing campaign?

    • @UnfoldDataScience 3 years ago +1

      Thanks Aditya. Let me think through it. Thanks for asking.

  • @financenanchahal8801 2 years ago

    Amazing explanation sir.....

  • @_itachi7904 3 years ago

    after so much of head banging finally I understood covariance & correlation....thank you so much...

  • @parol3271 3 years ago

    Thanks a lot . A very detailed explanation . Great yaar🙏👍👌

  • @sudhakarvasa2688 1 year ago

    When Yi value is changed from 32 to 48 mean of y will also change

  • @k.vvishwanathan7341 1 year ago

    Please clear my doubt. Covariance tells us the way two variables move together, whether positive or negative. But what does the correlation of those variables tell us? How much they have moved? For example, if the covariance is close to -1, do the two variables move in opposite directions?

  • @sadhnarai8757 3 years ago +1

    Much needed,thanks :)

  • @ragess4rari100 2 years ago +1

    Thank you sir. You're a great teacher.

  • @Nannyhere 1 month ago

    Just watched it before exam and in one go ,i understood the concept ❤keep making such videos sir 🥰🥰

  • @jrsolomon5960 3 years ago

    Thanks...this is very clear.

  • @SumitSinghXd 3 years ago +1

    Great work. These two terms were alien to me, and the online websites had complicated them more. Thanks to you, I have understood them completely. Just a single doubt: on many websites the denominator for variance is N-1, while in your video it is N. Which one should I go for?

  • @emineakpnar6215 3 years ago

    it helped me a lot, thank you🙌

  • @user-qd2dp8ru9w 1 month ago

    V good, thanks

  • @kanamarlapudinaresh9934 3 years ago

    Thanks much.
    In the last formula, you mentioned (standard deviation of x)(standard deviation of y) in the denominator. How do we calculate these? Can you explain?

  • @nagamuthu4382 3 years ago

    i would like to compare the advances level in banks. is covariance useful to compare the gross advances of public and private sector banks for 10 years.

    • @UnfoldDataScience 3 years ago +1

      Maybe you can use a more advanced technique; this is a simple technique.

  • @Storiesbymanas 3 years ago

    nicely explained. thank you!

  • @bipintiwari8751 2 years ago

    why exponential is calculated can you please explain that as well.

  • @vcalls9146 3 years ago +1

    How the new data is handled after the model is moved to production. Example: During model development the categorical data is converted to 1 and 0 using one hot encoding... When the new data is applied in production how the categorical data or text data is processed..

  • @RamanKumar-ss2ro 3 years ago +1

    Thanks a lot for this topic.

  • @onuragmaji 3 years ago

    Good work bro ur teaching style is very cool, keep posting such great content

  • @binitkumarsingh8409 2 years ago

    how to find standard deviation of x and y..basically that denominator

    • @UnfoldDataScience 2 years ago

      We do not need to compute it from scratch; a tool will do it for us.
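
For readers who still want to see the calculation behind the denominator, here is a sketch of the standard deviation computed by hand. The helper name `std_dev`, its `sample` flag, and the data are assumptions for illustration only.

```python
from math import sqrt
from statistics import mean


def std_dev(values, sample=True):
    """Standard deviation: square root of the average squared deviation.

    sample=True divides by N-1, sample=False divides by N.
    """
    n = len(values)
    m = mean(values)
    squared_deviations = sum((v - m) ** 2 for v in values)
    return sqrt(squared_deviations / (n - 1 if sample else n))


# Hypothetical data.
x = [1, 2, 3, 4, 5]
print(std_dev(x, sample=True))
print(std_dev(x, sample=False))
```

Python's standard library offers the same two calculations as `statistics.stdev` (sample) and `statistics.pstdev` (population).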

  • @sudheerrao9820 3 years ago

    Thank you for the video Aman...if four features are positive correlated and four features are negative correlated out of 10 features in dataset...what should we do...I mean which are needs to be dropped and why?

    • @UnfoldDataScience 3 years ago

      Good question Sudheer. To keep it short, choose the variables that are most highly correlated with your target variable (either positively or negatively).
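
One way to read the advice above is to rank features by the absolute value of their correlation with the target. The sketch below assumes made-up feature names and values; `pearson` is an illustrative helper, not code from the video.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    numerator = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return numerator / (sx * sy)


# Hypothetical features and target.
features = {
    "f_pos": [1, 2, 3, 4, 5],   # moves with the target
    "f_neg": [5, 3, 4, 2, 1],   # moves against the target
    "f_weak": [2, 1, 2, 1, 2],  # barely related to the target
}
target = [1, 2, 3, 4, 6]

# Rank by |r| so strong negative correlations count as much as positive ones.
ranked = sorted(
    features,
    key=lambda name: abs(pearson(features[name], target)),
    reverse=True,
)
print(ranked)
```

Sorting on the absolute value is what lets a strongly negative feature outrank a weakly positive one.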

  • @haidiazaman3326 3 years ago

    fantastic channel, def deserves more views

  • @soumyaranjansethi1790 3 years ago

    Amazing sir thank you

  • @humanafees8059 3 years ago

    Thank you so much. I am doing an MSc in Business Analytics from Scotland. Seriously, for every question that hits me, I come to your channel. I am your new subscriber. God bless you.

  • @deepseaoflove3683 3 years ago +1

    Sir, should we divide by N or (N-1) to find the covariance? Some lectures show (N-1).

    • @UnfoldDataScience 3 years ago +1

      Does not matter if your sample size is large.
      See these answers:
      math.stackexchange.com/questions/2936143/do-i-use-n-or-n-1-as-the-denominator-for-covariance

    • @johnsubhash 3 years ago

      @UnfoldDataScience Then will it matter in the case of a small data set?
      What should we take then?
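
On the small-data question: the two formulas share the same numerator, so the population result is always (N-1)/N times the sample result. A quick sketch of how that factor behaves (the helper name is hypothetical):

```python
def population_over_sample(n):
    """Ratio of the divide-by-N covariance to the divide-by-(N-1) covariance."""
    return (n - 1) / n


for n in (5, 30, 1000):
    print(n, population_over_sample(n))
```

For 5 observations the two answers differ by 20 percent, while for 1000 they differ by 0.1 percent. When the data is a sample from a larger population, N-1 is the usual choice.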

  • @udayteja6595 1 year ago +1

    Sir, when you changed 32 to 48, the mean should also change. So it will affect all the deviations in the numerator, not only the last one.

  • @RishabhRishab 3 years ago

    What purpose Covariance value / number is serving ? If I say sign of Covariance tells nature of relationship and Covariance value tells strength of relationship and there is no need of correlation .... how is this statement wrong ?

    • @UnfoldDataScience 2 years ago

      Good question. However, covariance is the basis of correlation, hence that concept came first.
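
A sketch that may help with the question above: the size of the covariance depends on the units of the data, so it cannot be read as a strength on its own, whereas correlation is unit-free. The heights and weights are invented for illustration, and `cov` and `corr` are hypothetical helpers.

```python
from math import sqrt
from statistics import mean


def cov(x, y):
    """Population covariance (divides by N)."""
    mx, my = mean(x), mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)


def corr(x, y):
    """Pearson correlation: covariance divided by both standard deviations."""
    return cov(x, y) / sqrt(cov(x, x) * cov(y, y))


# Hypothetical measurements; the same heights expressed in two units.
height_m = [1.5, 1.6, 1.7, 1.8]
weight_kg = [50, 58, 66, 80]
height_cm = [h * 100 for h in height_m]

print(cov(height_m, weight_kg), corr(height_m, weight_kg))
print(cov(height_cm, weight_kg), corr(height_cm, weight_kg))
```

Switching from metres to centimetres multiplies the covariance by 100 but leaves the correlation unchanged, which is why the value of the covariance alone does not measure the strength of the relationship.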

  • @learningsinlife 2 years ago

    mean will also change if 48 is made new observation

  • @vinayverma9121 2 years ago

    You are amazing buddy you explained it so simply

    • @UnfoldDataScience 2 years ago

      Thanks Vinay for your positive feedback. Please share with others as well who could be benefited from such content.

  • @usmanzahid3711 1 year ago

    What exactly co variance is

  • @gangavijayan8906 9 months ago

    Why covariance is divided by standard deviation?

  • @rishabhsheoran6959 3 years ago +1

    Explained it really well, brother! Good work!

    • @UnfoldDataScience 3 years ago +1

      Thanks Rishabh, please share it a bit in your groups :)

  • @afn8370 1 year ago

    thanksss

  • @tusharbedse9523 2 years ago

    Nicely explained again Aman!

  • @jay_with_the_real6381 3 years ago +1

    Excellent thanks!

  • @m12gaming81 6 months ago

    sir ur doing great work luv u

  • @bayz4918 4 months ago

    it is good keep it up

  • @jigyasasoni1812 1 year ago

    with in a 1:30 sec.... i can say, you are the best teacher.

  • @salmanjaved2816 3 years ago +1

    Thanks bro 👍

  • @sandipansarkar9211 2 years ago

    finished watching

  • @karthickk6587 2 years ago

    Amazing Aman

  • @launchdome3219 3 years ago +1

    very helpful...good job

  • @suhatharsanyoganathapillai2203 2 years ago +1

    Thank you

  • @omkarnarayankar5275 2 years ago

    Best explanation

  • @jananiravinag 3 years ago +1

    Great resource!

  • @chikoo8486 3 years ago

    The unit you are talking about in covariance is +ve and -ve ??

  • @mosama22 2 years ago +1

    Thank you Amen :-) :-)

  • @shubhamkumarjain1329 3 years ago

    Thank you so much... It helped a lot... But i want to know why it ranges between 1 and - 1 and not above that...

    • @UnfoldDataScience 3 years ago

      Welcome Shubham, it's because of the mathematics of the formula itself.
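
A sketch of the standard argument for the bound, using the Cauchy-Schwarz inequality. The symbols $a_i$ and $b_i$ are shorthand introduced here for the deviations from the mean; the common factor $1/N$ (or $1/(N-1)$) cancels between numerator and denominator, so it is omitted.

```latex
% Pearson correlation written as sums of deviations
r_{xy} = \frac{\operatorname{cov}(x, y)}{\sigma_x \sigma_y}
       = \frac{\sum_{i=1}^{N} (x_i - \bar{x})(y_i - \bar{y})}
              {\sqrt{\sum_{i=1}^{N} (x_i - \bar{x})^2}\,
               \sqrt{\sum_{i=1}^{N} (y_i - \bar{y})^2}}

% Cauchy-Schwarz with a_i = x_i - \bar{x} and b_i = y_i - \bar{y}
\left( \sum_{i=1}^{N} a_i b_i \right)^2
  \le \left( \sum_{i=1}^{N} a_i^2 \right) \left( \sum_{i=1}^{N} b_i^2 \right)
\quad\Longrightarrow\quad
|r_{xy}| \le 1
```

Equality holds only when the deviations of y are an exact multiple of the deviations of x, that is, when the points lie on a straight line.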

  • @kiranpol1601 3 years ago +1

    wow... What a explanation

  • @rediscovermath 1 month ago

    When you changed 32 to 48, mean will also change.

  • @yogitabasnal 2 years ago

    Great explanation

  • @lakshaykhandelwal4284 3 years ago

    very well explained.. 👍

  • @megabuzu2726 3 years ago +1

    Perfect explanation

  • @rohitbhosale4614 3 years ago

    You are a champ!

  • @arihantchoudhary 2 years ago

    ❤❤❤

  • @priyamkataria573 2 years ago

    Great content 🔥🔥

  • @balajinatarajan1051 2 years ago +1

    Excellent video

  • @leilyb5224 2 years ago +1

    Perfect 👍👍👍👍

  • @saurabsen3686 3 years ago

    Great presentation that is simple and to the point. However could not fully grasp when calculating correlation, dividing cov(x,y) by sd x multiplied by sd y yields a value between -1 and 1. Why so? Kindly revert.

    • @UnfoldDataScience 2 years ago

      The explanation will be little more mathematical, please see the discussion here Saurabh:
      math.stackexchange.com/questions/564751/how-can-i-simply-prove-that-the-pearson-correlation-coefficient-is-between-1-an

  • @madhurakhaire6583 2 years ago

    amazing explanation

  • @Manya2017 1 year ago

    Can you share the link of the data science group so that I can also join

  • @sandipansarkar9211 3 years ago

    great explanation

  • @D.H.Bangalore 2 years ago

    Looks like while increasing the value of y you forgot to increase the mean of y

  • @humanafees8059 3 years ago

    One suggestion can u please make a video for detailed probability for beginners please it a request

  • @shoooooooooorts8002 2 years ago

    Please make video on statistics for data science A-Z

  • @ashokmeena9631 3 years ago +1

    Sir what about virtual Interview

    • @UnfoldDataScience 3 years ago

      Fill in the form in the previous video. I will share the invite.

  • @dr.sagarfirke2687 3 years ago

    Excellent

  • @anilkumarsharma8901 1 year ago

    Bring supercomputer power to the Windows interface and then the world will follow. Get research done on Vedic math.

  • @anilkumarsharma8901 1 year ago

    More computer power means more success

  • @vaibhavpandey7398 1 year ago

    I want to ask, was this T-shirt on sale? You got it from there, right? 🤣🤣🤣🤣 Joking. Liked your video

  • @sanyamsingh4907 3 years ago

    kids learn from udemy
    legends learn from unfold data science

  • @khanraiyan123 2 years ago

    denominator should be N-1