The Gini Impurity Index explained in 8 minutes!

Поделиться
HTML-код
  • Опубликовано: 1 окт 2024

Комментарии • 102

  • @Leonardo-jv1ls
    @Leonardo-jv1ls 3 года назад +60

    This is the exact meaning of "The simplest and best explanation".

  • @ssshukla26
    @ssshukla26 3 года назад +18

    Wow... It was this simple... Certainly I didn't learn this simply enough to understand at my uni... Not my prof's fault btw...

  • @hansenmarc
    @hansenmarc 2 года назад +19

    Best explanation of Gini impurity I’ve ever seen. Thank you!

  • @RogerVandervort
    @RogerVandervort 3 года назад +13

    This explanation is, by far, one of the most simple and direct. It drives an intuitive understanding of the calculation.

  • @TheDavidlloydjones
    @TheDavidlloydjones 3 года назад +4

    Bug Report: Audio vs. video glitch at 0:57~1:01.
    Spoken "on the right it's gonna be 0.47."
    Video shows 0.7.

  • @marcinstrzesak346
    @marcinstrzesak346 Год назад +2

    Very good explanation. Thank you

  • @kishorab
    @kishorab 2 года назад +1

    Is Gini index being calculated with replacement. Blue,red,green,yellow squares consist of items being paired with themselves. If an item is picked it can only be paired with itself by replacing it back.

  • @michelcusteau3184
    @michelcusteau3184 Год назад +1

    You mention that Gini Impurity is going to give values between the range of 0 - 1, However from other sources it says that the Gini Impurity only going to output values between the range of 0 - 0.5 . Is this a mistake in the video?

  • @usmanriaz94
    @usmanriaz94 2 года назад +1

    I thought maximum value of gini index is .5. i am confused. can somebody help ?

  • @reverse_engineered
    @reverse_engineered 3 года назад +5

    What a great and simple way to explain it! I love these visual demonstrations.

  • @bryanbischof4351
    @bryanbischof4351 3 года назад +4

    Great visualizations and explanations.

  • @supersql8406
    @supersql8406 3 года назад +3

    The best gini index explanation!!

  • @JaviOrman
    @JaviOrman 3 года назад +4

    What an intuitive explanation!

  • @ahmadawad4782
    @ahmadawad4782 3 года назад +1

    It seems that the link is wrong. Gives error 404, page not found.

    • @SerranoAcademy
      @SerranoAcademy  3 года назад +1

      Thank you Ahmad! Fixed

    • @ahmadawad4782
      @ahmadawad4782 3 года назад +1

      @@SerranoAcademy Thanks. Just purchased an ebook copy. Can't wait to read through.

    • @SerranoAcademy
      @SerranoAcademy  3 года назад +2

      @@ahmadawad4782 so glad to hear, thank you! I hope you like it! :)

  • @srisrinivas9873
    @srisrinivas9873 3 года назад +2

    Very intuitive and easy to grasp. Thanks for your effort Luis Serrano.

  • @ritikchopra4429
    @ritikchopra4429 3 года назад +1

    Hey, great explanation but I have a doubt, why are we allowed to pick the same element twice?

  • @denisr5250
    @denisr5250 4 месяца назад

    This was an awesome explanation! One small question (maybe correction?) - at around 7:20 shouldn't the Gini index of the diverse set be 1 - (0 + 0 + ... +0) since the probability of getting the same element twice is 0 - there are 10 unique elements i.e. no duplicates, so it's impossible to pick two of the same item.

  • @angjelinhila927
    @angjelinhila927 Год назад

    Je suis confus. Isn't max gini standardized to 0.5? In other words 1 - (0.5^2 + 0.5^2) = 0.5?

  • @oatmilk9545
    @oatmilk9545 Год назад

    I don't get the last example with 10 different classes. in this case, we're never going to have a pair of equal elements (which you started your video with); and in the square where we seek for intersections of two classes, we'll have just an empty cell for each pair of elements from the same class because, again, their pairing isn't possible

  • @marekglowacki2607
    @marekglowacki2607 3 года назад +1

    Great explanation! Could you make an video on Gini Impurity Index vs Gini Coefficient?

  • @VidyaBhandary
    @VidyaBhandary 3 года назад +1

    Awesome explanation ! Thank you for this.

  • @zukofire6424
    @zukofire6424 Год назад

    Thanks Pr. Serrano for this! It helps prepare for my exam! :)

  • @nijat6704
    @nijat6704 2 года назад

    As I know the Gini index ranges between 0 and 0.5. So the answer that you found seems wrong

  • @srinivasachary7392
    @srinivasachary7392 3 года назад

    Wow... Great. Superb Explanation

  • @skamal4u
    @skamal4u 2 года назад +1

    one of the best explanation ever . so simple and easy to follow 👏👏👏

  • @fabio336ful
    @fabio336ful Год назад

    Th explanation I was looking for!

  • @shaporovanatalia6805
    @shaporovanatalia6805 Год назад

    Best explanation ! Thank you!

  • @zaidkidwai7831
    @zaidkidwai7831 2 года назад

    Very well explained, thank you

  • @arshadkazi4559
    @arshadkazi4559 2 года назад

    Amazing, just what I wanted!

  • @TheSoonAnn
    @TheSoonAnn Год назад

    thanks for explanation, concise and clear

  • @grantsmith3653
    @grantsmith3653 2 месяца назад

    Perfect! Thank you!!

  • @VivianSam-l6i
    @VivianSam-l6i 10 месяцев назад

    Great explanation, able to understand in one go!!

  • @vaibhavmishra232
    @vaibhavmishra232 Год назад

    very geniously explained

  • @forpublicstuff728
    @forpublicstuff728 2 года назад

    Awesome! Thank you.

  • @celismaroliveira6081
    @celismaroliveira6081 4 месяца назад

    That is the best explanation of Gini impurity I’ve ever seen!
    Even 8-year-old children can get it. Amazing!
    Congrats Luis Serrano/Serrano Academy!!

  • @wanderbeautyE
    @wanderbeautyE Год назад

    Thank you for your explanation!!! I finally understood what GINI impurity index means!! :D

  • @stephennjuki4206
    @stephennjuki4206 Год назад

    thanks. very succinct.

  • @prackertracker7189
    @prackertracker7189 3 года назад

    2:33 here you say gini is the propability of picking two distinct data points of a data set. At the end you present a totally diverse data set and say the gini index is 0.9. How is that possible since the propability of picking two totally different data points i 100% because we only have distinct and none data points that are the same?

    • @loftyTHEOWNER
      @loftyTHEOWNER 2 года назад +1

      I understood there is no sampling. Just a matrix of all the observations. So for 10 different objects, we have a matrix 10x10 and the elements on the diagonals are equal of course, so you d0 (100 - 10) / 100 = 0.9

  • @fadhlallahbaklouti9111
    @fadhlallahbaklouti9111 2 месяца назад

    Love the explanation

  • @刘鹏宇-k7f
    @刘鹏宇-k7f 2 года назад

    Thank you very much.

  • @noelthomasbejoy3089
    @noelthomasbejoy3089 3 года назад

    if theres only oneof them ,how can 1/10 ^2 exist.Since it cant be selected twice?

  • @eric_bonucci_data
    @eric_bonucci_data Год назад

    This definition of the Gini index is different from the one in Introduction to Statistical Learning with R (Equation 8.6 p.335), could you please elaborate on that ? Thank you

    • @eric_bonucci_data
      @eric_bonucci_data Год назад +1

      I just figured it out : the sum of the proportion of training observations over all classes is equal to 1, so sum(pk(1-pk)) = 1 - sum(pk^2)

    • @eric_bonucci_data
      @eric_bonucci_data Год назад +1

      Knowing that other definition from ISLR also helps to understand why the Gini index can be seen as the probability of sampling two observations of different class in the dataset.

  • @ger9551
    @ger9551 2 года назад

    what happens if all gini index is 0?

  • @karmabender
    @karmabender 3 года назад

    Awesome explanation. This is part of Decision Tree algorithm but you are not making any video on Decision Tree. How Decision tree algo makes nodes and condition itself without applying our own if else statement? Clear explanation on internal working of Decision Tree is not available on youtube that how it works from scratch only using python without using any library like sklearn.

  • @ian-haggerty
    @ian-haggerty 4 месяца назад

    Awesome! You've sold another book :)

    • @SerranoAcademy
      @SerranoAcademy  4 месяца назад

      Yay thanks! Enjoy, and lemme know what you think!

  • @thegreatdream8427
    @thegreatdream8427 3 года назад

    This is basically a measure of average distance between pairs of points in a space. In this case all the points are vertices of a regular unit simplex, so if two elements are the same they're the same point, and if different their distance is 1. If instead you have degrees of difference - distances in type-of-thing-space - the simple formula using squares would stop working, but it would fit the real world better. :)

  • @736939
    @736939 11 месяцев назад

    The real scientist can explain everything by the simple terms. You're the real scientist and thank you very much, unfortunately there are not so many scientist (especially physicians) who are able to use simple language.

  • @shubha07m
    @shubha07m Год назад

    THE Best explanation of Gini index ever, YOU are awesome!

  • @thanhtung24
    @thanhtung24 Год назад

    Best explanation

  • @milenkoobradovic2896
    @milenkoobradovic2896 2 года назад

    👏👏👏

  • @laviusdev3763
    @laviusdev3763 2 года назад

  • @johannahultgren2887
    @johannahultgren2887 Год назад

    Wow this was so good explained!😍 i'm an AI and neuroscience student and your videos are helping me out a lot!🙏

  • @mshirazbaig6055
    @mshirazbaig6055 2 года назад

    Good explanation

  • @MritunjayKumar-ck4hx
    @MritunjayKumar-ck4hx 3 года назад

    amazing

  • @developerboy8341
    @developerboy8341 3 года назад

    Probably best I got the best intuition of Gini index from it, can't thank you enough Man.

  • @chanduiit42
    @chanduiit42 3 года назад

    The best..the best one

  • @camzbeats6993
    @camzbeats6993 3 месяца назад

    Top

  • @tooniatoonia2830
    @tooniatoonia2830 2 года назад

    This man called Luis is a genius, I take Udacity course because of you!

  • @tourdesource
    @tourdesource 2 дня назад

    Shouldn't we eliminate the diagonal? It doesn't make sense to pick the same element twice.

    • @SerranoAcademy
      @SerranoAcademy  День назад +1

      @@tourdesource good point! I thought the same thing, since it makes sense to not take the diagonal, but for some reason they defined it that way. Removing the diagonal, the formula changes from
      1-p_1^2-…-p_k^2
      to
      1-1/n - [p_1^2-…-p_k^2]*(n-1)/n
      So at the end, it will give the same decision tree.

    • @tourdesource
      @tourdesource День назад

      Got it. Simpler formula, same result. Thanks a lot for answering!

  • @xxelurraxx232
    @xxelurraxx232 Год назад

    This was a fantastic explanation of the formula! The visuals helped a ton. Thank you so much!

  • @shubhamtalks9718
    @shubhamtalks9718 2 года назад

    Amazing!!!

  • @abail7010
    @abail7010 7 месяцев назад

    This is such a good and intuitive explanation. Well done and thank you!!

  • @jerrerock
    @jerrerock Год назад

    Thanks.

  • @abdelrhmanrhyaseen6194
    @abdelrhmanrhyaseen6194 2 года назад

    Amazing

  • @apristen
    @apristen 3 года назад

    thanks for great easy to understand explanation!!!

  • @alokranjancs
    @alokranjancs 2 года назад

    I rarely put comments on youtube but this is such a nice explanation of the concept. Thank you

  • @flaviospadavecchia5126
    @flaviospadavecchia5126 3 года назад

    Thank you, Luis! I'm enjoying your book very much :)

  • @joragondafacultyeeedept309
    @joragondafacultyeeedept309 Год назад

    Great Serrano. Best of the presentations I have come across. You are a great teacher. Kudos

  • @scooby95219
    @scooby95219 3 года назад

    very good explanation. thank you!

  • @abdulkarim.jamal.kanaan
    @abdulkarim.jamal.kanaan 3 года назад

    this is the best explanation; I hope the book is as easy to understand as this one :)

  • @kabilakamal8269
    @kabilakamal8269 3 года назад

    Well explained 👍
    Another precise detailed video like that of “Matrix Factorization” 😂
    Please can I have your contact email. I’d like to reach you personally. Thank you

    • @SerranoAcademy
      @SerranoAcademy  3 года назад

      Thank you Kabila! Absolutely, the best way to get in touch is through here serrano.academy/contact/

  • @mattcobras
    @mattcobras 2 года назад

    that's awesome, I like your lesson.

  • @alexbuchko323
    @alexbuchko323 Год назад

    This is an amazing explanation, I didn't know it was that simple!

  • @shashanktripathi3034
    @shashanktripathi3034 3 года назад

    This Really helped Great Work
    Thanks

  • @mohamedgaal5340
    @mohamedgaal5340 3 года назад

    Thanks for this concise explanation!

  • @Q793148210
    @Q793148210 2 года назад

    This is the best gini index video by far ! thankyou

  • @MariaFlorenciay
    @MariaFlorenciay 2 года назад

    Very well explained!

  • @abeferszt2408
    @abeferszt2408 8 месяцев назад

    One of the best explanations I've seen

  • @alphonseinbaraj7602
    @alphonseinbaraj7602 3 года назад

    Wonderful... Great explanation

  • @alioraqsa
    @alioraqsa Год назад

    best explanation i've seen so far

  • @kirtichandrakomarraju5164
    @kirtichandrakomarraju5164 2 года назад

    Explained like a King !!

  • @emrahyener402
    @emrahyener402 3 года назад

    You are great! keep going please!

  • @murilopalomosebilla2999
    @murilopalomosebilla2999 3 года назад

    Really well explained! Thanks!!

  • @alexvass
    @alexvass 9 месяцев назад

    Thanks

    • @SerranoAcademy
      @SerranoAcademy  9 месяцев назад

      @alexvass thank you so much for your kindness!!

  • @vaggelisntaloukas2016
    @vaggelisntaloukas2016 Год назад

    Thanks!

    • @SerranoAcademy
      @SerranoAcademy  Год назад

      Thank you so much for your kind contribution Vaggelis! 😊

  • @siddarthbali12
    @siddarthbali12 3 года назад

    Great explanation

  • @bhaveshvoswal
    @bhaveshvoswal 3 года назад

    Nice