K Means Clustering Algorithm Example in Python - V1

Поделиться
HTML-код
  • Опубликовано: 13 дек 2024

Комментарии • 59

  • @YiannisPi
    @YiannisPi  4 года назад +15

    Hey Everyone! Let me know if you like this video in the comments below! Thanks!

  • @geofflyons3438
    @geofflyons3438 4 года назад +8

    Wow! I love how you explain every line of code so clearly. You are a great teacher. Thank you so much.

    • @YiannisPi
      @YiannisPi  4 года назад

      Glad it was helpful!

  • @Nick-gs4em
    @Nick-gs4em 8 месяцев назад

    3 minutes in I was so excited and confident in my ability to learn this. What a boss teacher!

  • @Asylum_M
    @Asylum_M 4 года назад +3

    Thank you for this great video! I really wish you could make a follow-up in-depth video about various clustering algorithms, going in detail about ways to deal with large number of dimensions (PCA often makes data uninterpretable), categorical features and how to correctly select dimensions to project your clusters on.

  • @gmanfromdibloc2479
    @gmanfromdibloc2479 3 года назад

    The best Data science channel. You really are good at this man!

  • @scottgilsdorf2938
    @scottgilsdorf2938 4 года назад +3

    Again thank you for a very concise lesson! I appreciate the time you put into everything that goes into your lessons! A lot of good take always!

    • @YiannisPi
      @YiannisPi  4 года назад

      Thanks Scott! Glad you liked it!

  • @andreasp.189
    @andreasp.189 4 года назад

    Excellent tutorial Yiannis - attention to detail and explaining step by step in a brief and concise manner! Keep ur videos coming and all the very best! U deserve it!!!

    • @YiannisPi
      @YiannisPi  4 года назад

      Glad it was helpful!

  • @Sfisoul
    @Sfisoul 3 года назад +1

    Thank you so much for this example. I like how concise you are with your explanations. Definitely top 3 data science channels for me now! Subscribed!

  • @fajnefajne4339
    @fajnefajne4339 2 месяца назад

    HI Yiannis, Than kyou so much for that series. Tha is great. I love to listen your tutorials. They are very practical and informative. :-)

  • @elainee7116
    @elainee7116 3 года назад

    This Kmeans series is great. Have subscribed for future learnings. Thanks, and please make more content about data science.

  • @tomjohnas134
    @tomjohnas134 4 года назад +1

    Great content Yiannis! Keep them coming!

  • @atb0007
    @atb0007 2 года назад

    holy shit! your channels really underrated! i hope you blow up

  • @trololollolololololl
    @trololollolololololl 4 года назад +1

    Very good video, good that u add supplementory videos from such great ytbers like statquest. It shows how u really like us to know things, cant wait for future videos

  • @sourabhsoni8771
    @sourabhsoni8771 4 года назад +1

    your content is amazing sir with end to end including theory & visualize concept

    • @YiannisPi
      @YiannisPi  4 года назад

      Thanks man! This is what the community requested! So I deliver :)

    • @YiannisPi
      @YiannisPi  4 года назад

      @wise guy Nop, I am not an "institution" so I cannot offer that. That's why the course is really cheap; exactly as much as I need for the monthly payment to keep the page running. The course is geared towards learning how to handle data, clean / transform / join / visualize and generate insights. It's all about the skills! :)

  • @sudippandit7051
    @sudippandit7051 7 месяцев назад

    This is great session!

  • @camtu4010
    @camtu4010 2 года назад

    Thank you for this video with explaining really ez to understand and your code is so clear. Hope you will make more content in like this

  • @Belakavadi
    @Belakavadi Год назад

    awesome explanation

  • @victor.c.nwachukwu9076
    @victor.c.nwachukwu9076 2 года назад

    You made this look easy. Thank you

  • @Emy-ib1be
    @Emy-ib1be Год назад

    Hi! Please, when transforming string data to numerical, does the method or strategy used affect or have an impact on the algorithm results? For example if I decide to replace months by 1,2,3,..,12 or if I use the get dummies function.

  • @khadijaalam4336
    @khadijaalam4336 3 года назад +1

    Great work! I love your way of explaining everything and then give the reasoning as well! Can you please explain how to plot the clusters of k-mean? Thanks a lot in advance! :)

    • @YiannisPi
      @YiannisPi  3 года назад +1

      I explain plotting in part 2 of this series. Check it out!

    • @khadijaalam4336
      @khadijaalam4336 3 года назад

      @@YiannisPi Yes I checked that out! Thank you once again! :)

  • @trailbleza
    @trailbleza 3 года назад

    Excellent Video! You are a life saver.

  • @KziziGhizlanita-fj9vq
    @KziziGhizlanita-fj9vq Год назад

    God job can you help me to get a code example of topological data analysis and clustering using Persistent Homology. Thanks.

  • @Magix4D
    @Magix4D 3 года назад

    Thank a lot Yiannis. You applied a pd.get_dummies because all your features were characters. What if you have a mix of numerical and characters in the original data set? Example: country, client, product, sales, market share ? can i face a bug if my "product" feature contains 100000+ distinct values? thanks so much !!!

  • @alexuche9304
    @alexuche9304 Год назад

    hello pls i have a question, can i get the link to the dataset origin

  • @AbrarMuhtasim
    @AbrarMuhtasim 2 года назад

    make a video on ''customer segmentation and clustering in retail using machine learning'' using real retail dataset

  • @siddeghamid8587
    @siddeghamid8587 3 года назад

    Amazing video. Thumbs up

  • @hsumin3302
    @hsumin3302 2 года назад

    Thanks for sharing.

  • @aminedahane5874
    @aminedahane5874 4 года назад

    Good Job Sir

  • @farhanakbar8425
    @farhanakbar8425 3 года назад

    thank you this is really helpful !

    • @YiannisPi
      @YiannisPi  3 года назад

      Glad it was helpful!

  • @razzi50
    @razzi50 4 года назад +2

    Can we have your Github to enable us walk through your codes

    • @YiannisPi
      @YiannisPi  4 года назад

      Sure! it's in the desc!

  • @MrNabiwishes
    @MrNabiwishes 4 года назад

    just wanted to ask about , you havent done scaling . Why ?

    • @YiannisPi
      @YiannisPi  4 года назад +1

      Hey Nabi. Scaling comes into place when you have big scales and you want to normalize the data within a particular range; sometimes it helps in speed of calculations and outcome. In our case, we only have zeros and ones so there is no need for scaling. Hope this helps!

    • @MrNabiwishes
      @MrNabiwishes 4 года назад

      @@YiannisPi yes thanks for the answere , one more scenario if you you have only binary data in such case without scaling will work but if .if in case you have a data set which has a numerical and binary column . do we need scaling only for numerical column or scaling should be for both data type . One more question attached to this . selection criteria of variable for clustering should have what variance value for the better clustering .....

    • @YiannisPi
      @YiannisPi  4 года назад

      @@MrNabiwishes If you have both then, you only (might need to) scale the numeric. But it depends, your numbers may not need scaling the first place. Not sure what you mean from the second question :)

    • @MrNabiwishes
      @MrNabiwishes 4 года назад

      @@YiannisPi how to select variables which would be significant for clustering .depending on their data type . what strategy you use for variables selection before clustering is applied.

    • @YiannisPi
      @YiannisPi  4 года назад

      @@MrNabiwishes Watch all the series on K means to see how I use PCA before clustering! That is one of the many techniques!

  • @mariatereza4348
    @mariatereza4348 2 года назад

    👏👏👏👏👏👏👏👏👏