Project 13. Customer Segmentation using K-Means Clustering with Python | Machine Learning Projects

Поделиться
HTML-код
  • Опубликовано: 10 сен 2024
  • Hi! I will be conducting one-on-one discussion with all channel members. Checkout the perks and Join membership if interested: / @siddhardhan Check membership Perks: / @siddhardhan
    . This video is about Customer Segmentation using K-Means Clustering. This is an important example of Market Basket Analysis in Machine Learning and Data Science.
    All presentation files for the Machine Learning course as PDF for as low as ₹200 (INR): Drop a mail to siddhardhans2317@gmail.com
    Enroll at One Neuron to learn from 100 courses in one subscription with 5% discount: courses.ineuro...
    Machine Learning Projects Playlist: • Machine Learning Projects
    Machine Learning Course with Python Playlist: • Machine Learning Cours...
    Hello everyone! I am setting up a donation campaign for my RUclips Channel. If you like my videos and wish to support me financially, you can donate through the following means:
    From India 👉 UPI ID : siddhardhselvam2317@oksbi
    Outside of India? 👉 Paypal id: siddhardhselvam2317@gmail.com
    (No donation is small. Every penny counts)
    Thanks in advance!
    Let's build a Community of Machine Learning experts! Kindly Subscribe here👉 tinyurl.com/md...
    I am making a "Hands-on Machine Learning Course with Python" in RUclips. I'll be posting 3 videos per week: Monday Evening; Wednesday Evening; Friday Evening.
    Dataset file: www.kaggle.com...
    Colab File Link: colab.research...
    Download the Course Curriculum File from here: drive.google.c...
    LinkedIn: / siddhardhan-s-741652207
    Telegram Group: t.me/siddhardhan
    Facebook group: www.facebook.c... Instagram: / siddhardhan23

Комментарии • 126

  • @MrKapilsingh
    @MrKapilsingh 3 года назад +5

    I watched many videos but none of them has better explanation than yours. Great job!!

    • @Siddhardhan
      @Siddhardhan  3 года назад +2

      Thanks a ton, brother 😇

  • @mat9095
    @mat9095 4 месяца назад +1

    Awesome video 💯💯...understood each and every line you said....just wonderful.....keep going...all the best 👍

  • @khushiprasad
    @khushiprasad Месяц назад +1

    Thanks for your amazing content❤🎉.

  • @gramps4017
    @gramps4017 2 года назад +3

    Massive respect mate!!! Thanks for sharing your knowledge

  • @kaiiiser7
    @kaiiiser7 3 года назад +1

    You are an absolute savior

  • @heitorandrade6387
    @heitorandrade6387 2 года назад +4

    Really good ! If you could provide in future an example of segmentation for a B2B business , it would be great. Thanks !

  • @manepallisaisanthosh3728
    @manepallisaisanthosh3728 3 года назад +1

    bro your videos are helping very much pls continue bro..and attach handwritten material it will help a lot

    • @Siddhardhan
      @Siddhardhan  3 года назад +2

      Hi! Thanks for your positive words 😇 I'll see what I can do about the notes

  • @user-wg8vf8pj2s
    @user-wg8vf8pj2s 9 месяцев назад

    Sooo helpful.
    You gave explanation precisely with points.
    Thank you so much 👍🏻

  • @onaopemipoAmobi
    @onaopemipoAmobi Год назад

    You just solved a problem. Thank you for sharing this.

  • @GoodLuckForever-wi9kb
    @GoodLuckForever-wi9kb 9 месяцев назад

    Thanks
    Like Your method to teach basics alongwith advance topics.
    Appreciate

  • @santoshr1708
    @santoshr1708 Год назад +1

    Thank you sir. You are doing great work.

  • @priyankathakur1691
    @priyankathakur1691 Год назад

    Thank u for making such good videos. PLease keep making more videos on different projects of other unsupervised learning models.

  • @farazrizvi416
    @farazrizvi416 2 года назад +6

    Love your videos man, you have a great way of explaining your code and the concepts.
    I wanted to ask 2 questions; firstly, does it make sense to add the array of clusters back to the original dataframe so you can identify which customers the clusters represent? It would make graphing the clusters easier and also allow you to optimize hyperparameters without needing to retype the code.
    Secondly, can you do videos on deploying models? I couldn't find any good videos of that. - Thanks a lot!

  • @samirpatel7892
    @samirpatel7892 Год назад

    easy understanding about the k mean clustering

  • @fatmamansour8606
    @fatmamansour8606 2 года назад

    excellent video and explanation too. Thank you so much and definitely will watch the rest of the videos.

  • @victorialeigh2726
    @victorialeigh2726 2 года назад +2

    Hi Siddhardhan
    It's a really really nice lecture. Very considerate and detailed, explaining everything the learners need / want to know!
    Happy new year to you!

    • @Siddhardhan
      @Siddhardhan  2 года назад

      Thanks a ton! Happy New year!

  • @rohinihrudhay5600
    @rohinihrudhay5600 2 месяца назад

    Thank you so much man you really helped me a lot.

  • @ginnaamulya461
    @ginnaamulya461 2 года назад +1

    Clear explanation 👍

  • @PriyanshGupta2024
    @PriyanshGupta2024 Год назад

    Awesome tutorial! Amazing and clear explanation.

  • @csebreach8565
    @csebreach8565 3 года назад +8

    Can i use this in my resume and linked in profile .. If i use what are the hashtags and conditions to be followed..By the way you are supporting pillar to all upcomers for candidates like me.. thank you

    • @Siddhardhan
      @Siddhardhan  3 года назад +24

      hi! i don't demand any citation and credit. Just understand the project completely before putting it in your resume, so that you can answer the questions related to this project asked by the interviewer.

    • @csebreach8565
      @csebreach8565 3 года назад +3

      @@Siddhardhan thank you so much

  • @adityamahamuni7365
    @adityamahamuni7365 3 года назад +1

    This one was superbly explained, especially the difficult to understand syntax of .iloc[ ] and one in the scatter plot too. Also I'm a huge fan of Data Visualisation, absolutely love it.
    I had a question: how can we plot a similar scatterplot using Seaborn? Could you please show it in one of the future Unsupervised Learning project videos?
    As always, thanks a lot!🙌🏻

    • @Siddhardhan
      @Siddhardhan  3 года назад

      sure! will do that. use sns.scatterplot( )

    • @adityamahamuni7365
      @adityamahamuni7365 3 года назад

      @@Siddhardhan are all the arguments the same?

  • @raveenachakrapani1090
    @raveenachakrapani1090 3 года назад

    Highly helpful. Awesome explanation !!

  • @alfattahsuyadi3618
    @alfattahsuyadi3618 Год назад

    Thank you very much

  • @anandtalware2283
    @anandtalware2283 Год назад

    I am confused, what if we have more than 2 features/columns,, then while plotting clusters,, then x[y_pred=0, 0] is for 1st cluster and 1st colum on x-axis,, then what about more than 2 columns while plots? I hope you would reply this comment and solve my confusion😢😢😢

  • @copaceanubobi6101
    @copaceanubobi6101 3 года назад

    i hope you make a project for image segmentation using k-means clustering algorithm .Thank you for all the projects done

  • @gujaratfortunegiants8263
    @gujaratfortunegiants8263 7 месяцев назад

    I have a doubt... here for plotting the clusters we used 2 data in x array... but what if i have more than 2 features? plz suggest

  • @gaddipatisrikanth1177
    @gaddipatisrikanth1177 2 года назад

    sir i got an error while plotting elbow graph
    it was showing that X and Y should be in same dimension!!
    pl suggest what to do!!!

  • @KiddyKidz167
    @KiddyKidz167 2 года назад

    So nice explanation

  • @newmail56
    @newmail56 2 года назад

    Sir i can't understand where in the entire code the centroid are being calculated before finding out the WCSS from the data set ?? please explain

  • @growingfire
    @growingfire 3 месяца назад

    Thanks a lot !

  • @dineshjothiram3677
    @dineshjothiram3677 3 года назад

    Your work was awesome and really appreciated sir.

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      Thanks a ton😇

    • @dineshjothiram3677
      @dineshjothiram3677 3 года назад

      @@Siddhardhan sir in this project I can't understand KMeans syntax sir. (Video is blur sir)

    • @dineshjothiram3677
      @dineshjothiram3677 3 года назад

      @@Siddhardhan sir in this project I can't understand KMeans syntax sir. (Video is blur sir)

  • @awantha2404
    @awantha2404 3 года назад +1

    Hi! Thanks for the content. Really appreciate your work.
    Can you do a video on ML project personality Prediction system by analyzing CVs?
    It would be really helpful.

  • @Vartwol
    @Vartwol 10 месяцев назад

    Thanks . This one really helped
    Only one doubt ,how canwe display the clustered dataitems in tabular form instead of scatter plot?

  • @sandipansarkar9211
    @sandipansarkar9211 2 года назад

    finished watching

  • @achyuthkayala279
    @achyuthkayala279 Месяц назад

    Bro what are the pros and cons that made you to do this project

  • @rahulgaud4340
    @rahulgaud4340 Год назад

    Thanks a lot.

  • @manojchoukiker3232
    @manojchoukiker3232 3 года назад

    your lecture series very good for understating and nicely explain, i request to you please increase the font size or increase the screen size. t very small font so its difficult to read. thanks sir . please continue....

  • @gkakash4789
    @gkakash4789 3 года назад

    It's really helpful to me...

  • @agnivamitra8775
    @agnivamitra8775 2 года назад

    Sir if we had more than two featured columns in k means clustering then how will we plot the clusters in multi dimension like 4d 5d?

  • @Yuvakishore29
    @Yuvakishore29 2 месяца назад

    Sir can we keep these projects in our resume

  • @hamzaarshad8952
    @hamzaarshad8952 Год назад

    plt.scatter(clustering_data[y_pred==0,0],clustering_data[y_pred==0,1],s=50, c='green',label="cluster 1")
    plt.scatter(clustering_data[y_pred==1,0],clustering_data[y_pred==1,1],s=50, c='red',label="cluster 2")
    plt.scatter(clustering_data[y_pred==2,0],clustering_data[y_pred==2,1],s=50, c='blue',label="cluster 3")
    this code is showing an error sir. can you fix it?
    this is the error:
    TypeError Traceback (most recent call last)
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
    3628 try:
    -> 3629 return self._engine.get_loc(casted_key)
    3630 except KeyError as err:
    ~\anaconda3\lib\site-packages\pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
    ~\anaconda3\lib\site-packages\pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
    TypeError: '(array([False, False, False, ..., False, True, False]), 0)' is an invalid key
    During handling of the above exception, another exception occurred:
    InvalidIndexError Traceback (most recent call last)
    ~\AppData\Local\Temp\ipykernel_17080\3756939222.py in
    1 ## plotting all the clusters and their centriods:
    2 plt.figure(figsize=(12,6))
    ----> 3 plt.scatter(X[y==0,0],X[y==0,1],s=50, c='green',label="cluster 1")
    4 plt.scatter(X[y==1,0],X[y==1,1],s=50, c='red',label="cluster 2")
    5 plt.scatter(X[y==2,0],X[y==2,1],s=50, c='blue',label="cluster 3")
    ~\anaconda3\lib\site-packages\pandas\core\frame.py in __getitem__(self, key)
    3503 if self.columns.nlevels > 1:
    3504 return self._getitem_multilevel(key)
    -> 3505 indexer = self.columns.get_loc(key)
    3506 if is_integer(indexer):
    3507 indexer = [indexer]
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
    3634 # InvalidIndexError. Otherwise we fall through and re-raise
    3635 # the TypeError.
    -> 3636 self._check_indexing_error(key)
    3637 raise
    3638
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in _check_indexing_error(self, key)
    5649 # if key is not a scalar, directly raise an error (the code below
    5650 # would convert to numpy arrays and raise later any way) - GH29926
    -> 5651 raise InvalidIndexError(key)
    5652
    5653 @cache_readonly
    InvalidIndexError: (array([False, False, False, ..., False, True, False]), 0)

  • @54-nileshparab96
    @54-nileshparab96 Год назад

    Do i need to scale independent variables before clustering?

  • @user-gr7wj2kn5e
    @user-gr7wj2kn5e Год назад

    Hello Siddhardhan, How many dataset r u taking for Training and Testing?

  • @adarshn8043
    @adarshn8043 Год назад

    Super video and good explanation
    What is the accuracy

  • @toniiicarbonelll287
    @toniiicarbonelll287 2 года назад

    amazing!

  • @prashu25925
    @prashu25925 3 года назад

    Brilliant....thanks

  • @ChandraKanthmamilla
    @ChandraKanthmamilla 3 года назад

    Thanks for uploading this video. Can you please upload a video on how this customer segmentation works in any domain with end to end implementation?

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! I am planning to make videos on deployment after ML and DL course.

  • @debasmitadash9974
    @debasmitadash9974 2 года назад

    Is there any video explaining the theory of k-means clustering? I would like to watch that but couldn't find in your playlist.

  • @khansamar6889
    @khansamar6889 2 года назад

    can we use only one column i.e based on spending score?

  • @harinirk3365
    @harinirk3365 3 года назад +1

    Sir here you are comparing annual income and spending score here. If i want to compare all the attributes ( age, gender, annual income)vs spending score how to proceed sir? Can I get the code for that?

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! I don't have the code for it at the moment. you can consider multiple dimensions. but you may not get proper clusters.

  • @manepallisaisanthosh3728
    @manepallisaisanthosh3728 3 года назад +1

    bro also upload a video on EDA and its DTALE tool

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      I'll try to include that in future courses

  • @MuhammadKamran-ii4rh
    @MuhammadKamran-ii4rh 3 года назад

    Hi bro glad to see but waiting for neural network video

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      hi! those will be covered in deep learning course.

  • @prithviraj2366
    @prithviraj2366 3 года назад

    That was awesome video bro. Can you teach us like where to start and basics of Python. Like from the scratch to learn. thanks in advance

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      hi! you can learn from this playlist: ruclips.net/p/PLfFghEzKVmjsNtIRwErklMAN8nJmebB0I
      it is from basics and also contains python basics.

  • @kollukulurusaitharun7682
    @kollukulurusaitharun7682 2 года назад

    Mr Siddhardhan can u do a project on Handwritten Digits recognition with convolution neural networks.

  • @melodylyricskannada
    @melodylyricskannada 3 года назад

    Can we download the clustered file with Names,Ages?? if yes can you explain

    • @Siddhardhan
      @Siddhardhan  3 года назад

      I don't get you. can you please elaborate

  • @sachinvithubone4278
    @sachinvithubone4278 3 года назад

    thanks for video.
    For clustering problem we no need to split data into train and test set?
    we are not used other features variables.. if in dataset having more columns then also we can use few columns?

    • @Siddhardhan
      @Siddhardhan  3 года назад +2

      we generally don't split the data for K-Means. if we want to predict new Values, we may need to split it. it depends...
      and yes, if we have more suitable columns, we would use them. let's say that there are 3 features. in that case, we would make a 3d scatter plot.

  • @LoneWolf-rj1px
    @LoneWolf-rj1px 2 года назад

    Please show the silhouette score for unsupervised learning!

  • @mohsinimam2048
    @mohsinimam2048 2 года назад

    How to use KMeans for more than 2 features?

  • @mohamedaskar418
    @mohamedaskar418 3 месяца назад

    Please post the vedio about DBSCAN projects bro

  • @santanubhattacharya9865
    @santanubhattacharya9865 3 месяца назад

    Feature scaling karna hoga..

  • @ayushrai654
    @ayushrai654 3 года назад

    What are the shortcomings in this project? Are there any areas of improvement in this project and what are they? Can you please suggest how can we make this project even better?

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! you can do some model optimization. after that you can work on deployment.

  • @ReginaLawrence-wu3ov
    @ReginaLawrence-wu3ov Год назад

    Thanks for your video sir.Would you pls share the dataset and source code 🙏

  • @sashigole7588
    @sashigole7588 3 года назад

    Nice one really helpful!! Can you do anything on student performance predictions using machine learning ??

    • @Siddhardhan
      @Siddhardhan  3 года назад +3

      hi! I'll include it in my curriculum

  • @adiityabairwa3444
    @adiityabairwa3444 3 года назад

    Sir please start one series(theory + projects) of Deep learning like ML... We are waiting...

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      hi! I'll make DL course separately after completing ML

  • @pallavipatil9589
    @pallavipatil9589 2 года назад

    Can I use it for data science project

  • @ieltswithtaru3411
    @ieltswithtaru3411 3 года назад

    Hi, how can we visualize the clusters it via tsne?

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! you can go through this documentation: scikit-learn.org/stable/modules/generated/sklearn.manifold.TSNE.html

  • @SHASHANK-u3i
    @SHASHANK-u3i Год назад

    how to get accuracy in unsupervised learning?

  • @pushkarkulkarni6888
    @pushkarkulkarni6888 3 года назад

    Doing a great job man! Can you suggest me a nice laptop for building complex ML and DL projects? Which laptop are you using?

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      hi! thanks for your positive words! I am using dell g5. i won't recommend it for Deep Learning. you can watch this video of Siraj Raval. He has explained which laptop is best for ML: ruclips.net/video/dtFZrFKMiPI/видео.html

  • @vismayabv482
    @vismayabv482 2 месяца назад

    Code to find accuracy for the model

  • @bhavyav8355
    @bhavyav8355 3 года назад

    can u explain how the centroids are placed in the graph and how the labels like 0,1,2,3,4 is assigned to data points

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! I explained it in a short way while implementing the model. you can check that out. I'll make a detailed video on this later while discussing about ML models. it's hard to explain it in the comments.

  • @harshchandra4384
    @harshchandra4384 Год назад

    But this data is labeled right?

  • @ragavanrs97
    @ragavanrs97 2 года назад

    what is the main object of this project

  • @bhargavsolanki6386
    @bhargavsolanki6386 3 года назад

    Why didn't you run 10th shell?

  • @Ash-hz7nc
    @Ash-hz7nc 3 года назад

    Why you are considering only the last two columns? you have to consider all the columns as gender and age play a role as well.

    • @Siddhardhan
      @Siddhardhan  3 года назад

      it's because I wanted to create a 2 dimensional cluster. and annual income is a good parameter to choose from. of course, you can consider more parameters as well.

  • @abhiskype5291
    @abhiskype5291 3 года назад

    Sir please tell clear explanation of spending scores ?
    Spending score means what?

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      higher score means that a person is spending more money to buy more products.

  • @mahfuzkabirpulak1154
    @mahfuzkabirpulak1154 3 года назад

    What are you using Pandas or Jupiter?

  • @manojgattam8449
    @manojgattam8449 3 года назад

    Bro do you have any report based on this project or any research Paper on this project?

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! I don't make reports for my project videos

    • @manojgattam8449
      @manojgattam8449 3 года назад

      @@Siddhardhan Bro Can you send your email so that I can contact with you?

  • @vedant3827
    @vedant3827 2 года назад

    can i run the same code in ides like vs code??

    • @Siddhardhan
      @Siddhardhan  2 года назад

      yes, u can

    • @vedant3827
      @vedant3827 2 года назад

      @@Siddhardhan can i link a csv file from drive directly to this code with its url??

  • @cherubyGreens
    @cherubyGreens 3 года назад

    Can you do a live session explaining ML projects?

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      hi! I haven't thought about it. I'll definitely consider that.

  • @arshdeepwalia1251
    @arshdeepwalia1251 3 года назад

    Hey bro - do you teach? Ready to get classes

    • @Siddhardhan
      @Siddhardhan  3 года назад

      hi! I am mainly focusing on teaching in RUclips. at the moment, I am not giving individual tutoring.

  • @sandipansarkar9211
    @sandipansarkar9211 2 года назад

    finished coding

  • @pallapothubhargavramfromib2244
    @pallapothubhargavramfromib2244 3 года назад

    What video can we expect on Monday

    • @Siddhardhan
      @Siddhardhan  3 года назад

      Monday & Wednesday ML Course videos will be posted. this Monday, video will be on vectors & vector operations..

  • @hhmmon5550
    @hhmmon5550 2 года назад

    May I know your github account?

  • @hammadyasir2174
    @hammadyasir2174 3 года назад

    kindly upload subtitle also

    • @Siddhardhan
      @Siddhardhan  3 года назад +1

      I'll include auto subtitles

  • @johnnyannan8341
    @johnnyannan8341 2 года назад

    Thank you very much