Project 13. Customer Segmentation using K-Means Clustering with Python | Machine Learning Projects

  • Published: 23 Dec 2024

Comments • 130

  • @gramps4017
    @gramps4017 2 years ago +4

    Massive respect mate!!! Thanks for sharing your knowledge

  • @samirpatel7892
    @samirpatel7892 2 years ago

    An easy-to-understand explanation of k-means clustering.

  • @heitorandrade6387
    @heitorandrade6387 3 years ago +4

    Really good! If you could provide an example of segmentation for a B2B business in the future, it would be great. Thanks!

  • @kaiiiser7
    @kaiiiser7 3 years ago +1

    You are an absolute savior

  • @ginnaamulya461
    @ginnaamulya461 3 years ago +1

    Clear explanation 👍

  • @ArtisticGautam318
    @ArtisticGautam318 11 days ago

    Really Amazing!

  • @MrKapilsingh
    @MrKapilsingh 3 years ago +5

    I watched many videos, but none of them has a better explanation than yours. Great job!!

    • @Siddhardhan
      @Siddhardhan 3 years ago +2

      Thanks a ton, brother 😇

  • @santoshr1708
    @santoshr1708 2 years ago +1

    Thank you sir. You are doing great work.

  • @onaopemipoAmobi
    @onaopemipoAmobi 2 years ago

    You just solved a problem. Thank you for sharing this.

  • @mat9095
    @mat9095 8 months ago +1

    Awesome video 💯💯...understood each and every line you said....just wonderful.....keep going...all the best 👍

  • @victorialeigh2726
    @victorialeigh2726 2 years ago +2

    Hi Siddhardhan
    It's a really really nice lecture. Very considerate and detailed, explaining everything the learners need / want to know!
    Happy new year to you!

    • @Siddhardhan
      @Siddhardhan 2 years ago

      Thanks a ton! Happy New year!

  • @farazrizvi416
    @farazrizvi416 3 years ago +6

    Love your videos man, you have a great way of explaining your code and the concepts.
    I wanted to ask 2 questions; firstly, does it make sense to add the array of clusters back to the original dataframe so you can identify which customers the clusters represent? It would make graphing the clusters easier and also allow you to optimize hyperparameters without needing to retype the code.
    Secondly, can you do videos on deploying models? I couldn't find any good videos of that. - Thanks a lot!
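
    A minimal sketch of the first idea in this comment: attaching the cluster labels to the original DataFrame so that each customer row carries its cluster. The data below is synthetic, and the column names, `n_clusters=5`, and all variable names are assumptions for illustration, not taken from the video's notebook.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Synthetic stand-in for the customer table; the column names are assumptions.
rng = np.random.default_rng(0)
customers = pd.DataFrame({
    "CustomerID": np.arange(1, 101),
    "Annual Income (k$)": rng.integers(15, 140, size=100),
    "Spending Score (1-100)": rng.integers(1, 101, size=100),
})

# Select the two clustering features as a NumPy array.
features = customers[["Annual Income (k$)", "Spending Score (1-100)"]].to_numpy()

# n_clusters=5 is an assumed value for this sketch.
kmeans = KMeans(n_clusters=5, n_init=10, random_state=0)

# fit_predict returns one label per row, in row order, so it can be
# assigned straight back to the DataFrame as a new column.
customers["Cluster"] = kmeans.fit_predict(features)

# Tabular views: the customers in cluster 0, and the size of each cluster.
cluster_0 = customers[customers["Cluster"] == 0]
cluster_sizes = customers["Cluster"].value_counts().sort_index()
print(cluster_0.head())
print(cluster_sizes)
```

    With the labels stored as a column, the table can also be filtered, grouped, or written out with `customers.to_csv(...)` instead of being read off the scatter plot.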

  • @manepallisaisanthosh3728
    @manepallisaisanthosh3728 3 years ago +1

    Bro, your videos are helping a lot, please continue. Please also attach handwritten notes; it would help a lot.

    • @Siddhardhan
      @Siddhardhan 3 years ago +2

      Hi! Thanks for your positive words 😇 I'll see what I can do about the notes

  • @DigsHere
    @DigsHere 1 year ago

    Sooo helpful.
    You gave explanation precisely with points.
    Thank you so much 👍🏻

  • @priyankathakur1691
    @priyankathakur1691 1 year ago

    Thank you for making such good videos. Please keep making more videos on different projects using other unsupervised learning models.

  • @GoodLuckForever-wi9kb
    @GoodLuckForever-wi9kb 1 year ago

    Thanks.
    I like your method of teaching basics along with advanced topics.
    Appreciated.

  • @PenAndSpecs007
    @PenAndSpecs007 2 years ago

    Awesome tutorial! Amazing and clear explanation.

  • @khushiprasad
    @khushiprasad 4 months ago +1

    Thanks for your amazing content❤🎉.

  • @adityamahamuni7365
    @adityamahamuni7365 3 years ago +2

    This one was superbly explained, especially the difficult to understand syntax of .iloc[ ] and one in the scatter plot too. Also I'm a huge fan of Data Visualisation, absolutely love it.
    I had a question: how can we plot a similar scatterplot using Seaborn? Could you please show it in one of the future Unsupervised Learning project videos?
    As always, thanks a lot!🙌🏻

    • @Siddhardhan
      @Siddhardhan 3 years ago

      Sure, will do that! You can use sns.scatterplot().

    • @adityamahamuni7365
      @adityamahamuni7365 3 years ago

      @Siddhardhan Are all the arguments the same?

  • @csebreach8565
    @csebreach8565 3 years ago +8

    Can I use this in my resume and LinkedIn profile? If so, what hashtags and conditions should I follow? By the way, you are a supporting pillar for up-and-coming candidates like me. Thank you.

    • @Siddhardhan
      @Siddhardhan 3 years ago +25

      hi! i don't demand any citation and credit. Just understand the project completely before putting it in your resume, so that you can answer the questions related to this project asked by the interviewer.

    • @csebreach8565
      @csebreach8565 3 years ago +3

      @Siddhardhan Thank you so much

  • @KiddyKidz167
    @KiddyKidz167 2 years ago

    So nice explanation

  • @adarshn8043
    @adarshn8043 1 year ago

    Super video and good explanation.
    What is the accuracy?

  • @Vartwol
    @Vartwol 1 year ago

    Thanks. This one really helped.
    Only one doubt: how can we display the clustered data items in tabular form instead of a scatter plot?

  • @AmitSoni-us7iq
    @AmitSoni-us7iq 2 months ago

    Why didn't we check for outliers?

  • @raveenachakrapani1090
    @raveenachakrapani1090 3 years ago

    Highly helpful. Awesome explanation !!

  • @fatmamansour8606
    @fatmamansour8606 3 years ago

    excellent video and explanation too. Thank you so much and definitely will watch the rest of the videos.

  • @newmail56
    @newmail56 3 years ago

    Sir, I can't understand where in the code the centroids are calculated before the WCSS is computed from the dataset. Please explain.

  • @manojchoukiker3232
    @manojchoukiker3232 3 years ago

    Your lecture series is very good for understanding and nicely explained. I request that you please increase the font size or the screen size. The font is very small, so it's difficult to read. Thanks, sir. Please continue...

  • @NithyaHV
    @NithyaHV 1 year ago

    Hello Siddhardhan, how much data are you using for training and testing?

  • @54-nileshparab96
    @54-nileshparab96 1 year ago

    Do I need to scale independent variables before clustering?

  • @sandipansarkar9211
    @sandipansarkar9211 2 years ago

    finished watching

  • @rohinihrudhay5600
    @rohinihrudhay5600 6 months ago

    Thank you so much man you really helped me a lot.

  • @gaddipatisrikanth1177
    @gaddipatisrikanth1177 3 years ago

    Sir, I got an error while plotting the elbow graph.
    It said that x and y should have the same dimension!
    Please suggest what to do!

  • @gujaratfortunegiants8263
    @gujaratfortunegiants8263 11 months ago

    I have a doubt. Here, for plotting the clusters, we used 2 features in the X array. But what if I have more than 2 features? Please suggest.

  • @dineshjothiram3677
    @dineshjothiram3677 3 years ago

    Your work was awesome and really appreciated sir.

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      Thanks a ton😇

    • @dineshjothiram3677
      @dineshjothiram3677 3 years ago

      @Siddhardhan Sir, I can't understand the KMeans syntax in this project. (The video is blurry, sir.)

  • @copaceanubobi6101
    @copaceanubobi6101 3 years ago

    I hope you make a project on image segmentation using the k-means clustering algorithm. Thank you for all the projects you have done.

  • @gkakash4789
    @gkakash4789 3 years ago

    It's really helpful to me...

  • @debasmitadash9974
    @debasmitadash9974 2 years ago

    Is there any video explaining the theory of k-means clustering? I would like to watch that but couldn't find in your playlist.

  • @sachinvithubone4278
    @sachinvithubone4278 3 years ago

    Thanks for the video.
    For a clustering problem, don't we need to split the data into train and test sets?
    We didn't use the other feature variables. If the dataset has more columns, can we still use only a few of them?

    • @Siddhardhan
      @Siddhardhan 3 years ago +2

      We generally don't split the data for K-Means. If we want to predict new values, we may need to split it. It depends...
      And yes, if we have more suitable columns, we would use them. Let's say there are 3 features; in that case, we would make a 3D scatter plot.
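
      A hedged sketch of clustering on three features, as the reply describes, plus assigning a new point to an existing cluster. The data is synthetic, the feature meanings (age, income, spending score) and `n_clusters=5` are assumptions, and the standardization step is an addition of this sketch that the reply does not mention.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data: age, annual income, spending score (all assumed).
rng = np.random.default_rng(42)
X = np.column_stack([
    rng.integers(18, 70, size=200),
    rng.integers(15, 140, size=200),
    rng.integers(1, 101, size=200),
]).astype(float)

# Standardize so that no feature dominates the Euclidean distance
# simply because it is measured in larger units.
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# K-Means itself is not limited to two features; only the plot is.
kmeans = KMeans(n_clusters=5, n_init=10, random_state=42)
labels = kmeans.fit_predict(X_scaled)

# One centroid per cluster, one coordinate per feature.
print(kmeans.cluster_centers_.shape)  # (5, 3)

# A fitted model can assign a new point to its nearest centroid;
# the new point must go through the same scaler first.
new_customer = scaler.transform(np.array([[30.0, 60.0, 80.0]]))
new_label = kmeans.predict(new_customer)
print(new_label)
```

      For the 3D scatter plot, the three columns of `X` and the `labels` array could be passed to a matplotlib axis created with `fig.add_subplot(projection="3d")`.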

  • @harinirk3365
    @harinirk3365 3 years ago +1

    Sir, here you are comparing annual income and spending score. If I want to compare all the attributes (age, gender, annual income) against spending score, how should I proceed? Can I get the code for that?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! I don't have the code for it at the moment. you can consider multiple dimensions. but you may not get proper clusters.

  • @khansamar6889
    @khansamar6889 2 years ago

    Can we use only one column, i.e. cluster based on spending score alone?

  • @LoneWolf-rj1px
    @LoneWolf-rj1px 2 years ago

    Please show the silhouette score for unsupervised learning!

  • @MuhammadKamran-ii4rh
    @MuhammadKamran-ii4rh 3 years ago

    Hi bro glad to see but waiting for neural network video

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      hi! those will be covered in deep learning course.

  • @agnivamitra8775
    @agnivamitra8775 2 years ago

    Sir, if we have more than two feature columns in k-means clustering, how do we plot the clusters in multiple dimensions, like 4D or 5D?

  • @kollukulurusaitharun7682
    @kollukulurusaitharun7682 3 years ago

    Mr Siddhardhan, can you do a project on handwritten digit recognition with convolutional neural networks?

  • @bhavyav8355
    @bhavyav8355 3 years ago

    Can you explain how the centroids are placed in the graph and how labels like 0, 1, 2, 3, 4 are assigned to the data points?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! I explained it in a short way while implementing the model. you can check that out. I'll make a detailed video on this later while discussing about ML models. it's hard to explain it in the comments.

  • @manepallisaisanthosh3728
    @manepallisaisanthosh3728 3 years ago +1

    bro also upload a video on EDA and its DTALE tool

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      I'll try to include that in future courses

  • @mohsinimam2048
    @mohsinimam2048 2 years ago

    How to use KMeans for more than 2 features?

  • @johnnyannan8341
    @johnnyannan8341 2 years ago

    Thank you very much

  • @anandtalware2283
    @anandtalware2283 1 year ago

    I am confused. When plotting the clusters, X[y_pred == 0, 0] is the 1st cluster and the 1st column on the x-axis. What if we have more than 2 features/columns? How do we plot them then? I hope you will reply to this comment and clear up my confusion 😢😢😢

  • @ayushrai654
    @ayushrai654 3 years ago

    What are the shortcomings in this project? Are there any areas of improvement in this project and what are they? Can you please suggest how can we make this project even better?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! you can do some model optimization. after that you can work on deployment.

  • @prashu25925
    @prashu25925 3 years ago

    Brilliant....thanks

  • @SHASHANK-u3i
    @SHASHANK-u3i 2 years ago

    how to get accuracy in unsupervised learning?

  • @achyuthkayala279
    @achyuthkayala279 4 months ago

    Bro what are the pros and cons that made you to do this project

  • @ChandraKanthmamilla
    @ChandraKanthmamilla 3 years ago

    Thanks for uploading this video. Can you please upload a video on how this customer segmentation works in any domain with end to end implementation?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! I am planning to make videos on deployment after ML and DL course.

  • @Yuvakishore29
    @Yuvakishore29 6 months ago

    Sir, can we keep these projects in our resume?

  • @playatanu
    @playatanu 2 months ago +1

    Thanks

  • @ieltswithtaru3411
    @ieltswithtaru3411 3 years ago

    Hi, how can we visualize the clusters via t-SNE?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! you can go through this documentation: scikit-learn.org/stable/modules/generated/sklearn.manifold.TSNE.html
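
      A minimal sketch of what the linked `sklearn.manifold.TSNE` class could be used for here: projecting multi-feature data to 2D so that K-Means labels can be plotted. The data is synthetic, and the `perplexity=15` and `n_clusters=3` values are arbitrary assumptions for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.manifold import TSNE

# Three synthetic blobs in 4 dimensions (stand-in for a multi-feature table).
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=center, scale=0.5, size=(30, 4))
    for center in (0.0, 4.0, 8.0)
])

# Cluster in the original feature space.
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# Project to 2D for plotting only. perplexity must be smaller than the
# number of samples; 15 is an arbitrary choice for 90 points.
embedding = TSNE(n_components=2, perplexity=15, random_state=0).fit_transform(X)

print(embedding.shape)  # (90, 2)
```

      The two embedding columns can then be drawn with `plt.scatter(embedding[:, 0], embedding[:, 1], c=labels)`. Distances in a t-SNE plot are not the distances K-Means used, so the picture is a visual aid rather than a check of the clustering.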

  • @mohamedaskar418
    @mohamedaskar418 7 months ago

    Please post a video about DBSCAN projects, bro.

  • @melodylyricskannada
    @melodylyricskannada 3 years ago

    Can we download the clustered file with names and ages? If yes, can you explain how?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      I don't get you. can you please elaborate

  • @rahulgaud4340
    @rahulgaud4340 1 year ago

    Thanks a lot.

  • @santanubhattacharya9865
    @santanubhattacharya9865 6 months ago

    Feature scaling will need to be done.

  • @toniiicarbonelll287
    @toniiicarbonelll287 3 years ago

    amazing!

  • @tendulkartejesh3318
    @tendulkartejesh3318 1 month ago

    Hello Brother. I'm going through all your Machine Learning playlist. I could find videos for Supervised Learning algorithms from Intuition, math behind it and building models from scratch. But I couldn't find the same for Unsupervised Learning algorithms brother. If there are videos regarding Unsupervised Learning algorithms from scratch. Could you please share the link here? Or else could you please do videos on Unsupervised Learning from Scratch, because that would be a great use to us bro. Could you please do videos on Unsupervised Learning too?

  • @mahfuzkabirpulak1154
    @mahfuzkabirpulak1154 3 years ago

    What are you using, Pandas or Jupyter?

  • @bhargavsolanki6386
    @bhargavsolanki6386 3 years ago

    Why didn't you run the 10th cell?

  • @prithviraj2366
    @prithviraj2366 3 years ago

    That was awesome video bro. Can you teach us like where to start and basics of Python. Like from the scratch to learn. thanks in advance

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      hi! you can learn from this playlist: ruclips.net/p/PLfFghEzKVmjsNtIRwErklMAN8nJmebB0I
      it is from basics and also contains python basics.

  • @manojgattam8449
    @manojgattam8449 3 years ago

    Bro do you have any report based on this project or any research Paper on this project?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! I don't make reports for my project videos

    • @manojgattam8449
      @manojgattam8449 3 years ago

      @Siddhardhan Bro, can you send your email so that I can contact you?

  • @AwanthaK
    @AwanthaK 3 years ago +1

    Hi! Thanks for the content. Really appreciate your work.
    Can you do a video on ML project personality Prediction system by analyzing CVs?
    It would be really helpful.

  • @vedant3827
    @vedant3827 3 years ago

    Can I run the same code in IDEs like VS Code?

    • @Siddhardhan
      @Siddhardhan 3 years ago

      Yes, you can.

    • @vedant3827
      @vedant3827 3 years ago

      @Siddhardhan Can I link a CSV file from Drive directly to this code using its URL?

  • @pushkarkulkarni6888
    @pushkarkulkarni6888 3 years ago

    Doing a great job man! Can you suggest me a nice laptop for building complex ML and DL projects? Which laptop are you using?

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      hi! thanks for your positive words! I am using dell g5. i won't recommend it for Deep Learning. you can watch this video of Siraj Raval. He has explained which laptop is best for ML: ruclips.net/video/dtFZrFKMiPI/видео.html

  • @ReginaLawrence-wu3ov
    @ReginaLawrence-wu3ov 1 year ago

    Thanks for your video, sir. Would you please share the dataset and source code 🙏

  • @adiityabairwa3444
    @adiityabairwa3444 3 years ago

    Sir please start one series(theory + projects) of Deep learning like ML... We are waiting...

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      hi! I'll make DL course separately after completing ML

  • @sashigole7588
    @sashigole7588 3 years ago

    Nice one really helpful!! Can you do anything on student performance predictions using machine learning ??

    • @Siddhardhan
      @Siddhardhan 3 years ago +3

      hi! I'll include it in my curriculum

  • @ragavanrs97
    @ragavanrs97 2 years ago

    What is the main objective of this project?

  • @abhiskype5291
    @abhiskype5291 3 years ago

    Sir, could you please explain spending score clearly?
    What does spending score mean?

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      higher score means that a person is spending more money to buy more products.

  • @growingfire
    @growingfire 6 months ago

    Thanks a lot !

  • @arshdeepwalia1251
    @arshdeepwalia1251 3 years ago

    Hey bro - do you teach? Ready to get classes

    • @Siddhardhan
      @Siddhardhan 3 years ago

      hi! I am mainly focusing on teaching in RUclips. at the moment, I am not giving individual tutoring.

  • @harshchandra4384
    @harshchandra4384 1 year ago

    But this data is labeled right?

  • @vismayabv482
    @vismayabv482 6 months ago

    Code to find accuracy for the model

  • @Ash-hz7nc
    @Ash-hz7nc 3 years ago

    Why are you considering only the last two columns? You should consider all the columns, as gender and age play a role as well.

    • @Siddhardhan
      @Siddhardhan 3 years ago

      it's because I wanted to create a 2 dimensional cluster. and annual income is a good parameter to choose from. of course, you can consider more parameters as well.

  • @pallapothubhargavramfromib2244
    @pallapothubhargavramfromib2244 3 years ago

    What video can we expect on Monday

    • @Siddhardhan
      @Siddhardhan 3 years ago

      Monday & Wednesday ML Course videos will be posted. this Monday, video will be on vectors & vector operations..

  • @hamzaarshad8952
    @hamzaarshad8952 1 year ago

    plt.scatter(clustering_data[y_pred==0,0],clustering_data[y_pred==0,1],s=50, c='green',label="cluster 1")
    plt.scatter(clustering_data[y_pred==1,0],clustering_data[y_pred==1,1],s=50, c='red',label="cluster 2")
    plt.scatter(clustering_data[y_pred==2,0],clustering_data[y_pred==2,1],s=50, c='blue',label="cluster 3")
    this code is showing an error sir. can you fix it?
    this is the error:
    TypeError Traceback (most recent call last)
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
    3628 try:
    -> 3629 return self._engine.get_loc(casted_key)
    3630 except KeyError as err:
    ~\anaconda3\lib\site-packages\pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
    ~\anaconda3\lib\site-packages\pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
    TypeError: '(array([False, False, False, ..., False, True, False]), 0)' is an invalid key
    During handling of the above exception, another exception occurred:
    InvalidIndexError Traceback (most recent call last)
    ~\AppData\Local\Temp\ipykernel_17080\3756939222.py in
    1 ## plotting all the clusters and their centriods:
    2 plt.figure(figsize=(12,6))
    ----> 3 plt.scatter(X[y==0,0],X[y==0,1],s=50, c='green',label="cluster 1")
    4 plt.scatter(X[y==1,0],X[y==1,1],s=50, c='red',label="cluster 2")
    5 plt.scatter(X[y==2,0],X[y==2,1],s=50, c='blue',label="cluster 3")
    ~\anaconda3\lib\site-packages\pandas\core\frame.py in __getitem__(self, key)
    3503 if self.columns.nlevels > 1:
    3504 return self._getitem_multilevel(key)
    -> 3505 indexer = self.columns.get_loc(key)
    3506 if is_integer(indexer):
    3507 indexer = [indexer]
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
    3634 # InvalidIndexError. Otherwise we fall through and re-raise
    3635 # the TypeError.
    -> 3636 self._check_indexing_error(key)
    3637 raise
    3638
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in _check_indexing_error(self, key)
    5649 # if key is not a scalar, directly raise an error (the code below
    5650 # would convert to numpy arrays and raise later any way) - GH29926
    -> 5651 raise InvalidIndexError(key)
    5652
    5653 @cache_readonly
    InvalidIndexError: (array([False, False, False, ..., False, True, False]), 0)
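
    The traceback above passes through pandas' `DataFrame.__getitem__`, which suggests that `X` is a DataFrame rather than a NumPy array; a DataFrame does not accept `X[mask, 0]`-style indexing. A minimal sketch of two possible fixes follows. The data, the column names, and the labels in `y` are made up for illustration.

```python
import numpy as np
import pandas as pd

# Stand-in for the two feature columns; the names are assumptions.
X_df = pd.DataFrame({
    "Annual Income (k$)": [15, 16, 80, 85, 40],
    "Spending Score (1-100)": [39, 81, 10, 90, 50],
})
y = np.array([0, 1, 2, 1, 0])  # hypothetical cluster labels

# X_df[y == 0, 0] fails because DataFrame.__getitem__ treats the
# (mask, 0) tuple as a column key rather than a row/column selection.
try:
    X_df[y == 0, 0]
    failed = False
except Exception:
    failed = True
print(failed)  # True

# Option 1: convert to a NumPy array, which supports [row_mask, column].
X = X_df.to_numpy()
cluster0_income = X[y == 0, 0]
cluster0_score = X[y == 0, 1]

# Option 2: stay in pandas and use positional indexing with .iloc.
cluster0_income_pd = X_df.iloc[y == 0, 0]

print(cluster0_income)  # [15 40]
```

    With Option 1, the `plt.scatter(X[y == 0, 0], X[y == 0, 1], ...)` calls from the comment should work unchanged once `X` is the array.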

  • @cherubyGreens
    @cherubyGreens 3 years ago

    Can you do a live session explaining ML projects?

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      hi! I haven't thought about it. I'll definitely consider that.

  • @hhmmon5550
    @hhmmon5550 3 years ago +1

    May I know your github account?

  • @hammadyasir2174
    @hammadyasir2174 3 years ago

    kindly upload subtitle also

    • @Siddhardhan
      @Siddhardhan 3 years ago +1

      I'll include auto subtitles

  • @sandipansarkar9211
    @sandipansarkar9211 2 years ago

    finished coding

  • @rajneeshkumar-o5p
    @rajneeshkumar-o5p 2 months ago

    Hindi subtitles 😅😅😅

  • @alfattahsuyadi3618
    @alfattahsuyadi3618 1 year ago

    Thank you very much