Finally! Powerful Cluster Analysis with Microsoft Excel is HERE!

Поделиться
HTML-код
  • Опубликовано: 11 сен 2024

Комментарии • 20

  • @DaveOnData
    @DaveOnData  7 месяцев назад +1

    Stuck waiting on Microsoft to grant you access to Python in Excel? Don't wait! Check out my video on how you can get started NOW: ruclips.net/video/0iV4FtpSriY/видео.html

  • @michaelt312
    @michaelt312 7 месяцев назад +2

    Still new to Python and learning. Really thankful for your videos.
    But I do know that likes, comments, and shares will get this video in front of more people.

    • @DaveOnData
      @DaveOnData  7 месяцев назад

      Thank you so much for your continued support of the channel! It is much appreciated.

  • @banihas22
    @banihas22 7 месяцев назад +1

    Awesome series David!

    • @DaveOnData
      @DaveOnData  7 месяцев назад +1

      Thank you! I am glad you are enjoying the content. I'm on a mission to uplevel the quality of Python in Excel content on RUclips.

  • @FelixLizama
    @FelixLizama 5 месяцев назад +1

    hey @daveondata Is there any way that we can get the excel file so we can do this ourselves?

    • @DaveOnData
      @DaveOnData  4 месяца назад

      @FelixLizama - I've added a GitHub repo link to the video description where you can get the Excel workbook file.

    • @FelixLizama
      @FelixLizama 4 месяца назад

      @@DaveOnData thanks!.. you are the best!

  • @ScreamingTurtleRacing
    @ScreamingTurtleRacing 5 месяцев назад +1

    hey David, How do you handle data sets that are incomplete, for example some rows have no value for a specific column?

    • @DaveOnData
      @DaveOnData  5 месяцев назад +1

      Great question! Assuming you can't correct the data (which is common), here are some strategies you can employ:
      1 - Use an algorithm that can handle missing values.
      2 - If only a small percentage of observations have missing data, remove the observations.
      3 - Remove the feature with missing data.
      4 - Find a “proxy feature” for the feature with missing data.
      5 - Fill in the missing data (i.e., impute the missing data).

    • @ScreamingTurtleRacing
      @ScreamingTurtleRacing 5 месяцев назад +1

      @@DaveOnData Thank you for such a complete response! I tried putting in a value that would flag them as not actual data -99999 for example, but I think that just throws the who analysis off. Next I will try removing the rows with null data, then after characterizing the clusters, I will see where the rows with null data might fit back in. sort of a post-hoc assignment to a cluster. Thanks again!

    • @DaveOnData
      @DaveOnData  5 месяцев назад

      You are welcome. Happy data sleuthing!

  • @user-je9iq8kb1d
    @user-je9iq8kb1d 7 месяцев назад +1

    This was great. Thank you for posting this.
    I have a question - can we read and write data from local folder ? For eg: can I run pandas.read_csv() and read a csv file located at C:/User ?

    • @DaveOnData
      @DaveOnData  7 месяцев назад +1

      Python in Excel cannot directly access the local file system, networks, or the Excel object model for security. However, you can use Power Query to source data from local files, databases, APIs, etc. For more details on Python in Excel security, check out the following Microsoft article:
      support.microsoft.com/en-us/office/data-security-and-python-in-excel-33cc88a4-4a87-485e-9ff9-f35958278327

    • @DaveOnData
      @DaveOnData  7 месяцев назад +1

      BTW - I will have an upcoming video demonstrating sourcing 300,000 rows from SQL Server into Python in Excel via Power Query. Stay tuned!

  • @ScreamingTurtleRacing
    @ScreamingTurtleRacing 5 месяцев назад +1

    Hey, great video. I have very little experience in this area and I got this running in an hour or so. I am trying to cluster customer sights(lat/long) in to sales territories. The issue is that with K-means I get all different size clusters. I have done some digging, and I want to try kmedoids clustering. Is it possible to use this method in Excel/Python? Thanks

    • @DaveOnData
      @DaveOnData  5 месяцев назад +1

      A quick scan of the libraries included with Anaconda Python (which Python in Excel uses) showed that k-medoids doesn't appear to be available at this time. The main machine learning library provided with Python in Excel is scikit-learn which does not offer k-medoids. However, you could use a local Python installation and use a package like the following: python-kmedoids.readthedocs.io/en/latest/

    • @ScreamingTurtleRacing
      @ScreamingTurtleRacing 5 месяцев назад +1

      @@DaveOnData thank you, I will try to do a local implementation of Python.

  • @oil_lord
    @oil_lord 7 месяцев назад +1

    When are they fully releasing python in excel for everyone

    • @DaveOnData
      @DaveOnData  7 месяцев назад +1

      Microsoft has not announced the date for Python in Excel General Availability. However, you don't need to wait to start building skills. Check out this video on my channel for more details: ruclips.net/video/0iV4FtpSriY/видео.html