Validating a Hierarchical Cluster Analysis

Поделиться
HTML-код
  • Опубликовано: 21 авг 2024
  • In this video I show how to determine the most appropriate number of clusters based on the agglomeration schedule in a hierarchical cluster analysis.

Комментарии • 23

  • @Tanujtiwari
    @Tanujtiwari 4 года назад +1

    thank you very much. my Master's Thesis is complete coz of you!

  • @user-wh1pp2qr7v
    @user-wh1pp2qr7v 2 месяца назад

    Thank you very much, but how did you enter the label in your hierarchical analytical box that presented string results in the dendrogram?

    • @Gaskination
      @Gaskination  2 месяца назад

      I don't think I understand the question. Do you mean the 1, 2, 3, etc. in the blue boxes? It's just a powerpoint square shape that I typed in. The labels on the dendrogram are placed their automatically by SPSS. If you don't see those, perhaps this will help: ruclips.net/video/3bAPwFern_4/видео.htmlsi=brvFqLZ2g0rZkUnO

  • @chefberrypassionateresearcher
    @chefberrypassionateresearcher 4 месяца назад

    Professor, Can I do MGA on the groups created after cluster analysis. And do I need to do measurement Invariance for this kind of groups because they all belong to the same study population. Please reply?

    • @Gaskination
      @Gaskination  4 месяца назад

      Yes, you can do a multigroup analysis on a model using the cluster membership number as the grouping variable. An invariance test will still confirm the measurement invariance (which is not guaranteed within a single sample, even if they are all from the same study population)

    • @chefberrypassionateresearcher
      @chefberrypassionateresearcher 4 месяца назад

      @@Gaskination How does measurement invariance actually work incase of groups created from the same population?

    • @Gaskination
      @Gaskination  4 месяца назад

      @@chefberrypassionateresearcher Even within the same population there are sub-populations. For example, within a company, there are IT workers and accountants. These are different.

  • @nayivassaf
    @nayivassaf 5 лет назад

    But what that distances mean on the X axis? for example, between the bottom cluster line almost on ten and the right line almost on 25

    • @Gaskination
      @Gaskination  5 лет назад

      It represents the distance between clusters. The larger the x-value the larger the distance (difference) between clusters.

  • @gibson4ever28
    @gibson4ever28 2 года назад

    Are the coefficients in the table the same as the heterogeneity measures or how can I determine the individual heterogeneity measures for each cluster solution (Like for 2 clusters, 3 clusters etc...)

    • @Gaskination
      @Gaskination  2 года назад +1

      Yes, those are the heterogeneity measures.

    • @gibson4ever28
      @gibson4ever28 2 года назад

      @@Gaskination Thank you!

  • @jarlath2000
    @jarlath2000 5 лет назад

    Hi James, my data set has around 400 observations and produces quite a smooth line graph from the agglomeration schedule. Do you have any advice as to how to determine the appropriate number of clusters in this case?

    • @Gaskination
      @Gaskination  5 лет назад +1

      In this case, choose a useful number of clusters. 1 or 2 clusters is usually not very helpful. More than six becomes hard to interpret. So, usually 3-5 or six is the best choice.

    • @jarlath2000
      @jarlath2000 5 лет назад

      @@Gaskination Thanks a bunch, your videos have been a real help

  • @mayurgo10
    @mayurgo10 7 лет назад

    hi , my data set is of 900 observations in which i am not get the line graph , can you help

    • @Gaskination
      @Gaskination  7 лет назад

      If you are checking the box for Agglomeration schedule, and it is still not producing the line graph, then I'm not sure. Sorry about that.

  • @flamenkomuva
    @flamenkomuva 7 лет назад

    is there a way to determine variable importance in hierarchical clustering, like you can in two-step?

    • @Gaskination
      @Gaskination  7 лет назад

      flamenkomuva unfortunately, not that I'm aware of.

    • @flamenkomuva
      @flamenkomuva 7 лет назад

      ok, thanks anyway! and thanks for your videos! :}

  • @leihongwu5879
    @leihongwu5879 7 лет назад

    shall the grilled chicken nugget separate at the level "3" but not the "4"?

    • @Gaskination
      @Gaskination  7 лет назад

      The other way. They separate at level four, but not three.

    • @leihongwu5879
      @leihongwu5879 7 лет назад

      if you look at the distance (X-axis), the distance between grilled chicken and other two occurred even earlier than the second branch, which means they are not "level 4" separation but should be "level 2" instead. You cannot judge a separation simply based on the balancing/number of nodes.