Hierarchical Cluster Analysis [Simply explained]

Поделиться
HTML-код
  • Опубликовано: 14 июл 2024
  • What is Hierarchical Cluster Analysis? And how is it calculated?
    A hierarchical cluster analysis is a clustering method that creates a hierarchical tree of objects to be clustered (Dendrogram). The tree represents the relationships between the objects and shows how the objects are clustered at different levels.
    ► Load sample data set
    datatab.net/statistics-calcul...
    ► Online Calculator Hierarchical Cluster Analysis
    datatab.net/statistics-calcul...
    ► Hierarchical Cluster Analysis Tutorial
    datatab.net/tutorial/hierarch...
    ► E-BOOK
    datatab.net/statistics-book
    00:00 What is Hierarchical Cluster Analysis?
    00:31 Example of Hierarchical Cluster Analysis
    00:50 Calculate hierarchical cluster analysis
    06:32 Calculate hierarchical cluster analysis online

Комментарии • 55

  • @4chanFootballMemes
    @4chanFootballMemes 5 месяцев назад +28

    I loved learning about "Heyrakikal" clustering

  • @amobindubuisi2631
    @amobindubuisi2631 Месяц назад +3

    this is an extremely good material. top-notch. never seen something so easily explained as done on this content.

  • @odiakaolika5715
    @odiakaolika5715 4 месяца назад +8

    You just made my evening with your simple explanation.

    • @datatab
      @datatab  3 месяца назад

      Glad it was helpful and many thanks for your feedback! Regards Hannah

  • @ozgurogur1297
    @ozgurogur1297 Год назад +2

    I found it very understandable and simple. thanks a lot!

  • @alexfrancois
    @alexfrancois Год назад +1

    Beautifully explained, thanks! 🙏 Incredibly clear.

  • @shawnkim6287
    @shawnkim6287 10 месяцев назад

    thank you so much. you clarified a lot!!!!
    😀

  • @masteroftheworld001
    @masteroftheworld001 10 месяцев назад

    well explained thank you so much

  • @Motivasi.Quotes
    @Motivasi.Quotes 22 дня назад

    such a very good vidio. Thank u so much for your explanation

  • @saurabhjoshi3010
    @saurabhjoshi3010 7 месяцев назад

    nicely explained

  • @nakirambau7632
    @nakirambau7632 8 месяцев назад +3

    thank you so much, you have explained it so well

    • @datatab
      @datatab  8 месяцев назад +1

      Glad it was helpful!

  • @manuelleitner3196
    @manuelleitner3196 Год назад +3

    Great video, thank you!!!

  • @ricardorivashernandez4023
    @ricardorivashernandez4023 9 месяцев назад

    Real good!

  • @osmancetinkaya8930
    @osmancetinkaya8930 Год назад +10

    How might be the sqr of 17 (16+1) =equal to 3,162 ? it must be 4,123 is not?

    • @manuelruelas3496
      @manuelruelas3496 10 месяцев назад +4

      The error is that the x distance is 3 (from 1 to 4) not 4, so it’s the sq root of 10.

  • @matheusdelima1743
    @matheusdelima1743 Год назад +1

    Great content. I'm a fan :)

    • @datatab
      @datatab  Год назад +1

      Glad it was helpful and many thanks for your nice feedback! Regards Hannah

    • @iqraahmad130
      @iqraahmad130 Год назад

      youre kinda cute

  • @rileyharper7679
    @rileyharper7679 8 месяцев назад +4

    The Euclidean distance horizontal component at 2:17 should be 3 not 4 since 4 - 1 = 3. Also, the manhattan distance should be 4 and the maximum distance should be 3 for the same reason.

    • @playbros332
      @playbros332 6 месяцев назад +1

      I agree they are wrong, but shouldn't it be square root of 17, which is 4.12?

    • @fabianr9394
      @fabianr9394 6 месяцев назад

      Because you go 3 steps to the right and 1 up; so sqrt(3^2 + 1^2)​@@playbros332

  • @ibrahimabubakarzango9803
    @ibrahimabubakarzango9803 3 месяца назад +1

    Pls endeavour to avoid making mistakes thanks for comment section i could have got it so difficult to comprehend. That aspect of sqrt of 17 is terrible. But u did well and this video is good too

    • @datatab
      @datatab  3 месяца назад

      Hi thanks for youre feedback! We try to avoid mistakes, sorry for that and for the resulting trouble! Regards, Hannah

    • @Oladayo1
      @Oladayo1 3 месяца назад

      well, that's because it's the sqrt of 10 not sqrt of 17. The mistake was using 4 instead of 3

  • @rodidoesburg4061
    @rodidoesburg4061 6 месяцев назад

    How do you name the clusters? Just from left to right, so cluster 1, cluster 2, cluster 3. Or are there more methods to name a cluster?

  • @samuraixyz22
    @samuraixyz22 Год назад +2

    I would like to RUclips tutorials like this. Do you have recommendations on what softwares to use?

    • @datatab
      @datatab  Год назад

      DATAtab : )

    • @samuraixyz22
      @samuraixyz22 Год назад

      @DATAtab where can you learn more about it?

  • @ibethdiaztapia1033
    @ibethdiaztapia1033 Месяц назад

    hi. it should 3 - 1 for euclidean as the formula is square of XB1-XA1

  • @luisamar8214
    @luisamar8214 2 месяца назад

    How you calculate the distances between Lisa, Joe with the others?? you have a group of positions not just one... how do you do that? thankss!

    • @datatab
      @datatab  2 месяца назад

      Hi, in this case you would first calcualte the center between Lisa and Joe and then the diestance from this center to one other Person. Regards Hannah

  • @fredh3152
    @fredh3152 2 месяца назад +1

    i love your accent

  • @nazhifmuh.kasyfan2148
    @nazhifmuh.kasyfan2148 2 месяца назад +1

    I would like to ask, is Hierarchical Cluster Analysis always associated with the Euclidean Distance? Thank you

    • @datatab
      @datatab  2 месяца назад

      Hi many thanks for your question, Hierarchical Cluster Analysis (HCA) is not always associated with the Euclidean distance. While Euclidean distance is commonly used, HCA can work with various distance metrics depending on the nature of the data and the analysis goals.
      Here are some common distance metrics used in HCA:
      - Euclidean Distance: This is the straight-line distance between two points in a multi-dimensional space. It's one of the simplest and most widely used distance metrics.
      - Manhattan Distance (also known as City Block or L1 distance): This is the sum of absolute differences between coordinates. It can be suitable when diagonal movement isn't meaningful.
      - Cosine Similarity: This measures the cosine of the angle between two vectors, commonly used in text analysis and other contexts where vector magnitude might vary.
      - Mahalanobis Distance: It accounts for correlations in data by incorporating the covariance matrix, making it suitable for data with different scales and correlations among variables.
      - Minkowski Distance: A generalization of Euclidean and Manhattan distances, with a parameter 'p' to control the degree of the norm.
      - Correlation-based Distance: This distance uses the correlation between data points rather than absolute differences. It's common in gene expression analysis or other contexts where relationships between variables matter more than absolute values.
      I hope this was helpful : ) Regards Hannah

  • @maxwellspyk494
    @maxwellspyk494 Год назад +1

    hi where can i find the elbo method

    • @datatab
      @datatab  Год назад

      Oh sorry, it will be there soon!!!

  • @python4ncert202
    @python4ncert202 Год назад

    Nice video!
    I want to know the name of algorithm that you have used here to explain hierarchical clustering.

    • @Nothingimportant1
      @Nothingimportant1 Год назад

      I want too, but it is hight probable that she does not tell us. Statistics saying.

    • @muhammadwaseem_
      @muhammadwaseem_ Год назад

      @@Nothingimportant1 AGNES

  • @mahidahmed7
    @mahidahmed7 Год назад +1

    klaaastarrrrss

  • @s.h.a6472
    @s.h.a6472 15 дней назад

    خدا خیرت بده بانو

  • @ahmad3823
    @ahmad3823 2 месяца назад +1

    4-1=3 though!

  • @asrarbw
    @asrarbw 4 месяца назад

    Claaaastars 😂

  • @user-vo4ew1gx
    @user-vo4ew1gx Год назад +4

    Excellent explanation. Why it takes too long to create a new video?

    • @datatab
      @datatab  Год назад +2

      Good question! : ) We need almost two weeks to prepare the topic and to create the slides! Regards Hannah

    • @user-vo4ew1gx
      @user-vo4ew1gx Год назад

      @@datatab i hope it will be fast :)

  • @PaulKam1997
    @PaulKam1997 Год назад +1

    is and not und at 3:15

  • @abdulaziznazarov9661
    @abdulaziznazarov9661 5 месяцев назад +1

    i think you have a mistakes with calculating