Shapiro-Wilk test

Поделиться
HTML-код
  • Опубликовано: 3 июл 2024
  • The Shapiro-Wilk test to test for deviations from normality. Also includes an introduction to Q-Q plots, and how they can be used to graphically assess normality.
  • НаукаНаука

Комментарии • 15

  • @keerthanavivin450
    @keerthanavivin450 2 года назад

    Great video! Understood the concept perfectly. A big thanks.

  • @logantyner2214
    @logantyner2214 3 года назад +5

    The way you broke this down makes it incredibly easy to understand. thank you!

  • @langleymcentyre2754
    @langleymcentyre2754 4 месяца назад

    Thank you for making this video it really clarified the concepts for me

  • @aliakbarsafdari4084
    @aliakbarsafdari4084 3 года назад

    This was an excellent video, thanks.
    So, my dataset has columns with very varying counts of values (from 8 to >250).
    Can I use this test as a rough estimation of normality for each column? Or should I prepare Q-Q plots for each?
    Also, if I wish to seek the statistical significance between these columns (they measure the same parameter), what test would you recommend?
    Thanks again.

  • @krahoc
    @krahoc 3 года назад +5

    Remember Shapiro-Wilk's test should not be used for populations greater than 50, so basically it's quite useful for small populations even if it's a bit sensitive.
    Anyway, great video, maybe you should develop a bit the p-value part explaining how you could interpret it as a confidence indicator about H0.

    • @user-hu1sn5or7j
      @user-hu1sn5or7j 3 года назад

      Why such a limitation? Which test is best for a population greater than 50?

    • @krahoc
      @krahoc 3 года назад +3

      @@user-hu1sn5or7j the limitation is inheritant to the test method. In their paper, Shapiro and Wilk use a table to compare the W value to a tabulated critical W, but the table goes only from 3 to 50.
      There is a way to "extend" the test by tabulating yourself the critical values with Monte-Carlo simulation, but that's beyond the Shapiro-Wilk's paper. ;)
      I don't really know what's behind the scenes of the R calculations but I think that's what is done here.

    • @diazjubairy1729
      @diazjubairy1729 2 года назад

      @@krahoc but i've read recent paper talking about shapiro-wilk test is good enough up until 1000 datas

    • @krahoc
      @krahoc 2 года назад +1

      @@diazjubairy1729 if you compute the critical W values by Monte-Carlo simulation, you may use it with any amount of values but the more data you test the less discriminant it will be, just like every normality test. That's the curse of big samples :')

  • @mobileentertainment212
    @mobileentertainment212 Год назад

    could we get more explanation on the numerator? on the actual derivation of the variable a

  • @tomp4925
    @tomp4925 Год назад

    Is there a formula to calculate "a"?

  • @HuzMusFinanceData
    @HuzMusFinanceData 3 года назад

    Thank you for this Video.. In simple words, explain if the P-value is < 0.05 so Data is normal or Abnormal?

    • @swastikpatro6436
      @swastikpatro6436 2 года назад +2

      Normal

    • @HuzMusFinanceData
      @HuzMusFinanceData 2 года назад

      @@swastikpatro6436 Thank you. ❤

    • @vikramjoshi3452
      @vikramjoshi3452 5 месяцев назад

      If p < 0.05, the data are not normally distributed. Remember, the null hypothesis for Shapiro.test() is that the data are normally distributed.