How to detect outliers in SPSS

  • Published: 27 Jan 2025

Comments

  • @danielgooner4ever
    @danielgooner4ever 8 years ago +39

    Nice video, SPSS always breaks my brain

  • @JonesAcademy7
    @JonesAcademy7 4 years ago +1

    Thank you for this video. Years later and it is still helpful.

  • @macacoman
    @macacoman 7 years ago +1

    This is one of my favorite channels on YouTube! Thorough yet clear. Keep up the good work, man!

  • @lisaheaney31
    @lisaheaney31 5 years ago +1

    Thanks for providing citations! Really helpful.

  • @AnelyBek
    @AnelyBek 4 years ago

    Thank you Dr. How2stats!

  • @gabitroyano
    @gabitroyano 4 years ago

    Thank you for the explanation! It's very good and simple! Thanks a lot!

  • @yvesburtworthington3244
    @yvesburtworthington3244 8 years ago

    Thank you for helping me with my homework in Advanced Statistics

  • @anitacarrier9386
    @anitacarrier9386 2 years ago

    My lecturer told me not to use box plots to check for outliers, as they rely on the median and interquartile range rather than the mean. He advised me to compute z-scores instead, since these are based on the mean. However, he only showed us how to do that manually and not with SPSS.
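
For readers with the same question, here is a minimal Python sketch of the z-score approach described in this comment. It is not SPSS syntax. The function name `zscore_outliers`, the 3.0 cutoff, and the sample data are illustrative assumptions; the comment itself does not state a cutoff.

```python
import statistics

def zscore_outliers(values, cutoff=3.0):
    """Return (value, z) pairs whose absolute z-score exceeds the cutoff.

    z = (x - mean) / sample standard deviation.
    The 3.0 default is an assumed convention, not taken from the thread.
    """
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample SD (n - 1 denominator)
    return [(x, (x - mean) / sd) for x in values if abs(x - mean) / sd > cutoff]

# Made-up data: fifteen ordinary scores and one very large one.
data = [10, 12, 11, 13, 12, 11, 10, 12, 13, 11, 12, 10, 11, 13, 12, 60]
for value, z in zscore_outliers(data):
    print(f"{value}: z = {z:.2f}")
```

Because the mean and standard deviation are themselves pulled toward extreme values, a z-score cutoff can miss outliers in small samples; that is one reason the median-based box plot rule is often preferred.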

  • @snakeyjake7
    @snakeyjake7 5 years ago

    Really helpful, informative and to the point. Thanks!

  • @dsavkay
    @dsavkay 10 months ago

    Thanks, great insight! 💯

  • @vbeija
    @vbeija 7 years ago

    Thank you for the instructions and references.

  • @willjfit9345
    @willjfit9345 4 years ago +1

    How do you remove the outliers?

  • @mittadileepkumar3756
    @mittadileepkumar3756 7 years ago

    Thank you so much for an amazing explanation. :)

  • @saro4761
    @saro4761 8 years ago

    Thanks so much for this valuable information

  • @YooBro219
    @YooBro219 3 years ago

    Sir, you are GREAT

  • @kyrank.4321
    @kyrank.4321 7 years ago

    Thanks, this was very helpful

  • @nargisali7298
    @nargisali7298 3 years ago

    In multivariate analysis, would a z-score of 3.2 be an outlier if the data set contains 1000 cases?

  • @kritikadmonty8991
    @kritikadmonty8991 5 years ago

    Can we use the outlier labelling method for non-normal data? If not, how do we identify outliers in non-normal data?

    • @how2stats
      @how2stats 5 years ago

      Depends on how non-normal the distribution is. I'd say skew less than .50 should be fine. There are outlier detection methods for non-normal distributions, but I haven't learned them yet!
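
To make the skew rule of thumb in this reply concrete, here is a small Python sketch that computes sample skewness and compares it with 0.50. The function names are hypothetical, and reading "skew less than .50" as an absolute value is an assumption. The formula is the bias-corrected (adjusted Fisher-Pearson) form that statistics packages commonly report.

```python
import math

def sample_skewness(values):
    """Bias-corrected sample skewness (adjusted Fisher-Pearson coefficient)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least 3 values")
    mean = sum(values) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in values) / (n - 1))
    cubed = sum(((x - mean) / sd) ** 3 for x in values)
    return n / ((n - 1) * (n - 2)) * cubed

def roughly_symmetric(values, limit=0.5):
    # Assumption: "skew less than .50" is read as |skewness| < 0.5.
    return abs(sample_skewness(values)) < limit

print(roughly_symmetric([1, 2, 3, 4, 5]))    # symmetric made-up data
print(roughly_symmetric([1, 1, 1, 2, 10]))   # strongly right-skewed made-up data
```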

  • @tsehayneshgedefew5310
    @tsehayneshgedefew5310 3 years ago

    I have two questions. First, is it mandatory to check normality for individual continuous variables, one by one? Second, can we check the normality of our data after coding?

  • @RajeshChaudhary
    @RajeshChaudhary 4 years ago

    It would be great to know about a technique in SPSS to identify an outlier based on standard deviation. Could you please give some guidance on this?

  • @komaljerawla5699
    @komaljerawla5699 5 years ago

    Once you detect an outlier, what do you do next? Do we remove it from the data set?

    • @how2stats
      @how2stats 5 years ago

      Good question. I usually winsorize it: ruclips.net/video/WJuB0vZp6w4/видео.html
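
As a rough illustration of what winsorizing means, here is a Python sketch of one common variant: pulling values that fall outside chosen bounds back to those bounds instead of deleting them. The function name, the data, and the bounds are assumptions for illustration; the linked video may use a different procedure.

```python
def winsorize(values, lower, upper):
    """Clamp every value into [lower, upper]; values already inside are unchanged."""
    return [min(max(x, lower), upper) for x in values]

scores = [3, 4, 5, 5, 6, 7, 12]
# The bounds are assumed here. In practice they would come from whichever
# outlier rule was used, for example the nearest non-outlying values.
print(winsorize(scores, lower=3, upper=7))
```

The sample size stays the same, which is the usual argument for winsorizing over removal.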

  • @devez7
    @devez7 5 years ago

    So how do you choose the 3.0 multiplier? You did the same thing.

    • @how2stats
      @how2stats 5 years ago +3

      You don't have to "choose" anything. SPSS automatically reports results with the 1.5 and 3.0 multipliers (circles and stars, respectively).
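
The two multipliers mentioned in this reply can be sketched in Python as follows. This is an approximation and not SPSS's own code: the function name and data are made up, and the quartiles come from `statistics.quantiles`, which may differ slightly from the hinges SPSS uses for box plots.

```python
import statistics

def classify_boxplot_points(values):
    """Split flagged points into 'outliers' (circles) and 'extremes' (stars)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    outliers, extremes = [], []
    for x in values:
        # Distance beyond the box edge, in raw units (0 if inside the box).
        beyond = max(q1 - x, x - q3, 0)
        if beyond > 3.0 * iqr:
            extremes.append(x)      # more than 3.0 IQRs from the box
        elif beyond > 1.5 * iqr:
            outliers.append(x)      # between 1.5 and 3.0 IQRs from the box
    return {"outliers": outliers, "extremes": extremes}

data = [2, 4, 5, 5, 6, 6, 7, 8, 14, 30]  # made-up sample
print(classify_boxplot_points(data))
```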

  • @ThePookie25
    @ThePookie25 8 years ago

    Thank you for this!

  • @ricardovonschoettler
    @ricardovonschoettler 5 years ago

    Thanks for the video, it has helped me in my research work. But I have a query: in the case of time series, if we want to assess normality, should this be done only on the component called "noise"? Thanks

  • @milenah2227
    @milenah2227 5 years ago

    Good work, thank you for the video! But I've got the problem that my variable is metric with a huge range from 3 to 12 000 000, that is why I can't detect the extreme outliers (multiplier 3.0) visually in the boxplot visualization. The scale is too wide to identify the values that are too low. How can I solve that problem?

    • @how2stats
      @how2stats 5 years ago

      Extreme outliers can distort the visual appeal of a box plot. You might consider simply reporting that the value of 12 000 000 was an outlier and dealt with (either removed or Winsorised). Then, re-do the box plot.

  • @SaadKhanYousafzai
    @SaadKhanYousafzai 8 years ago +1

    Hi there. First of all, I have to thank you for such amazing videos. Secondly, I have a problem and have tried hard to find a solution, but all in vain. I had some missing data and, on top of that, I also removed a few outliers. I have multiple variables for each subject. I tried to run a repeated-measures ANOVA, but because of one missing variable for a subject, all of that subject's other variables are also ignored and I am losing subjects. I had 23 subjects, but the ANOVA analyzes just 14. If I put ZERO in the missing variable's place, it gives me a lower MEAN value. Please tell me how to fix the missing data so I can analyse all the subjects without affecting my MEANS for all the variables.
    P.S.: I cannot use any computation method (I have seen your MCAR videos) to predict the values. It will mess up my data very badly.

  • @shaunlikescheese
    @shaunlikescheese 6 years ago

    Does the 2.2 multiplier break down at all when applied to larger data sets? Say, n = 600?

    • @how2stats
      @how2stats 6 years ago

      Yes. I'd use 2.2 multiplier for samples between 20 and 300. Thereafter, I'd use a multiplier of 3.0.

    • @shaunlikescheese
      @shaunlikescheese 6 years ago

      Is there research supporting this though?

    • @how2stats
      @how2stats 6 years ago

      Yes, check out Hoaglin's research; he might say it in this paper: Hoaglin, D. C., Iglewicz, B., & Tukey, J. W. (1986). Performance of some resistant rules for outlier labeling. Journal of the American Statistical Association, 81(396), 991-999.
      Or another paper in that time period.
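
The sample-size guidance in this thread (2.2 for samples between 20 and 300, 3.0 above that) can be written as a short Python sketch. The function names and data are hypothetical, the thread gives no guidance for samples under 20 so the sketch refuses them, and the quartile method is an assumption that may not match SPSS exactly.

```python
import statistics

def choose_multiplier(n):
    """Multiplier suggested in the replies above: 2.2 for 20 <= n <= 300, else 3.0."""
    if n < 20:
        raise ValueError("the thread gives no guidance for samples under 20")
    return 2.2 if n <= 300 else 3.0

def labeling_bounds(values):
    """Lower and upper bounds: Q1 - g * IQR and Q3 + g * IQR."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    spread = choose_multiplier(len(values)) * (q3 - q1)
    return q1 - spread, q3 + spread

values = list(range(10, 29)) + [80]  # 20 made-up observations
low, high = labeling_bounds(values)
flagged = [x for x in values if x < low or x > high]
print(flagged)
```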

  • @ElizabethPepple
    @ElizabethPepple 5 years ago

    Thank you!

  • @diogotalhinhas1146
    @diogotalhinhas1146 2 years ago

    Very good

  • @HieuNguyen-ju4zl
    @HieuNguyen-ju4zl 5 years ago

    Thank you

  • @alexsisccdr
    @alexsisccdr 8 years ago +2

    Great videos. Where can I get the Excel you are using to calculate outliers based on the 2.2 multiplier?

  • @slsmithy8075
    @slsmithy8075 6 years ago

    Hi, probably a dumb question, but when you go from the Var1 data set to the Var2 data set, what would you call the "error bars" in the Var2 graph? Technically the top error bar isn't the "maximum", as the "maximum" is the outlier. Thanks.

    • @how2stats
      @how2stats 6 years ago

      It's a fine question. They correspond to the 25th (low bar or lower quartile) and 75th (high bar or upper quartile) percentiles.

    • @slsmithy8075
      @slsmithy8075 6 years ago

      @@how2stats I thought the 25th and 75th percentiles were the top and bottom lines of the box?
      I'm asking what you would call the error bars above and below the box, given the outlier is the 'maximum'.
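
On the question in this exchange: in a Tukey-style box plot the lines above and below the box are usually called whiskers, and by convention they end at the most extreme observations that are not flagged as outliers. The Python sketch below illustrates that convention with made-up data. The function name is hypothetical and the quartile method is an assumption that may differ from SPSS.

```python
import statistics

def whisker_ends(values, multiplier=1.5):
    """Smallest and largest observations within multiplier * IQR of the box edges."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    inside = [x for x in values
              if q1 - multiplier * iqr <= x <= q3 + multiplier * iqr]
    return min(inside), max(inside)

sample = [3, 4, 5, 5, 6, 7, 12]  # 12 lies beyond the upper fence
print(whisker_ends(sample))
```

In this sample the upper whisker stops at the largest non-outlying value, while the true maximum is drawn separately as an outlier.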

  • @diogotalhinhas1146
    @diogotalhinhas1146 2 years ago

    Thank you very much

  • @Sharpdus
    @Sharpdus 5 years ago

    so how do you delete this damn 12