The Right Way to Detect Outliers - Outlier Labeling Rule (part 2)

  • Published: 7 Sep 2011

Comments • 25

  • @DougGirard 11 years ago +6

    That reference written out for anyone interested: Hoaglin, D. C., Iglewicz, B., and Tukey, J. W. (1986). Performance of some resistant rules for outlier labeling. Journal of the American Statistical Association, 81, 991-999.

  • @thomasboerman662 6 years ago

    Thanks, great job!

  • @co20b 8 years ago +13

    What if the data has a non-normal distribution?

  • @RemiTa1st 8 years ago

    thank you!!

  • @how2stats 11 years ago +1

    A part 3 should have been apparent at the end of the video. Google 'the right way to detect outliers part 3' and it should come up at the top of the search results.

  • @dtabed 9 years ago

    Thank you

  • @Khanryu 6 years ago

    Using the identical formula as posted here, this did not work for me for the upper quartiles for some reason (verified by comparing to "outliers" in SPSS which use 1.5). I had to put =A2-C6 for "lower" and =C2+C6 for "upper" directly, then it worked like a charm.

  • @Nikkinukks 11 years ago +1

    Does the value g = 1.5 always remain the same for every calculation, irrespective of the quartiles, mean, and median values, or does it change with ....

  • @minnie0672 7 years ago

    Do you use the absolute value if you end up with a negative number?

  • @WahranRai 6 years ago

    Have you heard about box plots?

  • @fatemeh2560 11 years ago

    Thanks a lot. Just to be sure: are you saying that if I want to reference this method, I should cite the third article in the video? Or is this your method, and the "g" value is from that article? I've found the article, but I need to read it some more to understand it thoroughly... Thanks again :)

  • @fatemeh2560 11 years ago

    What happens next? It was getting very interesting but unfortunately the video ended suddenly :(

  • @Clark4345 11 years ago +3

    Great demonstration of the Outlier Labeling Rule, but the video ended before the explanation of why g = 1.50 is not a good value. What is the method for finding the correct value?

    • @capuchono 1 year ago

      Using g = 1.5 gets outliers wrong 50% of the time.
      Using g = 2.2 is much better, as the video says in part 3.
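
To make the effect of g concrete, here is a minimal Python sketch of the outlier labeling rule as it is discussed in this thread: values below Q1 - g*(Q3 - Q1) or above Q3 + g*(Q3 - Q1) are labeled. The sample data and the function names `fences` and `label_outliers` are made up for illustration, and the quartiles use Python's "inclusive" interpolation, which may differ slightly from what SPSS or other packages report.

```python
from statistics import quantiles


def fences(data, g):
    """Return (lower, upper) cut-offs: Q1 - g*IQR and Q3 + g*IQR."""
    q1, _median, q3 = quantiles(data, n=4, method="inclusive")
    step = g * (q3 - q1)
    return q1 - step, q3 + step


def label_outliers(data, g):
    """Return the values that fall outside the fences for this g."""
    lower, upper = fences(data, g)
    return [x for x in data if x < lower or x > upper]


# Hypothetical sample, not taken from the video.
sample = [10, 12, 13, 14, 15, 16, 17, 18, 19, 26]

print(label_outliers(sample, g=1.5))  # 26 lies above the g = 1.5 upper fence
print(label_outliers(sample, g=2.2))  # the wider g = 2.2 fences keep 26
```

With this sample, the only thing that changes between the two calls is the multiplier; the quartiles stay the same.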

  • @isak6626 7 years ago +6

    You don't mention this in your video, but how many times can/should this rule be applied to the same data set? I.e., if I use it once, remove those outliers, and then find new ones after applying the rule a second time, what then?

    • @clidiere 5 years ago

      I wondered the exact same thing

  • @how2stats 11 years ago +1

    It's not my method. Cite the third article.

  • @MrMela69 11 years ago

    I followed the method but it gave me a negative Q1???

  • @chrisnoone1442 10 years ago

    Hi there,
    Great videos. I'm wondering what to do in the case of getting a negative value for the lower bound. In my data Q1 = 1639.995 and Q3 = 2913.56, so with a g of 2.2 I get a lower bound of -1161.848. Is there something I can do? Or is this rule only valid for distributions that are already close to normal before checking for outliers?

    • @oruammattia86 9 years ago

      Hi, I have the same problem, have you found a solution?

    • @how2stats 8 years ago +1

      +Grace C The IQR does tend to work only when the data are fairly normally distributed. If your data are not normally distributed, you should consider using bootstrapping; you won't have to worry about outliers. I have a video on bootstrapping to get you started: ruclips.net/video/9VjzPnoUBJQ/видео.html
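
For reference, the arithmetic behind the lower bound quoted in the question can be written out step by step, assuming the rule is lower bound = Q1 - g(Q3 - Q1):

```latex
\begin{align*}
Q_3 - Q_1 &= 2913.56 - 1639.995 = 1273.565 \\
g\,(Q_3 - Q_1) &= 2.2 \times 1273.565 = 2801.843 \\
\text{lower bound} &= Q_1 - g\,(Q_3 - Q_1) = 1639.995 - 2801.843 = -1161.848
\end{align*}
```

The formula produces a negative bound whenever g times the IQR exceeds Q1. If every observation is positive, that simply means no value gets labeled on the low side.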

  • @domiky97 5 years ago

    How can I calculate the lower quartile (without excel)?

    • @how2stats 5 years ago

      One option in SPSS: Analyze -> Descriptive Statistics -> Explore; click the 'Statistics' button; select the Percentiles option; click 'Continue'; click 'OK'
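
Outside Excel or SPSS, a quartile can also be computed by hand or in a few lines of code. The sketch below is one possible approach in Python, using linear interpolation at position (n - 1) * p in the sorted data. The function name `quantile_by_hand` and the sample values are hypothetical. Several quartile conventions exist, so different packages can report slightly different values for the same data; to my understanding this particular convention matches Excel's QUARTILE.INC, but treat that as an assumption.

```python
import math
from statistics import quantiles


def quantile_by_hand(values, p):
    """Linear-interpolation quantile at position (n - 1) * p in the sorted data."""
    ordered = sorted(values)
    position = (len(ordered) - 1) * p          # fractional, zero-based index
    below = math.floor(position)               # index of the value just below
    above = min(below + 1, len(ordered) - 1)   # index of the value just above
    fraction = position - below
    return ordered[below] + fraction * (ordered[above] - ordered[below])


# Hypothetical sample, not taken from the video.
sample = [10, 12, 13, 14, 15, 16, 17, 18, 19, 26]

q1 = quantile_by_hand(sample, 0.25)  # lower quartile
q3 = quantile_by_hand(sample, 0.75)  # upper quartile
print(q1, q3)

# Cross-check against the standard library's "inclusive" method.
print(quantiles(sample, n=4, method="inclusive"))
```

The cross-check at the end is there because the hand-rolled function and `statistics.quantiles(..., method="inclusive")` should agree on the first and third cut points.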

  • @suzanaub 9 years ago +1

    58.6 + 23 = 81.6, right? not 81.1

    • @ThoRara 9 years ago +1

      Suzana Ulian Benitez, it has something to do with the rounding: 1.5 * 15 is 22.5, not 23. If you add 58.6 to 22.5, you get 81.1.
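
The two totals in this exchange can be reproduced with exact decimal arithmetic. This is only a sketch of the numbers quoted in the comments; it assumes that 58.6 is the upper quartile and 15 is the interquartile range from the video, which the comments imply but do not state.

```python
from decimal import Decimal

q3 = Decimal("58.6")                   # assumed to be the upper quartile in the video
step = Decimal("1.5") * Decimal("15")  # g times the IQR, left unrounded

print(step)                # 22.5
print(q3 + step)           # 81.1
print(q3 + Decimal("23"))  # 81.6, the total if the step is rounded up first
```

Rounding the intermediate step to 23 before adding is what produces the 0.5 discrepancy.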