Multiple regression - Checking Assumptions - for Beginners

  • Published: 15 Jul 2024
  • This video can be used in conjunction with the "Multiple Regression - The Basics" video ( • Multiple Regresssion -... ).
    In this video, I show you how to check multiple regression assumptions in a few steps using IBM SPSS.
    Although it is not exactly the same as SPSS, you can download a free program, PSPP, that is similar to SPSS: www.gnu.org/software/pspp/get.... It is close enough to SPSS that you should be able to follow along with this video using PSPP.
    I used materials from the following books for this video:
    a. Lind, D, Marchal, W, & Wathen, S. (2012). Statistical Techniques in Business and Economics (15th Edition). Boston: McGraw-Hill. ISBN-13: 978-0-07-340180-5 (Textbook web resources: www.mhhe.com/lind15e)
    b. Field, A. (2013). Discovering Statistics Using IBM SPSS Statistics (4th Edition). London: Sage Publications Ltd. ISBN-13: 978-1446249185
    To add the ability to increase the playback speed of YouTube videos, go to the link below and click on the link to request the HTML5 viewer. It will allow you to change the speed of playback by clicking on the gear icon in the bottom right of your YouTube video screen (the same gear you use to change the quality). You should do this - playing my videos at 1.5 speed makes them seem better. :+)
    / html5

Comments • 65

  • @ellaluzpicavet
    @ellaluzpicavet 4 years ago +5

    I came here for a good explanation of the assumptions of multiple regression, and left with statistics wisdom. Plus the long lost Andy Field table which I couldn't for the life of me find in the book. All in all great video

    • @weislearners
      @weislearners  4 years ago

      Thank you very much for your kind comments, Ella. I'm glad that you found the video useful.

  • @ekwosam
    @ekwosam 9 years ago +5

    Thanks for the video especially for the remedial actions included to avoid assumptions violations and summing everything up nicely. Keep up the good work..

  • @weislearners
    @weislearners  9 years ago

    You're welcome.
    Thank you for your kind words.

  • @xxmyohmyxx
    @xxmyohmyxx 1 year ago

    This was so So helpful - having each discussed individually but all in the same place with a clear explanation of what each accomplishes. And the way you have the slide set up (using colored text and boxes) is helpful as well. Thank you so much for posting! I will be viewing many more of your videos.

  • @Chocorett0
    @Chocorett0 4 years ago +1

    this is one of the best videos to check for assumptions. Thank you so much!

  • @katieglover5684
    @katieglover5684 3 months ago

    Thank you for helping me with my dissertation analysis!
    Big help, now I've got to figure out how to run the analysis :D

  • @JackReynoldsMath
    @JackReynoldsMath 8 years ago +3

    One of the best videos on multi regression. Thanks so much. Great job!

  • @nicholejordan3025
    @nicholejordan3025 9 years ago +1

    This is an excellent video!

  • @rafsunmashraky9954
    @rafsunmashraky9954 9 years ago +2

    Excellent Video... Thank You very much...

  • @mildredaviles4829
    @mildredaviles4829 8 years ago +1

    thank you. your video is very helpful.

  • @emmasplantz
    @emmasplantz 2 years ago +1

    This was SO helpful wow. I've been trying to find a video that explains assumption testing clearly and yours is spot on. Thanks so much!

    • @emmasplantz
      @emmasplantz 2 years ago +1

      hey quick question, can I do a multiple regression analysis with one continuous DV and multiple categorical IVs (that have 2 or more categories each) ?

    • @weislearners
      @weislearners  2 years ago

      Thank you for your kind comments. I really appreciate them.

    • @weislearners
      @weislearners  2 years ago +1

      @@emmasplantz yes, you can, but you need to look at using dummy variables.

    • @weislearners
      @weislearners  2 years ago

      @Emma's plants I have a video using dummy variables and qualitative variables that may help. It happens to be in Excel, not SPSS, but if you're familiar with both, you shouldn't have much trouble transferring the ideas to SPSS: ruclips.net/video/y5WN_lz95DE/видео.html

    • @emmasplantz
      @emmasplantz 2 years ago

      @@weislearners thanks a lot !

  • @MoHAbdi-fx4uc
    @MoHAbdi-fx4uc 3 years ago +1

    Very clear explanations, well done and many thanks for the effortless

    • @weislearners
      @weislearners  3 years ago

      Thank you very much for your kind words. There is much room for improvement and your comments encourage me to do so, Mo!

  • @fritzmuller8761
    @fritzmuller8761 2 years ago +1

    Thanks for this great video.

    • @weislearners
      @weislearners  2 years ago

      You're welcome! I appreciate your comments.

  • @adamhussein6880
    @adamhussein6880 7 years ago +1

    Thanks. It is very interesting

  • @johnboscobahungirehe2697
    @johnboscobahungirehe2697 5 years ago +1

    Very impressive and full of knowledge

    • @weislearners
      @weislearners  5 years ago

      Thank you very much for your comments! I hope you found it helpful.

  • @JosePerez-dg1is
    @JosePerez-dg1is 3 years ago +1

    Very well explained, thank you very much.

    • @weislearners
      @weislearners  3 years ago

      You're welcome. Thank you for visiting my page.

  • @saichaithrik7134
    @saichaithrik7134 4 years ago +1

    thank u sir this video helped me a lot. u explained very good and ur slides are so helpful once again thank u sir

    • @weislearners
      @weislearners  4 years ago

      You are most welcome! Thank you for your kind comments!

  • @Guzo360
    @Guzo360 3 years ago

    The best lecture thank you👍👍👍

    • @weislearners
      @weislearners  3 years ago

      I really appreciate your compliment! Have a great rest of the week!

  • @ilias4267
    @ilias4267 9 years ago

    Thank you very much for your amazing video. Sorry for asking, but I cannot find a simple answer to the following problem. I want to check whether 2 correlation coefficients in a multiple regression (1 analysis, 1 sample) are significantly different from each other. Do you know if there is an online formula or some other way to find out?
    Thank you in advance.

  • @HealthbeautyluckyshahBlogspot
    @HealthbeautyluckyshahBlogspot 7 years ago

    I have one question. To run a regression, should all the variables be correlated with the dependent variable? My dependent variable does not have a significant correlation with two of the independent variables; when I run a hierarchical multiple regression and remove these, I get a slightly larger R square than when I include them.

  • @sintafra9833
    @sintafra9833 3 years ago +1

    This is saving my dissertation

    • @weislearners
      @weislearners  3 years ago

      I'm glad it helps. Keep grinding! You'll finish!

    • @sintafra9833
      @sintafra9833 3 years ago +1

      @@weislearners Thank you Im doing my best :D

  • @newtonocharimenyenya2458
    @newtonocharimenyenya2458 3 years ago +1

    A Great Piece

  • @alicepailhes3407
    @alicepailhes3407 6 years ago +2

    Hello, does anyone know how to test for linearity and homoscedasticity in SPSS when you have a binary independent variable?

  • @amyhall2622
    @amyhall2622 7 years ago +16

    I feel like I'm learning stats from Owen Wilson

  • @applegreen3086
    @applegreen3086 4 years ago +1

    Thank Youuuu !

  • @dilargachewtegegn1393
    @dilargachewtegegn1393 7 years ago

    I am very interested in this video; it is the best video. Can I use categorical variables as independent variables in simple linear regression?

    • @weislearners
      @weislearners  7 years ago

      Yes, but unless your categorical variables are binary, you'll have to use dummy coding (re-code the categorical variable into separate binary variables). Here's a pretty good explanation of dummy coding: www.psychstat.missouristate.edu/multibook/mlt08m.html (full disclosure: I have no affiliation with the site).
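      The dummy coding described in the reply above can be sketched in a few lines. This is a minimal illustration in plain Python, not something shown in the video: the helper name `dummy_code`, the example variable `region`, and the choice of reference category are all assumptions made for the example.

```python
def dummy_code(values, reference=None):
    """Recode one categorical column into k-1 binary (0/1) columns.

    The reference category gets no column of its own; a row that belongs
    to it is coded 0 in every dummy column.
    """
    levels = sorted(set(values))
    if reference is None:
        reference = levels[0]  # assumption: default to the first level alphabetically
    if reference not in levels:
        raise ValueError(f"reference {reference!r} is not a level of the variable")
    kept = [level for level in levels if level != reference]
    return {f"is_{level}": [1 if v == level else 0 for v in values] for level in kept}


# Hypothetical categorical predictor with three categories.
region = ["north", "south", "west", "south", "north"]
dummies = dummy_code(region, reference="north")
for name, column in dummies.items():
    print(name, column)
```

      A variable with three categories yields two dummy columns; leaving out the reference category is what keeps the dummies from being perfectly collinear with the intercept.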

  • @husseinel-sayed64
    @husseinel-sayed64 7 years ago +1

    Hi.. Thanks for an informative video. I have a question though. I am studying the relationship between 13 variables (one independent and 12 dependents). Now each variable is measured using 4 or 5 items on a questionnaire. So in total I have 61 indicators or items. How would I go about checking the linearity assumption because the DV consists of 5 items and the IVs have 56 items. Appreciate your feedback. Thanks.

    • @weislearners
      @weislearners  7 years ago +1

      To be clear, I am not a statistician. With that in mind, I believe you would have to resort to hierarchical linear modeling or structural equation modeling, rather than multiple regression. Multiple regression is used for a single dependent variable, not multiple.

  • @mutindafestus5619
    @mutindafestus5619 6 years ago

    This is a good video.
    I also have a specific problem with a simple linear regression
    where my data has plenty of outliers.
    I identified the outliers and high-leverage points and removed them. On rerunning the regression, a new set of outliers appears.
    I tried this three times, and every time I remove outliers a new set emerges. I am not sure how many I should remove.

    • @weislearners
      @weislearners  6 years ago

      To be clear, I am not a statistician - I just teach stats. My expertise is in information systems and healthcare. Now that the disclaimer is out of the way :) - it is not correct statistical method to just remove outliers and other extreme values. You have to have justification for why you are removing them...and wanting your regression to work is not an acceptable reason.
      An example of a reasonable justification is that you find the value of 233 years in a data set of ages. Unless you are including biblical characters, people don't live to 233 years in today's world. So, the 233 could be a typo when someone meant to type 23, 33, 32, etc. Unless you have a way of determining the actual age of the participant, you can remove it.
      Without knowing more about your data set, I suspect you may not have enough observations (data points) in your data set. Each time you remove outliers/extreme values, the "new" data points in the data are sufficiently spread out that you end up with new outliers/extreme values. You may want to run the Sample Size calculator--specifically, "A-priori Sample Size Calculator for Multiple Regression"--at danielsoper.com (disclaimer: I receive nothing to refer you to the site) to confirm that you have an adequate sample size. The site is free and has a lot of helpful statistics tools and information.
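      The age example in the reply above suggests flagging values that are impossible on substantive grounds, rather than deleting whatever looks extreme. A minimal sketch of that idea follows; the function name `flag_implausible` and the 0 to 120 plausibility range are assumptions for illustration, not values from the video.

```python
def flag_implausible(ages, low=0, high=120):
    """Return (index, value) pairs for ages outside a plausible range.

    The bounds are an assumed plausibility range for human ages. Flagged
    values are candidates for checking against the source records; nothing
    is removed automatically.
    """
    return [(i, age) for i, age in enumerate(ages) if not (low <= age <= high)]


# Hypothetical data set of participant ages containing one likely typo.
ages = [23, 41, 233, 35, 58]
flagged = flag_implausible(ages)
print(flagged)
```

      Returning the index alongside the value makes it possible to look the case up and correct it if the true age can be recovered, which matches the advice to remove a value only when there is a documented reason.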

  • @jonathanl5123
    @jonathanl5123 7 years ago

    Thank you for the very helpful video. I am still not clear on whether these assumptions need to be met on the sample data or on the whole population data?

    • @weislearners
      @weislearners  7 years ago

      Thank you, Jonathan.
      The assumptions are for the sample data. If you have the population data, you don't need to make inferences (use inferential statistics), you can just calculate the values you need.

    • @jonathanl5123
      @jonathanl5123 7 years ago

      Sorry. I am new at this and my question wasn't clear. The population data set has 4000 data points, and I am selecting for sample data set 400 data points randomly to build the model and make inferences to the population. My question was are the assumptions tested on the sample data set or population set. Thank you.

    • @weislearners
      @weislearners  7 years ago

      No need to apologize at all.
      If I understand you correctly, you would do the assumptions test on the 400 data points, the sample data.
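      The setup discussed in this thread, drawing 400 cases at random from 4000 and checking assumptions on those 400, might look like the sketch below. The case IDs and the seed are hypothetical; only the 4000 and 400 come from the question.

```python
import random

# Assumption: cases are identified by integer IDs 0..3999.
population_ids = list(range(4000))

# A seeded generator makes the draw reproducible; the seed value is arbitrary.
rng = random.Random(42)

# Simple random sample without replacement.
sample_ids = rng.sample(population_ids, 400)

print(len(sample_ids), len(set(sample_ids)))
```

      The model is then fitted, and its assumptions checked, on the rows selected by `sample_ids` only.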

  • @monicapereira4567
    @monicapereira4567 6 years ago

    Have you done a video on how to do ridge regression in SPSS. I am struggling to find how to do it.

    • @weislearners
      @weislearners  6 years ago

      Hi Monica! No, I haven't. I found this macro that might help, put out by IBM: www.ibm.com/support/knowledgecenter/SSLVMB_20.0.0/com.ibm.spss.statistics.help/synmac_ridgereg.htm
      INCLUDE '[installdir]/Samples/English/Ridge regression.sps'.
      RIDGEREG DEP=varname /ENTER = varlist
      [/START={0**}] [/STOP={1**}] [/INC={0.05**}]
      {value} {value} {value }
      [ /K=value] .
      [installdir] is the installation directory.
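      To give a feel for what the biasing constant k in the syntax above does, here is a small sketch of ridge shrinkage for a single predictor. It is not the SPSS macro's implementation: the data are made up, the function `ridge_slope` is hypothetical, and it uses the simplest textbook case, where the ridge slope for centered data is sum(xy) / (sum(x^2) + k). The grid of k values mirrors the START, STOP, and INC values shown in the syntax chart (0 to 1 in steps of 0.05).

```python
def ridge_slope(x, y, k):
    """Ridge slope for one predictor, computed on mean-centered data.

    Minimizing sum((y - b*x)^2) + k*b^2 gives b = sum(xy) / (sum(x^2) + k).
    With k = 0 this is the ordinary least-squares slope.
    """
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / (sxx + k)


# Made-up example data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

# Grid of k values: 0, 0.05, ..., 1.0.
ks = [round(i * 0.05, 2) for i in range(21)]
slopes = [ridge_slope(x, y, k) for k in ks]

for k, slope in zip(ks, slopes):
    print(f"k={k:.2f}  slope={slope:.4f}")
```

      Printing the slope across the grid is a one-predictor version of a ridge trace: the estimate shrinks toward zero as k grows.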

    • @weislearners
      @weislearners  6 years ago

      Here are instructions about the macros: www.ibm.com/support/knowledgecenter/en/SSLVMB_20.0.0/com.ibm.spss.statistics.help/synmac_caution.htm

  • @subbit1
    @subbit1 1 year ago

    I am confused about how to tell the difference when analyzing the independence and linearity? Both times it is said that the points should be scattered without a clear pattern. Am I misunderstanding something here? What is the exact difference in graphically checking for those two assumptions?

  • @Nima.Mahmoud
    @Nima.Mahmoud 12 hours ago

  • @FekaduM
    @FekaduM 3 years ago +1

    Do you have the same explanation using EXCEL? Thanks

    • @weislearners
      @weislearners  2 years ago

      Although the focus isn't directly on assumptions, I talk about the assumptions in a Correlation video (ruclips.net/video/qmgiMZOerVM/видео.html), and a simple linear regression video (ruclips.net/video/PU5_VR8sSxs/видео.html). Please take a look at those. If they don't suffice, please let me know. I'm in the middle of a move, but I'll see if I can put something together that is more to your liking.
      Thank you for commenting!
      Have a great week!

  • @TexansFan218
    @TexansFan218 3 years ago +2

    Explain like I'm five.

    • @weislearners
      @weislearners  3 years ago

      Do you mean that I need to explain it more clearly, or that I explained it clearly enough, please?
      Thank you for watching my video!