Man, you've saved my assignment- not only that but you've helped me *actually* understand what I'm doing, rather than just following along with unexplained instructions like in class practicals!
What a nice surprise! I'm working on a statistics project right now and typed how to check assumptions into Google, clicked on the first result. Didn't think it would be one of my favorite RUclipsrs!
i watched your video twice professor, and it really helped me writing and concluding my final desertion. Really worth watching, easily understandable, my respect for you!
I have returned to this same video about a hundred times, it is awesome! I am wondering if you would be able to make a video showing how to do a ridge regression in SPSS. there are no videos showing how to use this new function!
I found this video easier to follow. Thank you Dr. Grande for the detailed information. It was very useful the explanation on how to conduct the assumptions and especially the values, what to look for and the explanation of it.
Thank you for explaining a difficult issue in such a simple way Dr. Grande. It was very helpful for me. I want to ask that if one of these assumptions is not met (i.e. multicollinearity of the variables) is there any way to correct this condition? Or should I renounce from Lineer Regression and do another analysis?
Awesome video that explains the multiple regression in a clear step by step manner. My only remaining concerns is what if my data deviated from the set assumptions (e.g. mulicolinearilty; normality). The data used seem perfect. What if it wasn't?
Thanks for this informative video! Can I just check, youve mentioned for the normality tests if the p value falls below .005 then it violates normality assumption, however so far at uni ive been told if they fall below 0.05 (not .005) then it would be considered a violation. Which value should we use as a guide?
For people not familiar with linear algebra independent variable is what statisticians use typically on the x axis and y axis is outcome of straight line y=B*x aka dependent variable.
It was interesting when you pointed out the durability of treatments and how symptoms decrease when treatment continues but not so much when treatment stops. This is a good research topic.
Dear Dr.Todd, your video is the very helpful to me, what consequences do I have if my std.residual min is -3.790, thank you so much in advance if you could help me 🙏🏼🙏🏼🙏🏼
Sir I have three variables. 1. Academic burnout 2. MENTAL health 3. Self efficiency How hypothesis and objective is that how Academic burnout impact our mental health and Self efficiency so in this case what our IV and DV and which one should we use linear or multiple regression? Please sir respond 🙏
Thanks Dr.Todd. We read in statistical books that regression residuals have normal distribution. But you check normal distribution of dependent variable. which one is true?
Thank you. What if we find that the dependent variable is not normally distributed. How can we continue? Also can you please refer me to anothet linear regression video of yours, if any, where one of the independent variables is nominal?
I want to thank your for all your videos, I have a query on this one. I have not met the assumption of normality. There is one record that seems to be throwing it out. Should I remove the record?
Thank you for explation. I am very happy that there are videos about spss program and analysis such is multiple regression that help to learn and understand about this important part of analysing results. I have just one question about part correlation, can we interpret it as unique contribution of predictor as procentage when we multiple it with 100 or we first must square it and then multiple it with 100? Thanx for effort, all good, God bless you
Thanks so much for this video. When carrying out my regression, my scatterplot has variables above and below 3 and -3, respectively - the standard residual range is -4.450 to 6.184. My dataset is about n=5500 but there are multiple values outside the -3 to 3 range. When trying to find explanations for this, I came across a video that suggested heteroscedasticity but I'm not sure if it was applicable to my case or not. Are there any tips or help you could provide about this situation? Thank you again for the video
Thank you for your video! Very clear and helpful. I have some questions. I have an independent variable that has a correlation below .3 with the dependent variable, how does that affect interpretation? It's .265. Also, my data is not normally distributed, but I'm using a sample that is reflective of a population and asking about rare behaviours, thereby diversions from normality are likely to represent difference in the population. Would that mean that I am okay to go on and interpret my results despite lack of normality?
Mr. Grande, you mentioned the correlation values as one of the assumptions. But I could not find any reference to the correlation value between dependent and independent variables greater than .30. I would be very glad if you could help with this.
dear sir, thanks for the video. i have a doubt. in the coefficients table, the constant is our B0 and the values written below it (instandardized) are our B1right? This B1 value is then written in Z-score terms in the Standarized beta column. right???
I heard another scholar used the partial correlation as the covariance each predictor explains uniquely when the other predictors are held constant. How that is different from using the part to explain the unique contribution of each predictor?
There are a lot of details in this video. I think it would be very good to follow along if/when I need to perform a multiple linear regression. For now it is a bit much to really follow.
Thank you so much Dr. Todd for your videos. They are very helpful. Please, can a categorical variable eg gender, socioeconomic class be used as one of the predictor variables in multiple linear regression?
Hi Dr. Grande. I have a question. Knowing that when the independent variables are at individual level of analysis and the dependent variable is at group level of analysis the levels of analysis have to match before we perform a multiple regression, what is better: (1) To divide the group score for the dependent variable by the number of individual cases in a group and give each case an individual score for that dependent variable, or (2) To add up the independent variable scores of all members in a group and use that score to conduct the analysis on the group level of analysis. However, when we do (2), the sample size decreases, which creates problems. What do you think?
Dr. Grande, thank you very much for the video. One thing I wanted to ask is whether we are interested in the normality of residuals of the model, and not the raw scores?
Dr. Todd Grande, thank you a lot, this was very helpful. I have a question, can I make a multiple regression analysis with independent variables, which are different on scales, for example, one questionnaire subscales (the subscales mean are about 2-3) and mathematics examination, which mean is 70 percent?
Dear Dr. Grande, would you please update (e.g., Videos of 2021, 2020, 2019, etc.) your lectures (e.g., Stepwise multi regression analysis, etc.)? I wish to add them to my final dissertation if I have your permission.
Hey Dr. Grande! Can you give me a list of references for the range of acceptable values to judge multicollinearity and correlations that you mentioned here? Thanks!
Thank you Dr for your valuable video. My question is if my survey has 3 independent variables and one dependent variable, where each of them is assessed by about four questions in the survey, I.e. the survey has 16 questions. So, how can I do the multiple regression analysis in this case. Could you please assist me. Your prompt response is highly appreciated. Mohamed
This is much easier tan what I was taught to test assumptions and see if you are on the right track! Thanks! A for my real analysis, lol for a moment I thought I wouldn't have to dummy code, lol silly me ;-)
i am wondering, when checking the residuals table, the standard residual cannot be lower than -3. what happens if it is? which assumption is violated and how can i solve it ?
hello Dr Todd, I have missing values in my data set so should i click "exclude cases listwise" or should i click "replace with mean". Although I did make the syntax that deals with missing data
Question: what if the normality test would have been statistically significant? if the assumption of normality is not met can you run linear regression?
Dear Mr Grande, I have a question I'm currently testing my data for assumptions and see some people recommend a Durbin-Watson test. Is this necessary and if so which assumption does it check?
Hi Todd, apology as I am new at statistics, I m testing a model where a construct has antecedents and consequences with no mediations. Is it correct that the construct is always the dependant variable and both antecedents and consequences are IV? thanks
Hello, I have a question. If my standard residual range is -3.839 (min) and 2.662 (max), would it be considered as a good range? If it's not, what should I do?
While looking at the range why must the values stay between -3 and 3? What does that tell us other than that it is between those numbers? If the number is outside of that range what does that mean?
If they fall outside of that range, it means that the values were outside of three standard deviations away from the mean, meaning they are outliers in the data.
@@HugoSH93 And how can it be checked whether or not this outlier should be dropped? (Asking for a reply 3 years later is a bit optimistic, I know, but here goes nothing!)
i have two nominal independents variables and one interval dependent variable. I want to analyse if the interaction between the two independent variables effects the dependent. Am i right in saying i should use regression for this? I tried a two way mixed ANOVA firstly but im not sure that's right. thanks!!
To clarify. in my example, one independent is group (ie control or intervention) and the other independent is smoking status (Yes or No). I have two dependent variables which are score pre-intervention and score post-intervention (so repeated measures). I am only interested in the independent variables interaction affecting score post-intervention...
Hi there! I'm running multiple regression using results from 5 different types of scale. I'm comparing result of one scale (dependent) against 4 other scale (independent). I have to run test for normality. I'm confused how to run test normality for situation like this.
Man, you've saved my assignment- not only that but you've helped me *actually* understand what I'm doing, rather than just following along with unexplained instructions like in class practicals!
What a nice surprise! I'm working on a statistics project right now and typed how to check assumptions into Google, clicked on the first result. Didn't think it would be one of my favorite RUclipsrs!
i watched your video twice professor, and it really helped me writing and concluding my final desertion. Really worth watching, easily understandable, my respect for you!
you've saved the day for me, doing my dissertation proposal and much more confident about everything now
This was amazing, thank you so much for not only running the test but also clearly interpreting the results.
Thank you, Dr. Todd Grande. Your lecture really helps me do my dissertation.
This video has been so helpful in many ways. Thank you so much Dr Todd.
Thank you so much. Your old video still serving so many students around the world. Keep up the good work.
> Ishtiaque (Bangladesh)
Stay blessed Dr. Todd Grande for your several lectures on SPSS. I am happy with your efforts Sir.
Dr. Todd Grande is always the best!!!!!!
That is really great to find u here dear Prof...u have enlightened my way towards my hillarious R.P, thk u so much.
Greetings from Kurdistan-Iraq
Thank you, the instructions in the video were clear and helpful. Hope that you will have a good day.
I have returned to this same video about a hundred times, it is awesome! I am wondering if you would be able to make a video showing how to do a ridge regression in SPSS. there are no videos showing how to use this new function!
Stay blessed Dr. Todd Grande for your several lectures on SPSS. I am happy with efforts Sir.
I found this video easier to follow. Thank you Dr. Grande for the detailed information. It was very useful the explanation on how to conduct the assumptions and especially the values, what to look for and the explanation of it.
Thank you for explaining a difficult issue in such a simple way Dr. Grande. It was very helpful for me. I want to ask that if one of these assumptions is not met (i.e. multicollinearity of the variables) is there any way to correct this condition? Or should I renounce from Lineer Regression and do another analysis?
Awesome video that explains the multiple regression in a clear step by step manner. My only remaining concerns is what if my data deviated from the set assumptions (e.g. mulicolinearilty; normality). The data used seem perfect. What if it wasn't?
SOOOOO helpful. thank you for this amazing free learning resource!
it is very interesting explanation how to test regression assumption test. stay blessed
Thanks for this informative video! Can I just check, youve mentioned for the normality tests if the p value falls below .005 then it violates normality assumption, however so far at uni ive been told if they fall below 0.05 (not .005) then it would be considered a violation. Which value should we use as a guide?
Thank you very much for your great online lectures.
For people not familiar with linear algebra independent variable is what statisticians use typically on the x axis and y axis is outcome of straight line y=B*x aka dependent variable.
It was interesting when you pointed out the durability of treatments and how symptoms decrease when treatment continues but not so much when treatment stops. This is a good research topic.
Dear Dr.Todd, your video is the very helpful to me, what consequences do I have if my std.residual min is -3.790, thank you so much in advance if you could help me 🙏🏼🙏🏼🙏🏼
Sir I have three variables.
1. Academic burnout
2. MENTAL health
3. Self efficiency
How hypothesis and objective is that how Academic burnout impact our mental health and Self efficiency so in this case what our IV and DV and which one should we use linear or multiple regression? Please sir respond 🙏
thanks dr grande i hope to learn more from you
I m so thankfull to you. May God bless you Sir.
Hi doc,.. what happens and what should we do (and how) if the dependent variable data is not normal?
I'm also curious about this.
how do we run this if the DV is not normally distributed?
Thank you very much for this detailed explanation!
thanks, you're the real MVP
What do we do if the normality tests are significant?
Thanks Dr.Todd. We read in statistical books that regression residuals have normal distribution. But you check normal distribution of dependent variable. which one is true?
Excellent explanation.
Thank you. What if we find that the dependent variable is not normally distributed. How can we continue? Also can you please refer me to anothet linear regression video of yours, if any, where one of the independent variables is nominal?
Very useful, thanks for creating this video.
I want to thank your for all your videos, I have a query on this one. I have not met the assumption of normality. There is one record that seems to be throwing it out. Should I remove the record?
Thank you for explation. I am very happy that there are videos about spss program and analysis such is multiple regression that help to learn and understand about this important part of analysing results. I have just one question about part correlation, can we interpret it as unique contribution of predictor as procentage when we multiple it with 100 or we first must square it and then multiple it with 100? Thanx for effort, all good, God bless you
Thanks so much for this video. When carrying out my regression, my scatterplot has variables above and below 3 and -3, respectively - the standard residual range is -4.450 to 6.184. My dataset is about n=5500 but there are multiple values outside the -3 to 3 range. When trying to find explanations for this, I came across a video that suggested heteroscedasticity but I'm not sure if it was applicable to my case or not. Are there any tips or help you could provide about this situation? Thank you again for the video
Thank you for your video! Very clear and helpful. I have some questions. I have an independent variable that has a correlation below .3 with the dependent variable, how does that affect interpretation? It's .265. Also, my data is not normally distributed, but I'm using a sample that is reflective of a population and asking about rare behaviours, thereby diversions from normality are likely to represent difference in the population. Would that mean that I am okay to go on and interpret my results despite lack of normality?
Mr. Grande, you mentioned the correlation values as one of the assumptions. But I could not find any reference to the correlation value between dependent and independent variables greater than .30. I would be very glad if you could help with this.
This is so helpful. Thank you so much 🙏
Thank you very much! I got a refresher on regression!
dear sir, thanks for the video. i have a doubt. in the coefficients table, the constant is our B0 and the values written below it (instandardized) are our B1right? This B1 value is then written in Z-score terms in the Standarized beta column. right???
What do I gonna do if I don't have a normal distributed in a dependent variable !!
Dr. Todd, thanks for the nice video, can I ask one question, how can we obtain 95%CI of β ?
I heard another scholar used the partial correlation as the covariance each predictor explains uniquely when the other predictors are held constant. How that is different from using the part to explain the unique contribution of each predictor?
There are a lot of details in this video. I think it would be very good to follow along if/when I need to perform a multiple linear regression. For now it is a bit much to really follow.
Thank you for uploading it!
Sir, in case of multiple linear regression, normal distribution has to be checked only for dependent variable or entire variables
Thank you so much Dr. Todd for your videos. They are very helpful. Please, can a categorical variable eg gender, socioeconomic class be used as one of the predictor variables in multiple linear regression?
1 years later, sorry. Yes, it can be used.
thanks Dr Grande. Could you please make a video on how categorical variables can be used in the model?
Hi, what are your sources for the assumptions? Thanks in advance! :)
Hello Dr. Grande! This is awesome! Do you have a sample on how to write all of this APA?
Whats the best way to identify the skewed independent variable if you have loads of variables
Useful video. Thanks,
Hi Dr. Grande. I have a question. Knowing that when the independent variables are at individual level of analysis and the dependent variable is at group level of analysis the levels of analysis have to match before we perform a multiple regression, what is better: (1) To divide the group score for the dependent variable by the number of individual cases in a group and give each case an individual score for that dependent variable, or (2) To add up the independent variable scores of all members in a group and use that score to conduct the analysis on the group level of analysis. However, when we do (2), the sample size decreases, which creates problems. What do you think?
Is it important to test normality for independent variables?
Welke assumptie wordt geschonden als de correlatie tussen de onafhankelijke en afhankelijke variabele lager is als 0.3?
Hello Dr. Todd,
Thanks for this lovely video. I would like to do this analysis step by step with you. Where can I find the data set?
Thanks.
Dr. Grande, thank you very much for the video. One thing I wanted to ask is whether we are interested in the normality of residuals of the model, and not the raw scores?
I am also confused about this. I was under the impression that it was just the residuals, not the raw data of the variables themselves?
Do we not check for the Normality of the Residuals instead of our dependent variable itself? Does the DV need to be normally distributed as well?
Dr. Todd Grande, thank you a lot, this was very helpful. I have a question, can I make a multiple regression analysis with independent variables, which are different on scales, for example, one questionnaire subscales (the subscales mean are about 2-3) and mathematics examination, which mean is 70 percent?
Dear Dr. Grande, would you please update (e.g., Videos of 2021, 2020, 2019, etc.) your lectures (e.g., Stepwise multi regression analysis, etc.)? I wish to add them to my final dissertation if I have your permission.
Hey Dr. Grande! Can you give me a list of references for the range of acceptable values to judge multicollinearity and correlations that you mentioned here? Thanks!
Mumtaz ! = Excellent !
What is the difference between unstandardized and standardized B coefficient ?
Thank you Dr for your valuable video.
My question is if my survey has 3 independent variables and one dependent variable, where each of them is assessed by about four questions in the survey, I.e. the survey has 16 questions. So, how can I do the multiple regression analysis in this case.
Could you please assist me.
Your prompt response is highly appreciated.
Mohamed
Yes, excellent video, there are many assumptions but I got it, thanks.
What should I do if assumptions are violated, i.e. normality or linearity?
This is much easier tan what I was taught to test assumptions and see if you are on the right track! Thanks! A for my real analysis, lol for a moment I thought I wouldn't have to dummy code, lol silly me ;-)
Helpful review of Coefficients. Thank you.
I was under the impression only the residuals had to be normally distibuted, not necessarily the dependent variable. Is that right?
i am wondering, when checking the residuals table, the standard residual cannot be lower than -3. what happens if it is? which assumption is violated and how can i solve it ?
hello Dr Todd, I have missing values in my data set so should i click "exclude cases listwise" or should i click "replace with mean". Although I did make the syntax that deals with missing data
I have a Kolmogorov-Smirnov significance of .200 as well but a Shapiro-Wilk significance of .003
Excellent
greetings Doc. do we have to ck on normality if i am using a census method considering the sample size is few in no. like #111?
Dr, for the normality assumption, is it only test for dependent variable? Is it ok if the independent variables are not normal distributed?
Very Informative, I learn a lot
I'm glad you found the video useful. Thanks for watching.
Dr. Grande, can you suggest a reference(s) for the bounds on the parameters that you suggest in the video?
Dr. Todd Grande , the Shaprio-Wilk's test showed that p>0.05 in my study, what should I do?
Simple and clear
i need a reference for the idea that a corr. smaller that 0.7 is enough to verify the lack of multicollinearity assumption.
Thank you professor...
How to conduct linear regression analysis when the variables are not normally distributed?
thank you sir really helped a lot
Quick question: if the max std. residual is 4.23 and min is -2.194, but Cook's value is 0-.070, then does that still mean that we have outliers?
I followed the same directions for the Mahalanobis test in your outlier video with multivariate analysis. I found no outliers, there.
If the data was Shapiro Wilk test significant how the further regression will be carried out?
Question: what if the normality test would have been statistically significant? if the assumption of normality is not met can you run linear regression?
Maria did you find the solution?
Dear Mr Grande, I have a question I'm currently testing my data for assumptions and see some people recommend a Durbin-Watson test. Is this necessary and if so which assumption does it check?
How to handle the data if correlation between IV and DV is less than .3. I can't remove my IVs.
Hi Todd, apology as I am new at statistics, I m testing a model where a construct has antecedents and consequences with no mediations. Is it correct that the construct is always the dependant variable and both antecedents and consequences are IV? thanks
Hello, I have a question. If my standard residual range is -3.839 (min) and 2.662 (max), would it be considered as a good range? If it's not, what should I do?
My mean is not around 50... Should it be? I didn't understand this part
Thank you for your knowledge.
While looking at the range why must the values stay between -3 and 3? What does that tell us other than that it is between those numbers? If the number is outside of that range what does that mean?
+Amanda Sutton I am also curious about this.
If they fall outside of that range, it means that the values were outside of three standard deviations away from the mean, meaning they are outliers in the data.
@@HugoSH93 And how can it be checked whether or not this outlier should be dropped? (Asking for a reply 3 years later is a bit optimistic, I know, but here goes nothing!)
@@user-it5qh2gi8b Could you please share the answer if you have found it?
what if we have several questions for each dimension, then how can we proceed?
i have two nominal independents variables and one interval dependent variable. I want to analyse if the interaction between the two independent variables effects the dependent. Am i right in saying i should use regression for this? I tried a two way mixed ANOVA firstly but im not sure that's right. thanks!!
To clarify. in my example, one independent is group (ie control or intervention) and the other independent is smoking status (Yes or No). I have two dependent variables which are score pre-intervention and score post-intervention (so repeated measures). I am only interested in the independent variables interaction affecting score post-intervention...
very good explaination. thank you.
I'm glad you found the video useful. Thanks for watching.
Thanks for this helpful video
Hi there! I'm running multiple regression using results from 5 different types of scale. I'm comparing result of one scale (dependent) against 4 other scale (independent). I have to run test for normality. I'm confused how to run test normality for situation like this.