There are so many videos explaining the assumptions of linear regression, but no one explains how to actually check them. I had been searching for this for the last 3 months. Thank you, sir.
Thank you 🙏
Glad you found it helpful, Gowtham 😊. Please share it with others who could benefit as well.
Hey Aman, I have been your subscriber for the past 1.5 years, and I feel honoured to tell you that after following you I finally made a job transition to senior data scientist at an MNC 6 months back. Now I understand the data science project ecosystem in my company. You are one of the contributors to my success.
Thanks a ton!!!!!
Also, I would like to extend a hand to help learners. Learners, you can tag me with any doubts. I would be more than happy to help you.
Thanks Nikhil, your comments are precious.
@UnfoldDataScience Thank you Aman!!
Join here for free to share your experience live with me
www.lighthall.co/class/6fdae050-85a4-48c4-9ecc-8dd2fa6ff175
Among all YouTubers, I would rank this guy #1 for insightfulness. What a gift for teaching!!!
Thanks a lot. Please share the channel with friends.
You're the best bro! I understood this explanation of yours more than I understood anybody else's. I've also saved a copy of the notebook (from Google Drive) and imported it into my DataSpell IDE so I can easily refer to it whenever I want to check assumptions - I've heard that memorising syntax is not important as long as one understands the logic :). Much love from Nigeria. Subscribed and liked. Cheers!
Thanks, Goriola. Your words mean a lot to me. Keep learning.
Well explained. It was very useful. Please continue uploading lectures.
Thank you Mayank, I will
The title of your channel could be "Unfolding the Untold Data Science". Aman ji, you reach and teach what no one else dares to teach or explain. Amazing job!
Thanks, Shekhar, your comments mean a lot.
When do we need to check these assumptions? After we do the Train Test Split and prediction, or before the start of Train Test Split?
Clean !!!
Thanks Asit
Can you please tell me if we have large data means we have more than 30 columns in our data so linear regression will be good for training that data or not?
Great video! I wanted to ask, in order to find whether there is linearity in the first 4 scatterplots, shouldn't one plot the line of best fit? Also, regardless of whether that's true or not, how would one plot the line of best fit?
"a, b = np.polyfit(X,Y,1)
plt.plot(X,a*X+b)" doesn't seem to do anything, and obviously I'm doing something wrong, I'm just not sure what.
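The snippet above is close to correct. Here is a minimal sketch of one way to draw the best-fit line on a scatterplot, assuming X and Y are 1-D numeric arrays (pandas columns work the same way); the usual reason such code appears to "do nothing" is a missing plt.show():

import numpy as np
import matplotlib.pyplot as plt

# hypothetical data standing in for one predictor and the target
rng = np.random.default_rng(42)
X = rng.uniform(0, 10, size=100)
Y = 3 * X + 5 + rng.normal(scale=4, size=100)

# fit a degree-1 polynomial: slope a, intercept b
a, b = np.polyfit(X, Y, 1)

plt.scatter(X, Y, alpha=0.6, label="data")

# sort X so the fitted line is drawn left to right as one clean segment
xs = np.sort(X)
plt.plot(xs, a * xs + b, color="red", label="best-fit line")

plt.legend()
plt.show()  # without this call, many environments never display the figure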
Very intuitive video.
Thank you 🙂
How can we make a Residuals vs. Leverage plot which displays the Cook's distance?
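One way to get that plot with statsmodels (a sketch on made-up data, not the notebook from the video): influence_plot draws studentized residuals against leverage and scales each point by its Cook's distance.

import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

# hypothetical data: two predictors and a noisy linear target
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
y = X @ np.array([1.5, -2.0]) + rng.normal(size=100)

model = sm.OLS(y, sm.add_constant(X)).fit()

# studentized residuals vs. leverage, point size scaled by Cook's distance
fig = sm.graphics.influence_plot(model, criterion="cooks")
plt.show()

# the raw Cook's distance values are also available directly
cooks_d, _ = model.get_influence().cooks_distance
print(cooks_d[:5])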
You are always excellent, sir.
Thanks for your positive feedback. Please share with others who could benefit from such content as well.
Sir, does it mean that if the relationship between the predictor variable and the target variable isn't linear, then it doesn't follow the normality condition (i.e., it's not normally distributed), and vice versa?
No, it does not mean that, in either direction. It's quite possible that a variable is not normally distributed yet has a linear relationship with the target variable, and vice versa.
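A tiny illustration of that reply with made-up data: a uniformly distributed predictor is clearly not normal, yet its relationship with the target is linear.

import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.uniform(0, 10, size=500)       # predictor is not normally distributed
y = 2 * x + 3 + rng.normal(size=500)   # but it is linearly related to y

print(stats.shapiro(x))      # tiny p-value: normality is rejected for x
print(stats.pearsonr(x, y))  # correlation close to 1: the relationship is linear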
Actually, what should be normal in linear regression? The training data (target variable) or only the residuals?
Both, ideally.
But we only get the residuals, and can check whether they are normally distributed, after we create the OLS regression model, right?
Yet we are saying the normality of residuals should be confirmed before building the regression model.
How are both cases satisfied at the same time?
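One common way to reconcile the two statements is to treat it as an iterative loop: fit a preliminary model first, then check the residuals, and transform and refit if the check fails. A minimal sketch with made-up data:

import numpy as np
import statsmodels.api as sm
from scipy import stats

rng = np.random.default_rng(7)
X = rng.normal(size=(200, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(size=200)

# step 1: fit the OLS model first
model = sm.OLS(y, sm.add_constant(X)).fit()

# step 2: only now do residuals exist, so test them for normality
stat, p_value = stats.shapiro(model.resid)
print(f"Shapiro-Wilk p-value: {p_value:.3f}")

# step 3: if the residuals look non-normal, transform y (e.g. log) or the
# predictors, refit, and run the check again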
Hi Aman, if we have encoded categorical variables into numerical variables by count or frequency encoding, one-hot encoding, or top-categories encoding by any means, do those columns also need to be converted to a Gaussian distribution?
Good question. No, it's not needed, because the Gaussian distribution is primarily defined for continuous variables only.
Hi Aman, can you please advise us on how to keep relationships with different data science managers on LinkedIn long-lasting?
Normal human relationship things, nothing much.
Can you please explain errors and residuals? I am not able to grasp the concept clearly from websites.
The error is the difference between the actual and the predicted value. Strictly speaking, that observed difference from the fitted model is called the residual, while the error is the same difference measured against the true (unobservable) relationship.
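As a tiny concrete example of that with made-up numbers:

import numpy as np

y_actual = np.array([10.0, 12.0, 15.0, 11.0])     # observed target values
y_predicted = np.array([9.5, 12.5, 14.0, 11.5])   # predictions from a fitted model

residuals = y_actual - y_predicted  # one residual per observation
print(residuals)                    # [ 0.5 -0.5  1.  -0.5]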
One question, though not related to this video: at what point do we do the train-test split? BEFORE preprocessing like normalization, imputation, etc., or AFTER preprocessing?
Before; then do the same preprocessing for both sets (fit it on the training data and apply it to both).
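A short sketch of that workflow with scikit-learn (hypothetical data; the key point is that the imputer and scaler are fitted on the training split only and then applied unchanged to the test split):

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler

# hypothetical feature matrix and target
rng = np.random.default_rng(3)
X = rng.normal(size=(100, 4))
y = rng.normal(size=100)

# split FIRST, before any preprocessing
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# fit preprocessing on the training set only
imputer = SimpleImputer(strategy="mean").fit(X_train)
scaler = StandardScaler().fit(imputer.transform(X_train))

# apply the same fitted transformers to both sets (no leakage from test data)
X_train_prep = scaler.transform(imputer.transform(X_train))
X_test_prep = scaler.transform(imputer.transform(X_test))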
@UnfoldDataScience Ok. Thanks for the reply.