Nice video! What about checking where the high correlation is coming from, comparing each correlated column's correlation with the target column, and dropping only the one with the lower correlation?
We can decide the threshold and see which columns have a high correlation.
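A minimal sketch of that idea, on made-up data (the names X_train, y_train and the threshold are assumptions, not the video's code):

```python
import numpy as np
import pandas as pd

# Made-up data: f2 is almost a copy of f1, f3 is independent
rng = np.random.default_rng(0)
X_train = pd.DataFrame({"f1": rng.normal(size=200)})
X_train["f2"] = X_train["f1"] + rng.normal(scale=0.05, size=200)
X_train["f3"] = rng.normal(size=200)
y_train = 2 * X_train["f1"] + rng.normal(size=200)

threshold = 0.8
corr = X_train.corr().abs()
target_corr = X_train.corrwith(y_train).abs()

to_drop = set()
for i in range(len(corr.columns)):
    for j in range(i):  # lower triangle only, so each pair is seen once
        if corr.iloc[i, j] > threshold:
            a, b = corr.columns[i], corr.columns[j]
            # Drop whichever of the pair correlates less with the target
            to_drop.add(a if target_corr[a] < target_corr[b] else b)

print(to_drop)  # typically {'f2'}: f1 tracks the target slightly better
```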
Hi, thanks for the video!
Wouldn't that remove all highly correlated columns instead of leaving just one column for every relationship?
It will leave one column for every relationship.
@@StatsWire Great video! I don't quite get what you mean here. Isn't the list returning every column that has a high correlation based on the threshold? And then we're proceeding to remove all of those columns. Shouldn't we intentionally keep one instead of removing all of them? How is it automatically keeping one, if that's what you are saying?
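To clarify the confusion above: the video's exact code isn't reproduced here, but the usual version of this function scans only the lower triangle of the correlation matrix and flags just one column of each highly correlated pair, so the other column survives. A sketch:

```python
import pandas as pd

def correlation(df: pd.DataFrame, threshold: float) -> set:
    """Return a set of column names to drop, one per highly correlated pair."""
    col_corr = set()
    corr_matrix = df.corr()
    for i in range(len(corr_matrix.columns)):
        for j in range(i):  # j < i: lower triangle, each pair visited once
            if abs(corr_matrix.iloc[i, j]) > threshold:
                # Only column i (the later of the pair) is flagged; column j stays
                col_corr.add(corr_matrix.columns[i])
    return col_corr
```

Because only the later column of each pair goes into the set, dropping everything in the set still leaves one representative per relationship.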
Is this method better than variance inflation factor?
Both of them are good.
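For comparison, a minimal VIF sketch using statsmodels' variance_inflation_factor on made-up data:

```python
import numpy as np
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools.tools import add_constant

rng = np.random.default_rng(1)
X = pd.DataFrame({"a": rng.normal(size=300), "c": rng.normal(size=300)})
X["b"] = X["a"] + rng.normal(scale=0.1, size=300)  # nearly collinear with a

Xc = add_constant(X)  # VIF should be computed with an intercept column present
vif = pd.Series(
    [variance_inflation_factor(Xc.values, i) for i in range(Xc.shape[1])],
    index=Xc.columns,
).drop("const")
print(vif)  # a and b get large VIFs; a common rule of thumb flags VIF > 5-10
```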
What about a scenario where the order of the columns changes? Since we're checking column pairs for correlations above the threshold and removing the first of the two whenever the threshold is met or exceeded, if I change the order of the columns I get a different result. Is that still a correct list of features?
That is completely ok. You can change the order.
@@StatsWire Thanks for the reply. I did change the order and got a different set of features. I built an XGBoost model with both sets of features and got extremely different forecasts and accuracies in the two cases. How do I decide which is correct?
Yes, that is still a correct feature list. You can change column positions, no problem at all.
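One way to decide is to cross-validate both feature sets and keep whichever scores better. A sketch on made-up data (the candidate sets below stand in for the two lists you got):

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import cross_val_score
from xgboost import XGBRegressor  # assumes the xgboost package is installed

rng = np.random.default_rng(2)
X_train = pd.DataFrame(rng.normal(size=(300, 4)), columns=["f1", "f2", "f3", "f4"])
X_train["f2"] = X_train["f1"] + rng.normal(scale=0.05, size=300)  # correlated pair
y_train = X_train["f1"] + X_train["f3"] + rng.normal(size=300)

# The two candidate sets produced by the two column orders
candidates = {"set A": ["f1", "f3", "f4"], "set B": ["f2", "f3", "f4"]}
for name, cols in candidates.items():
    scores = cross_val_score(
        XGBRegressor(n_estimators=100, random_state=0),
        X_train[cols], y_train, cv=5, scoring="neg_mean_absolute_error",
    )
    print(name, round(scores.mean(), 4))  # keep whichever set scores better
```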
I have been following your feature selection videos and have covered forward, backward, exhaustive, variance threshold, chi2, etc. You have not shared the dataset in them. Why don't you share the dataset so we can follow along?
Please find the dataset link: github.com/siddiquiamir/Data
How are diagonal elements handled in the user-defined function correlation(df, threshold)?
I did not get your question.
@@StatsWire At the diagonal elements the value is 1, which is greater than the threshold, so every column would show up in the output.
@@d1pranjal OK, I hope you found the solution yourself.
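For anyone else wondering: if the loop only visits j < i (the strict lower triangle), the diagonal is never checked. If you scan the full matrix instead, mask the diagonal and upper triangle explicitly; a sketch on made-up data:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(3)
df = pd.DataFrame(rng.normal(size=(200, 3)), columns=["x", "y", "z"])
df["y"] = df["x"] + rng.normal(scale=0.1, size=200)

corr = df.corr().abs()
mask = np.triu(np.ones(corr.shape, dtype=bool))  # diagonal + upper triangle
pairs = corr.where(~mask).stack()                # strictly lower triangle only
print(pairs[pairs > 0.8])                        # the 1.0 diagonal never appears
```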
This video was really helpful, thanks a ton.
You're welcome
Thank you for the video! Very well explained, keep it up!
Thank you for your kind words.
Excellent, thank you very much.
I'm glad you liked it. You're welcome
Hi, how do I get this dataset?
Hi, please find the dataset: github.com/siddiquiamir/Feature-Selection
@@StatsWire thank you
@@farahamirah2091 You're welcome!
Nice! Would it be the same if we used PCA to avoid multicollinearity?
Thank you! There would be some minor differences.
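The main difference: correlation-based selection keeps a subset of the original features, while PCA replaces them with new, mutually uncorrelated components. A minimal sketch on made-up data:

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(4)
X = pd.DataFrame(rng.normal(size=(200, 3)), columns=["f1", "f2", "f3"])
X["f2"] = X["f1"] + rng.normal(scale=0.1, size=200)  # f1 and f2 are collinear

# Scale first: PCA is sensitive to feature variances
Z = PCA(n_components=2).fit_transform(StandardScaler().fit_transform(X))
print(np.corrcoef(Z, rowvar=False).round(3))  # off-diagonals ~0: uncorrelated
```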
How do I find collinearity for categorical features?
You can use the chi-square test.
@@StatsWire Thanks, brother. Suppose I have done a chi2 test between the independent variables and the dependent variable, and then I got the F and p values. How can I select features based on those F and p values? Will you please clarify this, brother?
@@maskman9630 Select the variables whose p-values are smaller compared to the other variables.
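A minimal sketch with scikit-learn, using the built-in iris data as a stand-in (sklearn's chi2 returns the chi-square statistics and p-values, and it requires non-negative features):

```python
import pandas as pd
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, chi2

X, y = load_iris(return_X_y=True, as_frame=True)  # all features non-negative
scores, pvalues = chi2(X, y)
report = pd.DataFrame({"chi2": scores, "p_value": pvalues}, index=X.columns)
print(report.sort_values("p_value"))  # smaller p-value = stronger dependence on y

selected = SelectKBest(chi2, k=2).fit(X, y)       # keep the 2 best features
print(list(X.columns[selected.get_support()]))
```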
Very nice
Thank you
When I enter the code line corrmatrix = X_train.corr(), it gives the error: AttributeError: 'numpy.ndarray' object has no attribute 'corr'.
You need to make sure your data is in the correct format: .corr() is a pandas DataFrame method, and that error means your X_train is a NumPy array.
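Concretely, wrap the array in a DataFrame first (the column names below are made up), or use np.corrcoef on the raw array:

```python
import numpy as np
import pandas as pd

X_train = np.random.rand(100, 4)             # a plain ndarray has no .corr()
X_train = pd.DataFrame(X_train, columns=["f1", "f2", "f3", "f4"])
corrmatrix = X_train.corr()                  # works on a DataFrame
print(corrmatrix.round(3))
# Equivalent for a raw array: np.corrcoef(arr, rowvar=False)
```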
This wasn't helpful at all. You just picked one of the correlated variables at random, without any additional criteria. Anyway, a correlation matrix can't do much; it's much more reliable to use VIF or hierarchical clustering for feature selection.
Hi, this is for demonstration purposes. You can dive deeper and pick the variables based on your own selection criteria. :)