Awesome Explanation...
I suggest you attach the Jupyter notebook code with your video.
Thank you! 😊 We used to provide notebooks too but stopped due to IP infringement.
Awesome presentation. Kindly also make a presentation on Hybrid Sampling/Ensemble Systems. Thanks
Thank you! We'll keep these suggestions in mind.
Great videos! Thank you for sharing!!!
Glad you like them!
awesome, but what about stratify when splitting?
Thank you! Stratify maintains the same proportion of 0s and 1s in the train and val/test sets as in the overall data, but it won't resolve the class imbalance issue. We can stratify at the time of the split to preserve whatever imbalance we have, and then apply the imbalance treatment only to the train set.
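A minimal sketch of that workflow, assuming a feature matrix `X` and label vector `y` are already defined and that scikit-learn and imbalanced-learn are installed (the variable names here are illustrative, not from the video):

```python
from collections import Counter

from sklearn.model_selection import train_test_split
from imblearn.over_sampling import SMOTE

# Stratified split: train and test keep the same 0/1 ratio as the full data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Imbalance treatment (here SMOTE) is applied to the training set only;
# the test set keeps its original, realistic class distribution.
smote = SMOTE(random_state=42)
X_train_res, y_train_res = smote.fit_resample(X_train, y_train)

print("Train after SMOTE:", Counter(y_train_res))
print("Test (untouched): ", Counter(y_test))
```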
Thanks for the presentation. Can I use SMOTE before splitting the dataset into training and testing sets?
Welcome! Good question. Any imbalance treatment needs to be applied only to the train data, i.e. for training the model; because the test data represents future data, it is not supposed to be treated for imbalance.
@prosmartanalytics I mean when we use oversampling on the whole dataset (before splitting), because when I used it this way I got a good confusion matrix and better metrics (accuracy, recall, F1, precision), and there was no problem of overfitting.
Yes, but there is a leakage problem. The results so obtained won't be considered reliable. Test data is supposed to represent the future. So if we are predicting defaults for a bank where the historical default rate is only 2%, the test data should reflect that value, not 50%. If we use the entire data for imbalance treatment, the data that we will later use as test has already participated in the training process, because we generated our synthetic samples using it too.
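To make that concrete, here is an illustrative sketch (synthetic data and hypothetical names, not the video's code) comparing the two orderings. Oversampling before the split both leaks information from future test rows into training and leaves you evaluating on an artificially balanced test set, so the metrics usually look much better than they would on realistic data:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from imblearn.over_sampling import SMOTE

# Synthetic imbalanced data (~2% positives, similar to a bank-default setting).
X, y = make_classification(
    n_samples=10_000, n_features=20, weights=[0.98, 0.02], random_state=0
)

# Leaky order: oversample first, then split. Synthetic minority points are
# interpolated from the full data, so rows that later land in the test set
# have already influenced the training set.
X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)
Xtr, Xte, ytr, yte = train_test_split(X_res, y_res, test_size=0.2, random_state=0)
leaky = RandomForestClassifier(random_state=0).fit(Xtr, ytr)
print("Leaky F1 (balanced 50/50 test set):", f1_score(yte, leaky.predict(Xte)))

# Correct order: stratified split first, oversample the train set only.
Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.2, stratify=y, random_state=0)
Xtr_res, ytr_res = SMOTE(random_state=0).fit_resample(Xtr, ytr)
clean = RandomForestClassifier(random_state=0).fit(Xtr_res, ytr_res)
print("Leak-free F1 (realistic ~2% test set):", f1_score(yte, clean.predict(Xte)))
```

The exact numbers will vary by dataset and model, but only the second evaluation reflects how the model would perform on genuinely unseen, imbalanced future data.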