Ajinkya More | Resampling techniques and other strategies

  • Published: 23 Oct 2024

Comments • 15

  • @OmyTrenav • 7 years ago +1

    Great talk. Thanks!

  • @Johnnyboycurtis • 8 years ago

    Great presentation!

  • @WalterReade • 8 years ago

    Thanks for the video; it was excellent and I learned a great deal. I'd suggest, though, that you split out the test data _before_ you apply the under/over sampling algorithm (to the train data only). That would give a much better comparison of the algorithms, showing how they perform on the unmodified test data.

    • @ajinkyamore7090 • 8 years ago +1

      Thanks. The train/test split is the first step (see cell number 2 in the notebook) and none of the under/over sampling methods are applied to the test set. The performance comparison is indeed on the unmodified test data.
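The workflow described in this exchange (split first, then resample only the training set, and evaluate on the untouched test set) can be sketched as follows. This is a minimal pure-Python illustration of random oversampling, not the notebook's actual code, which uses library implementations; all function names here are made up for the example:

```python
import random

def train_test_split(data, test_frac=0.25, seed=0):
    """Shuffle and split (x, y) pairs BEFORE any resampling."""
    rng = random.Random(seed)
    data = data[:]
    rng.shuffle(data)
    cut = int(len(data) * (1 - test_frac))
    return data[:cut], data[cut:]

def random_oversample(train, seed=0):
    """Duplicate minority-class examples until the classes are balanced."""
    rng = random.Random(seed)
    pos = [d for d in train if d[1] == 1]
    neg = [d for d in train if d[1] == 0]
    minority, majority = (pos, neg) if len(pos) < len(neg) else (neg, pos)
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return train + extra

# 90/10 imbalanced toy data: label 1 is the rare class.
data = [(float(i), 0) for i in range(90)] + [(float(i), 1) for i in range(10)]
train, test = train_test_split(data)
balanced = random_oversample(train)

# Only the train set is modified; the test set stays as drawn.
print(len(train), len(test), len(balanced))
```

The key point from the reply above is that `random_oversample` never sees `test`, so the performance comparison reflects the unmodified data distribution.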

    • @WalterReade • 8 years ago

      I just noticed that as I was going through your notebook on GitHub (thanks for uploading!) and was going to edit my comment. Yes, that makes perfect sense. What initially confused me was that the graphs show the decision boundary on the train data (and I was thinking it was the test data).

    • @WalterReade • 8 years ago

      I do like the graphs showing the decision boundary on the train data, since they show how the under/over sampling algorithms modify the data. I forked the notebook and am going to add plots of the decision boundary on the test data as well.

    • @ajinkyamore7090 • 8 years ago +1

      Yes, the idea was to show how the changes in the data distribution affect the decision boundary.

  • @rebiiahmed7836 • 8 years ago +1

    Thank you for your presentation! Could you please upload the code as a notebook file?

    • @ajinkyamore7090 • 8 years ago +2

      Thanks! Here is a link to the notebook github.com/irreducible/PyData-Resampling/blob/master/PyData-Resampling-nb.ipynb and the slides www.slideshare.net/AjinkyaMore3/python-resampling

    • @WalterReade • 8 years ago

      I found it with a bit of digging:
      github.com/irreducible/PyData-Resampling/blob/master/PyData-Resampling-nb.ipynb

    • @WalterReade • 8 years ago

      LOL . . . I should have refreshed the comments before posting my comment. :-)

    • @rebiiahmed7836 • 8 years ago

      Many thanks to you, Mr. Ajinkya More.

  • @EarlWallaceNYC • 8 years ago

    Great video, thanks.
    Where can I get the slides you used?
    (I found your paper on arXiv, but it doesn't have the code)

    • @ajinkyamore7090 • 8 years ago +1

      Thanks! Here is a link to the notebook github.com/irreducible/PyData-Resampling/blob/master/PyData-Resampling-nb.ipynb and the slides www.slideshare.net/AjinkyaMore3/python-resampling

  • @berry4862 • 8 years ago

    Optimizing an arbitrary metric is rather useless for business. In particular, what is the business meaning of optimizing for precision on the normal cases? Something like alarms per month may well be meaningful, but that would be Recall(pos)/Prec(pos).
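For reference, the per-class quantities this comment refers to can be computed directly from raw labels. A minimal sketch (the function name and toy labels are made up for illustration; in the "alarms per month" framing, positive-class recall tracks incidents caught and positive-class precision tracks the alert burden):

```python
def precision_recall(y_true, y_pred, positive=1):
    """Precision and recall for one class, from true and predicted labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0  # of the alarms raised, how many were real
    recall = tp / (tp + fn) if tp + fn else 0.0     # of the real incidents, how many were caught
    return precision, recall

# Toy example: 3 positives, 8 samples total.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
p, r = precision_recall(y_true, y_pred, positive=1)
```

Swapping `positive=0` gives the precision/recall of the normal class that the comment questions the business value of.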