Add feature selection to a Pipeline

Use cross_val_score and GridSearchCV on a Pipeline

This Is Why Python Data Classes Are Awesome

Surviving A Week in OUR Demonic School PT 3 (THE POSSESSION)

GUILTY GEAR -STRIVE- Queen Dizzy Theme 『Radiant Dawn』

Use FunctionTransformer to convert functions into transformers

Data School

Просмотров 7 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 27 окт 2024

Комментарии • 29

@dataschool 3 года назад ⁺³
Did you know that the code for all of these tips is on GitHub? Check it out: github.com/justmarkham/scikit-learn-tips
@marcelocruz1785 Год назад ⁺¹
I recently discover your channel, and it's incredible the amount of excellent information you provide!
@dataschool Год назад
Thank you!
@mingqian813 3 года назад ⁺⁴
I like all your well-explained videos! In the future, will you consider guiding a hands-on Kaggle project from beginning to end?
@dataschool 3 года назад
Thanks for your suggestion!
@santiagogonzalezq1954 3 года назад ⁺¹
I love your content because it's very well explained and I can practica my english with your pronuntiation. Cheers!
@dataschool 3 года назад
Thank you! That's awesome to hear!
@roy11883 3 года назад ⁺¹
Cheers to Feature Transformer, thanks for sharing this Kevin
@dataschool 3 года назад
You're welcome!
@kevinozero 2 года назад ⁺¹
Thank you so much, this was a super clear and simple explanation.
@dataschool 2 года назад
Thanks so much for your kind words!
@harshedirisinghe6864 2 года назад
This is an excellent explanation!
@dataschool Год назад
Thank you!
@Dara-lj8rk 3 года назад ⁺¹
Learned something new. Thanks heaps
@dataschool 3 года назад
Great to hear!
@shubhamchoudhary5461 3 года назад ⁺¹
please upload more videos like this ..thanks for this great content !! 🙏
@dataschool 3 года назад
Glad you like it! I will be uploading 2 more tips every week (Tuesdays and Thursdays) until I reach 50 tips. You can find all of them in this playlist: ruclips.net/p/PL5-da3qGB5ID7YYAqireYEew2mWVvgmj6
@AceOnBase1 8 месяцев назад
Hey man, if I have a function that does a bunch of regex operations (.str.extract etc) can I put that into a functiontransformer?
@hemangdhanani9434 2 года назад ⁺¹
thanks for uploading such great videos...
@dataschool 2 года назад
Thank you!
@atulsingh-uy2he 3 года назад
Helpful..!!
@dataschool 3 года назад
Thanks Atul!
@lk2055 2 года назад
how is this different from TransformerMixin?
thanks
@dataschool 2 года назад
FunctionTransformer is simpler to use, but TransformerMixin is more flexible. Hope that helps!
@wadewattts5126 3 года назад
Hi sir can you provide example on when using pandas instead of sklearn leads to data leakage.
@dataschool 3 года назад ⁺¹
Sure! If you do missing value imputation on the whole dataset (before splitting the dataset as part of your model evaluation procedure), data leakage will result.
@wadewattts5126 3 года назад
Thank you sir. Another question if you may. But data leakage you indicated is not because of using pandas instead of sklearn, but because you impute before splitting the data. Can I say that I can use pandas or sklearn for preprocessing as long as I split the data to train test validation split first? Thank you in advance
@dataschool 3 года назад ⁺¹
That's technically true, but it misses the bigger picture. pandas lacks separate fit and transform steps, and so your code will quickly become overly complex if you want to do multiple different transformations within pandas without data leakage. And if there are any transformations you need to do that pandas doesn't offer, it's a pain to combine transformations from pandas with transformations from scikit-learn. Finally, it's completely impractical to do cross-validation (without data leakage) if your transformations are done in pandas (depending on the exact nature of the transformation). And if you can't use cross-validation, you also can't do hyperparameter tuning with GridSearchCV. Thus what you are saying is not technically incorrect, but it also means you are not going to be able to use some of the most important parts of scikit-learn. Hope that helps!
@wadewattts5126 3 года назад ⁺¹
Thank you very much for that very comprehensive explanation, Mr. Kevin. I guess I expected to get away with things by using pandas but that turns out to be inefficient. Time to use the power of sklearn. You do very good content. Appreciate it.

Следующие

Автовоспроизведение

Add feature selection to a Pipeline

Add feature selection to a Pipeline

Use cross_val_score and GridSearchCV on a Pipeline

Use cross_val_score and GridSearchCV on a Pipeline

This Is Why Python Data Classes Are Awesome

This Is Why Python Data Classes Are Awesome

Surviving A Week in OUR Demonic School PT 3 (THE POSSESSION)

Surviving A Week in OUR Demonic School PT 3 (THE POSSESSION)

GUILTY GEAR -STRIVE- Queen Dizzy Theme 『Radiant Dawn』

GUILTY GEAR -STRIVE- Queen Dizzy Theme 『Radiant Dawn』

Joker VS Giorno (Persona VS JoJo's Bizarre Adventure) | DEATH BATTLE!

Joker VS Giorno (Persona VS JoJo's Bizarre Adventure) | DEATH BATTLE!

6.1 Scikit-Learn ColumnTransformer [Applied Machine Learning || Varada Kolhatkar || UBC]

6.1 Scikit-Learn ColumnTransformer [Applied Machine Learning || Varada Kolhatkar || UBC]

Simplify Data Preprocessing with Python's Column Transformer: A Step-by-Step Guide

Simplify Data Preprocessing with Python's Column Transformer: A Step-by-Step Guide

Why Isn't Functional Programming the Norm? - Richard Feldman

Why Isn't Functional Programming the Norm? – Richard Feldman

The hidden beauty of the A* algorithm

The hidden beauty of the A* algorithm

Object Oriented Programming is not what you think it is. This is why.

Object Oriented Programming is not what you think it is. This is why.

Functional programming - A general introduction

Functional programming - A general introduction

Backtracking (Think Like a Programmer)

Backtracking (Think Like a Programmer)

Shuffle your dataset when using cross_val_score

Shuffle your dataset when using cross_val_score

SOUTH KOREA AND NORTH KOREA HOLD THEIR BREATH 🇰🇷 🇰🇵 #countryhumans

SOUTH KOREA AND NORTH KOREA HOLD THEIR BREATH 🇰🇷 🇰🇵 #countryhumans

ХРУСТНУЛА ЧЕЛЮСТЬ! Полный Бой Хамзат Чимаев VS Роберт Уиттакер UFC 308 Chimaev Whittaker full fight

ХРУСТНУЛА ЧЕЛЮСТЬ! Полный Бой Хамзат Чимаев VS Роберт Уиттакер UFC 308 Chimaev Whittaker full fight

ИСТОРИЯ ПРО ШТАНЫ #shorts

ИСТОРИЯ ПРО ШТАНЫ #shorts

ПОСЛЕ БОЯ ХАМЗАТ ЧИМАЕВ - РОБЕРТ УИТТАКЕР СЛУЧИЛОСЬ СТРАШНОЕ/НОВОЕ ПОКОЛЕНИЕ БОЙЦОВ/Звуки ММА

ПОСЛЕ БОЯ ХАМЗАТ ЧИМАЕВ - РОБЕРТ УИТТАКЕР СЛУЧИЛОСЬ СТРАШНОЕ/НОВОЕ ПОКОЛЕНИЕ БОЙЦОВ/Звуки ММА

КОГДА К БАТЕ ПРИШЕЛ ДРУГ😂#shorts

КОГДА К БАТЕ ПРИШЕЛ ДРУГ😂#shorts

Главная суперспособность армейских муравьев и пляжные упогебии

Главная суперспособность армейских муравьев и пляжные упогебии

Хамзат Чимаев КРАСИВО ОТВЕТИЛ НА ПРОВОКАЦИОННЫЙ ВОПРОС #мма

Хамзат Чимаев КРАСИВО ОТВЕТИЛ НА ПРОВОКАЦИОННЫЙ ВОПРОС #мма

как спать в самолете правильно ‼️ #марьяналокель #shorts

как спать в самолете правильно ‼️ #марьяналокель #shorts