Real-World Dataset Cleaning with Python Pandas! (Olympic Athletes Dataset)

Keith Galli

35K views

Published: Jan 1, 2025

Comments • 48

@KeithGalli 8 months ago ⁺²¹
Thank you everyone who tuned in today!!
@rrrprogram8667 5 months ago ⁺²
I really thank God that I found your channel. Thanks for sharing knowledge, and keep uploading!
@Hamsters_Rage 8 months ago ⁺³
29:26 - he starts writing some code
@aishwaryapattnaik3082 8 months ago ⁺¹
Such a great tutorial, Keith. Please keep uploading such high-quality videos on Pandas and many more.
@marcinjagusz2481 8 months ago ⁺²
Thanks Keith! I know it takes some time to prepare and record such stuff, but please upload more Python coding!
@KeithGalli 8 months ago ⁺³
Will try to keep them coming!
@kebincui 3 months ago
Fabulous session. Thanks Keith 👍
@lisitashamatutu1140 4 months ago
Watching from Zambia 🇿🇲
@danprovost8232 8 months ago ⁺¹
Great stream, this was very helpful! Keep up the good work!
@KeithGalli 8 months ago
My man 💪
@zahidmhd 4 months ago
We need more videos like this that work on real-world data.
@AndyJagroom-ur7xh 5 months ago ⁺¹
Can you do an update on the numpy video? Thank you so much for these videos, they helped me a lot ❤
@Kira-vs4np 8 months ago
Just a note: at 1:19:21, format="mixed" isn't really working for me; it fills the date_born column with NaT values. So I tried format="%d %B %Y" instead, and it works.
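A minimal sketch of the explicit-format approach described in the comment above. The sample values are made up for illustration and only assume the "day month-name year" layout that the format string implies. Note that format="mixed" is only recognized by pandas 2.0 and later, which may (this is an assumption, not confirmed in the thread) be why it produced NaT here.

```python
import pandas as pd

# Hypothetical sample values in a "day month-name year" layout.
born = pd.Series(["12 December 1886", "1 April 1969", "not a date", None])

# An explicit format tells pandas exactly how to read each value.
# errors="coerce" turns anything that does not match into NaT instead of raising.
date_born = pd.to_datetime(born, format="%d %B %Y", errors="coerce")
print(date_born)
```

Values that do not fit the format come back as NaT, so counting them with date_born.isna().sum() is a quick way to see how many rows the chosen format failed to parse.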
@AndyJagroom-ur7xh 5 months ago ⁺¹
What's your laptop? Cool videos BTW
@067-ashish7 8 months ago ⁺²
Please upload more videos related to data cleaning
@chenjackson6001 8 months ago ⁺²
Thank you for your hard work
@KeithGalli 8 months ago ⁺¹
You're welcome
@brendanthorne8353 4 months ago
Hi Keith, watching this video and following along. Just wondering: when we got the fillna code from ChatGPT, should we have applied it to our original data frame? Loving the content!
@zahidmhd 4 months ago
Okay, I need a full course on data science
@chillydoog 8 months ago ⁺¹
Hawaiian shirt and Twisted Tea! My man
@KeithGalli 8 months ago ⁺¹
Hawaiian shirt yes, but sorry to disappoint, just a standard sparkling water I'm drinking haha
@chillydoog 8 months ago ⁺¹
@@KeithGalli 😉
@vg5675 7 months ago
Should I always drop the rows containing null values and then perform further analysis?
@rohitsinha1092 7 months ago ⁺¹
Not necessarily, it depends. If you are doing the same kind of cleaning for machine learning, dropping an entire column can lose data that might have helped the ML algorithm recognize patterns, so you can use other methods to handle missing values in that case. I think it's better to handle them separately rather than just drop an entire column, even though dropping is a possible approach for smaller datasets. So it's a case-by-case basis. But as I am analysing this dataset now, I see a few columns with excessively large amounts of null values, so I think it's okay to drop them. Cheers
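The case-by-case approach described above could be sketched like this: measure how empty each column is, drop only the mostly-empty ones, and keep rows with occasional gaps. The data, the column names, and the 0.5 cutoff are all made up for illustration and are not taken from the video.

```python
import numpy as np
import pandas as pd

# Hypothetical data: one mostly-empty column and one with a single gap.
df = pd.DataFrame({
    "name": ["A", "B", "C", "D"],
    "height_cm": [180.0, np.nan, 175.0, 169.0],
    "nickname": [np.nan, np.nan, np.nan, "Dee"],
})

# Fraction of missing values in each column.
null_share = df.isna().mean()

# Drop only the columns that are mostly empty (0.5 is an arbitrary cutoff).
threshold = 0.5
sparse_cols = null_share[null_share > threshold].index
cleaned = df.drop(columns=sparse_cols)

# Rows with a remaining gap are kept; aggregations skip NaN by default.
print(cleaned.columns.tolist())
print(cleaned["height_cm"].mean())
```

Keeping the rows means an athlete with an unknown height still counts in analyses that do not need the height at all.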
@nabuzaidnasr 2 months ago
Thank you
@alphonsinebyukusenge3071 5 months ago
Where can we find the dataset?
@SangNguyen-bu8xd 6 months ago
Amazing, thank you sir
@AnasM24 8 months ago
Thank you man
@KeithGalli 8 months ago
You're welcome!
@Kidpambi 8 months ago
Thanks a lot man
@KeithGalli 8 months ago
You're very welcome!
@hassankhalid5569 5 months ago
HATS OFF TO YOU BRO... BRING SOME REAL-LIFE PROBLEMS AND END-TO-END PROJECTS RELATED TO DATA SCIENCE
@ramarisonandry8571 8 months ago
From Madagascar
@NaveedAhmed-xt4xk 1 month ago
Why are you drinking soda, Keith Galli?
@KeithGalli 1 month ago
It's a sparkling water! No sugar or calories :)
@sebastianalvarez1537 8 months ago
holy fuq
@KeithGalli 8 months ago
😎😎
@rrcr4769 5 months ago
Hi Keith,
This code handles the issue well:

# Split the 'Measurements' column into height_cm and weight_kgs.
# str.extract returns NaN where the pattern does not match,
# so the columns do not need to be created as blanks first.
dfCpy['height_cm'] = dfCpy['Measurements'].str.extract(r'(\d+) cm', expand=False).astype(float)
dfCpy['weight_kgs'] = dfCpy['Measurements'].str.extract(r'(\d+) kg', expand=False).astype(float)
dfCpy
@SAGAR-ox6ks 8 months ago
I asked ChatGPT the questions that you framed, and it showed the same solutions. I could have easily used ChatGPT rather than watching this video: just download the dataset, put some rows of it into ChatGPT along with all the framed questions, and the answers will be the same as in this 2-hour video. It took ChatGPT 5 minutes.
@mohammadsamir2713 8 months ago ⁺⁴
If you're not going to support people's efforts, at least don't disappoint them
@Opoliades 8 months ago ⁺¹
Yeah, but what are you going to do when ChatGPT can’t save you? You didn’t “easily” do the task at hand… you made someone/something else do it. Maybe data analysis isn’t your thing. Perhaps consider being an LLM expert instead 😊
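@youcefbouras-f1s 3 months ago ⁺¹
That's what I used:
# Parse out dates from Born and Died by dropping everything from the word "in" onward.
# The word boundaries keep "in" inside another word from matching, and \s* removes the space before it.
df['Born Date'] = df['Born'].str.replace(r'\s*\bin\b.*', '', regex=True)
df['Death Date'] = df['Died'].str.replace(r'\s*\bin\b.*', '', regex=True)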
@ajp3355 2 months ago
Did you not use .fillna in your df code?
df['weight_kg'] = df['weight_kg'].fillna(df['height_cm'])
@KeithGalli 2 months ago
I didn't want to fillna in this specific dataset given that weights are associated with specific individuals. It didn't seem right to try to automatically populate weights for people based on an average weight or something similar. It's okay to have some NaN values in your datasets.
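To make the trade-off in this exchange concrete, here is a small sketch on made-up data (the two rows are hypothetical; the column names follow the snippet in the question). Passing another Series to fillna copies that Series' values into the gaps row by row, so the line in the question would write an athlete's height into a missing weight.

```python
import numpy as np
import pandas as pd

# Hypothetical two-athlete frame; the second weight is unknown.
df = pd.DataFrame({"height_cm": [183.0, 170.0], "weight_kg": [76.0, np.nan]})

# fillna with another Series fills each gap from the same row of that Series,
# so the missing weight becomes 170.0, which is really a height.
filled = df["weight_kg"].fillna(df["height_cm"])
print(filled.tolist())

# Leaving the NaN in place keeps the gap visible; count() ignores it.
print(df["weight_kg"].count())
```

Leaving the value as NaN keeps the data honest about what is unknown, which matches the reasoning in the reply above.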
@cnliving 3 months ago
Great! The height/weight part is a bit long; there is a simpler solution:
measure_pattern = r'(?:(\d+)\s*cm)?(?:\s*/\s*)?(?:(\d+)\s*kg)?'
df[['height', 'weight']] = df['Measurements'].str.extract(measure_pattern)
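A possible variation on the single-pattern idea above, sketched on made-up sample strings. It uses named groups so that str.extract labels the columns itself, and then converts the result to numbers, since str.extract returns text. The names height_cm and weight_kg are chosen here for illustration and are not from the comment.

```python
import pandas as pd

# Hypothetical values in a "<height> cm / <weight> kg" layout, with gaps.
measurements = pd.Series(["183 cm / 76 kg", "170 cm", "64 kg", None])

# Same pattern as above, with named groups that become the column names.
pattern = r"(?:(?P<height_cm>\d+)\s*cm)?(?:\s*/\s*)?(?:(?P<weight_kg>\d+)\s*kg)?"

# str.extract yields strings (or NaN), so convert each column to numbers.
parsed = measurements.str.extract(pattern).apply(pd.to_numeric)
print(parsed)
```

Because every part of the pattern is optional, rows that contain only a height or only a weight still match, and the missing half stays NaN.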
