Learning Pandas for Data Analysis? Start Here.
HTML-код
- Опубликовано: 3 июн 2024
- A high paced overview of many of the pandas core functionality. As one of the most popular libraries in all of programming, Pandas is an essential tool for learning data wrangling. Watch this video to get a good foundational understanding of what is possible.
Intro to Jupyter: • Jupyter Notebook Compl...
Kaggle notebook made by a viewer! www.kaggle.com/code/lizhechen...
Download the dataset here: www.kaggle.com/datasets/robik...
Timeline:
00:00 Intro
00:57 Importing Pandas
01:26 Data I/O
01:51 Reading From Files
03:03 Writing to Files
03:36 DataFrame Basics
04:57 DataFrame Summary
06:12 Subsetting Columns
07:19 Select dtypes
07:34 Select as Series vs DataFrame
07:58 .loc and .iloc
09:37 .loc Filter Expressions
10:50 .query
11:31 Summary Statistics
12:10 .agg
12:53 Sumarizing Categoricals
13:43 rank, shift, cumsum
14:44 Rolling methods
15:21 Clip
15:41 Groupby
16:44 New Columns
17:33 Sorting
18:30 Missing Data
19:33 Combining Data
19:55 concat
21:01 Merge DataFrames
21:52 Merge Suffixes
22:34 Bonus
Check out my other videos:
Data Pipelines: Polars vs PySpark vs Pandas: • The BEST library for b...
Polars for Data Science: • Polars: The Next Big P...
Speed up Pandas Dataframes: • This INCREDIBLE trick ...
Avoid These Pandas Mistakes: • 25 Nooby Pandas Coding...
Links to my stuff:
* RUclips: youtube.com/@robmulla?sub_con...
* Discord: / discord
* Twitch: / medallionstallion_
* Twitter: / rob_mulla
* Kaggle: www.kaggle.com/robikscube - Наука
Want to follow along with the same dataset and python environment? Big thanks to someone who made a kaggle notebook with this entire tutorial: www.kaggle.com/code/lizhecheng/pandas-2-0-1-tutorial/notebook
Just fork the notebook and explore the data with pandas!
This is a masterpiece Rob. A condensed pandas course. Wow. Even regular Data Scientist can refresh their mind or discover tips and tricks they are not used to use such as the query methods. And what I like the most, it all fits within 23 minutes. I would love to have such videos for some of the other commons libs.
+1
+1
+1
This 20 min video is equivalent to 2hrs of other youtube videos...masterpiece
Thanks! Tell your friends.
This is truly incredible! It's the finest pandas tutorial available on the internet, offering a remarkable balance of breadth and depth.
This is the best video on pandas I’ve seen so far (and I’ve seen dozens). Thank you so much for keeping your explanations short and up to the point!!! Gonna use the video as my top 1 reference resource when I feel stuck!
Thank you for the videos Rob, your hard work is highly appreciated.
Thanks for this video. Packed with info, but still easy to follow, no small talk… Really appreciate your effort!
Thanks, Rob. That's a great summary of the features. Really useful!
Wonderful channel for beginner data analysts & learned a lot of concepts from you…. Great work man
Great video Rob, I would love to see you explaining Machine Learning and Deep Learning models, from theory to practice using scikit-learn, Keras or Pytorch. You really made things look easy. Can't wait to see another of your awesome videos.
Thank You so much for putting this together Rob, you make it look so easy and it's well explained and very clear. I really appreciate you for sharing this with everyone !
Glad it helped you!
I can tell even before watching this video that's its great!!! You're such a great tutor.
Thank u very much.
I can now officially announce and recommend this video to my friends as one stop pandas tutorial and solution.
Thanks Rob
This is great work!! Thank you very much for putting it out here!!
This video is fantastic. informative, concise and a strong foundation for pandas. Most importantly, it is easy to understand and follow along. Thanks for the video, I'm subscribing!
Really appreciate the feedback. Glad you found it easy to follow. I was a little worried it might be too fast.
@@robmulla I typically take notes when watching videos like this so I am accustomed to pausing. In my opinion it's better when there isn't much filler in between so that it's easy to get to the next point or move back to where you want.
Thank you so much ❣️ I have watched your previous pandas video, but this had everything ❤ it was awesome ❤
I understood everything except for to write csv,
Thank you so much for this amazing video ❤
Very easy to follow along, thank you!
Nice Video Rob. This helped me a lot :)
Thank you for this lesson and all your work. As always, I learn so much from you! Any chance you'd do a video lesson on data cleaning? 🙏
Thanks Rob for sharing the knowledge and experience to data community 😊
🙌
I've been learning Pandas for a couple of years on and off now, and have even used it a little at work, and yet there were still a few things in here I didn't know about. The rolling method in particular is a game changer, I've been manually creating functions to do that and now I can just do it in one line of code (and likely faster than my hacked together functions).
Can you give an example of a rolling method application? I'm curious
@@mark-dy9zomoving averages
Very cool ninja panda style!!! So useful and like a real pro awesome!!!
we are waiting for the next part! I personally wanna see sth on visualization!
Thanks for the feedback. I’ll keep that in mind for the next video.
It is solid tutorial for Data Geeks. Thank you)
Good Intro! Thanks!
One of my favorite teachers
It took me 2 hours and 30 minutes to revise pandas, but it's worth it
Thanks for the content, Rob! it's really excellent! Can you do another video like this but with numpy?
Great tip on renaming the multi index columns!!
Glad it was helpful!
Not enough half way through and I can tell this video is gold.
Thanks for sharing your knowledge
I'm wanting to ask a bit more of a meta question. How much time do you spend outside of work on your skills? How much passion or drive do you have and what are your routines? I work in medical ML and came across your EDA video and wanted to get a successful person's view on how to improve and grow.
Magic Rob! hopefully be like you one day
Thanks for the great Video!
How did you manipulate that folder with bunch of.csv files to put fit all together in the df? And how to deal with irregular datas in a typical case like this?
Have you already done some tutorial explaining and detailing these kind of tasks?
Great video as always ! Would be Nice to have the same one with polars
Thanks for this. Straight to the point. Great!
Do you think Polars is going to be especially disruptive? I’ve been using it a bit and I can’t believe how much faster it is at a lot of things. But pandas is very entrenched (and probably has slightly more friendly syntax).
Hi. I wish I watched this before my last project. Hope you will do an advanced series.
Thanks Rob!
Thank you Rob 😊
Thank you Rob!!!
It was really helpful, but I think you missed a section for converting data types in dataframes, specially for date types. thank you very much for this summary.
Hello Rob great video! I have a question, how do you enable the description of the methods that you use. They are showing on the right when you type in the ‘dot’.
Thanks. With Jupyter you just do shift-tab
Great lesson
Glad you liked it!
Great stuff!
Great as always! Now get to work and make tutorials for seaborn and matplotlib :)
thanks for the video, one request though, can we have the same dataset so we can follow along.
this vid is a gem
Thanks! Glad you liked it.
Are you streaming this evening?
WERY NİCE .. THANKS FOR YOUR EFFORTSS :))
Thanks Rob 😁.
Thanks bro
Hi Rob, Please start some series on Tableau. Regards.
Do you have a panda functions cheat sheet (df functions) available? Thanks. Follower 👍
Perfect!
@robmulla do you know a website or where I can find data cleaning exercises or challenges? I want to practice cleaning different kinds of data, any suggestions will be helpful
Thanks!
Can u tell me where u execute ur code/ How do I get to the same terminal
amazing!!
Hi, i have one silly question. How do you get intellisense i.e. functions menu for each object and for each function, the whole list of available parameters. Which IDE you are using ?
It really helps to focus on use case rather than mugging up the function names and their syntax.
Awesome ❤
Hey Rob! Any resouce to download and handson with parquest file
Thanks Rob 15 min done still 7 to go.
Nice! 🙌
Hi Rob,how to read the details of function in jupyter lab just like 2:22
Hi, does anybody know a website or where I can find data cleaning exercises or challenges? I want to practice cleaning different kinds of data, any suggestions will be helpful
This is great
hi! What plugin do you use to see the details of each function?
Great question! Shift-tab in jupyterlab.
I’m sorry I know this will sound dumb to you guys but how is it listing all option after writing a part of if. Like read_ ( then a whole bunch of different commands like read_csv and so on)? I’m using jupyter lab everyday and haven’t seen that ! Cool
Masterpiece thanks thief!
nice, if would be useful if you could put a link for downloading your dataset so we could play around with your data while you explain, it would be appreciated, for example I would need to see by myself what the difference reindexing does when combining datasets, it is not immediately obvious to me and would require some test and comparisons
The datset is on kaggle. Check out this notebook where someone linked the dataset and included the tutorial code: www.kaggle.com/code/lizhecheng/pandas-2-0-1-tutorial/notebook
Hi @robmulla
In Handling Missing Data chapter, would be nice, if you could provide your insight as the best approach and what is normally recommended to do, if it is fillna or dropna, I know that it could be subjective to the task at hand, but having insight as expert would be nice.
cover EDA for time series data
Hello Rob.
Nice dictionary.
I'm new to Data Science. Type every information on my jupyter lab. And im getting error and not dine. I don't understand this, smh what I'm im doing wrong
1:52 min. how to get that dropdown option
🤗
how to get the data of this video
Its time for you to show us hiw to build a dashboard
Doesn’t appear as tho you really used the power of pandas 2.0 with the backend pyarrow default param and checking for nulls/data types :-(
is this guy AI generated? His jawline is too perfect.
No AI. I’m a real person.
Great refresher, but too fast for tutorial. I suggest breaking it in chuncks.
Helpful overview. Good content. But way too fast. Not everyone has an IQ of 150, Mr Mulla. Slow down..