Complete Python Pandas Data Science Tutorial! (Reading CSV/Excel files, Sorting, Filtering, Groupby)

  • Published: 17 Nov 2024

Comments • 2K

  • @KeithGalli
    @KeithGalli  3 years ago +297

    Hey y'all! I created a second channel with more Python content (including additional Pandas tips & tricks).
    Please consider subscribing 😊
    ruclips.net/user/techtrekbykeithgalli

    • @sam7250ii
      @sam7250ii 3 years ago +2

      You cleverly edited the code between 25:50 and 25:59, changing list(df.columns.values) to list(df.columns) 😉👍
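For readers wondering whether that edit changes anything: as far as I know, both spellings return the same list of column names. A minimal sketch with a made-up one-row DataFrame (the data is an assumption, not from the video):

```python
import pandas as pd

# Hypothetical stand-in for the Pokemon DataFrame
df = pd.DataFrame({"Name": ["Bulbasaur"], "HP": [45]})

cols_a = list(df.columns.values)  # via the underlying array of labels
cols_b = list(df.columns)         # iterating the Index directly

print(cols_a == cols_b)
```

The shorter `list(df.columns)` is usually enough.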

    • @Vribejs
      @Vribejs 3 years ago +1

      I get the error "Cannot mask with non-boolean array containing NA / NaN values" when using df.loc (at 40:49 in the video). Why?
      This is my code: df.loc[df['Our Global Company'].str.contains('Smith', regex=True)]. I imported a different .xlsx table while practising.
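A likely cause of this error is missing values in the text column: `str.contains` returns NaN for them, and `.loc` cannot use NaN as a mask. A minimal sketch, assuming a made-up column with one missing entry; the `na=False` argument treats missing values as non-matches:

```python
import pandas as pd

# Hypothetical data: the second row has no company name
df = pd.DataFrame(
    {"Our Global Company": ["Smith & Co", None, "Acme", "Smithson Ltd"]}
)

# na=False makes missing entries count as "no match", so the mask is purely boolean
mask = df["Our Global Company"].str.contains("Smith", regex=True, na=False)
matches = df.loc[mask]
print(matches)
```

Dropping the missing rows first with `dropna` would be another option, depending on what the data needs.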

    • @yidizhou9899
      @yidizhou9899 3 years ago +5

      @@Vribejs Go google it... you can't expect him to do it for you. He checked the documentation just to give us a good overview of pandas. Google your error; otherwise you will not learn.

    • @chiraggupta1897
      @chiraggupta1897 3 years ago

      I have been working on an Excel workbook with 8 worksheets. I am performing operations on the data and want to place a DataFrame in the 6th sheet, replacing its data. But every time I do, all the other sheets vanish and a single sheet is created with the DataFrame. Please help me append a df to an existing Excel file.

    • @benten5018
      @benten5018 3 years ago +2

      Hey Keith, can you please help me download the CSV file on an Android tablet?
      Sorry for my bad English.

  • @jcspaziano
    @jcspaziano 9 months ago +147

    I know this is 5 years old, but I learned more about using Pandas from this one video than from all the other videos I've watched on the topic combined! Just awesome! Thank you!

    • @KeithGalli
      @KeithGalli  9 months ago +15

      Glad that it is still helpful!!

  • @_Nelyen
    @_Nelyen 1 year ago +186

    This video was super helpful, thank you Keith!
    In case anyone gets to the end of this video: around 48:00, Keith talks about the groupby operator and starts to go over the section "Aggregate Statistics using Groupby (Sum, Mean, Counting)". You might run into errors due to something that changed in Pandas version 2.0.0.
    Instead of writing: df.groupby(["Type 1"]).mean()
    Try writing: df.groupby(["Type 1"]).mean(numeric_only=True)
    As of version 2.0.0, the default value of numeric_only is False, so string columns are no longer dropped automatically, causing errors such as "can not convert strings". Hope this is helpful, have a good one!
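The fix above can be sketched on a tiny stand-in for the Pokemon data. The rows and values here are made up for illustration; only the column names "Type 1" and "Defense" come from the comments in this thread:

```python
import pandas as pd

# Tiny made-up stand-in for the Pokemon data
df = pd.DataFrame({
    "Name": ["Bulbasaur", "Charmander", "Charmeleon", "Squirtle"],
    "Type 1": ["Grass", "Fire", "Fire", "Water"],
    "HP": [45, 39, 58, 44],
    "Defense": [49, 43, 58, 65],
})

# numeric_only=True skips the string column "Name" instead of raising an error
means = df.groupby(["Type 1"]).mean(numeric_only=True)
print(means.sort_values("Defense", ascending=False))
```

An alternative is to select the numeric columns explicitly before aggregating, for example `df.groupby("Type 1")[["HP", "Defense"]].mean()`.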

    • @ljubicabrenjo2794
      @ljubicabrenjo2794 1 year ago +3

      Thank you very much, I ran into the problem, this is really helpful! :)

    • @mayowafele9691
      @mayowafele9691 11 months ago +1

      Thank you

    • @NachtHere
      @NachtHere 10 months ago +1

      Thanks Man .

    • @mrme2120
      @mrme2120 10 months ago

      Thanks Dude

    • @rajdevanshu65
      @rajdevanshu65 10 months ago

      Was facing the same issue, thanks a lot.

  • @piotr5830
    @piotr5830 1 year ago +40

    Hi Keith - not sure you will read this but wanted to sincerely thank you for this tutorial. 3 years ago this was the first Python video I ever watched after graduating in an unrelated subject. Today I'm typing this from a business class lounge at JFK, on my way to London where I just got a job as a quant developer at a hedge fund, building pricing models and infra for trading. I worked hard for this, but if not for your videos I could be in a very different place. Thank you from the bottom of my heart, your work means a lot to many people. Cheers!

  • @nikithroumpari2553
    @nikithroumpari2553 2 years ago +86

    A struggling biologist here thanks you! We mostly deal with big data and it can get a little overwhelming, but you made it a lot easier!

  • @RisingLoaf
    @RisingLoaf 1 year ago +261

    This 1 hour video did more for me than an entire semester of my Data Analysis course... Amazing

  • @not_proton
    @not_proton 4 years ago +58

    Wasted an hour watching a completely useless video on pandas, didn't understand a thing......
    Then found this pure gold of a video, it really helped me a lot. Why didn't I click it earlier............

    • @KeithGalli
      @KeithGalli  4 years ago +32

      lol you had me in the first half 😂

    • @KeithGalli
      @KeithGalli  4 years ago +12

      glad it helped!

    • @not_proton
      @not_proton 4 years ago +3

      @@KeithGalli yeah, really nice job explaining it
      Currently watching the other pandas video (real life problems)

  • @KeithGalli
    @KeithGalli  6 years ago +987

    Video Outline!
    0:45 - Why Pandas?
    1:46 - Installing Pandas
    2:03 - Getting the data used in this video
    3:50 - Loading the data into Pandas (CSVs, Excel, TXTs, etc.)
    8:49 - Reading Data (Getting Rows, Columns, Cells, Headers, etc.)
    13:10 - Iterate through each Row
    14:11 - Getting rows based on a specific condition
    15:47 - High Level description of your data (min, max, mean, std dev, etc.)
    16:24 - Sorting Values (Alphabetically, Numerically)
    18:19 - Making Changes to the DataFrame
    18:56 - Adding a column
    21:22 - Deleting a column
    22:14 - Summing Multiple Columns to Create new Column.
    24:14 - Rearranging columns
    28:06 - Saving our Data (CSV, Excel, TXT, etc.)
    31:47 - Filtering Data (based on multiple conditions)
    35:40 - Reset Index
    37:41 - Regex Filtering (filter based on textual patterns)
    43:08 - Conditional Changes
    47:57 - Aggregate Statistics using Groupby (Sum, Mean, Counting)
    54:53 - Working with large amounts of data (setting chunksize)
    Thanks for watching friends! :)
    Let me know if you have any questions

    • @dtran288
      @dtran288 5 years ago +4

      YES!!! THANK YOU!

    • @shadow2frost325
      @shadow2frost325 5 years ago +7

      Thank you so much for posting this! I have a test in Python soon, so I've been watching this for a review. You explain everything so well and make it easy to follow. I also like how the data was from Pokémon - it makes it more relatable.

    • @dchitan1234
      @dchitan1234 5 years ago +2

      great tutorial

    • @tejasnareshsuvarna7948
      @tejasnareshsuvarna7948 5 years ago +34

      Reference notes to help you while you watch the video.
      docs.google.com/document/d/16qcfjwLp1vV-5VnIOGuDC2vxkHQ534_RzQd2Gihk7x8/edit?usp=sharing

    • @Tropax1
      @Tropax1 5 years ago +2

      Hey dude, love this video by the way, but I have a question: can this data be used for machine learning? I have my exams coming up where I have to find a dataset to make predictions and stuff. Are these Pokemon cards? Do they have labels and features, if you understand what I'm talking about? Any help would be greatly appreciated. Thanks in advance.

  • @faizalimuhammadzoda4731
    @faizalimuhammadzoda4731 2 years ago +48

    There is something to the way Keith teaches that keeps me coming back.
    Besides being a good teacher and using techniques which help people grasp the material quickly and remember it for a long time, he radiates positivity. He is such a positive, energetic person.
    Thanks for sharing your knowledge. May it grow and enable you to bless more people with it.

  • @amiliavachford183
    @amiliavachford183 1 year ago +8

    Thanks for the useful video.
    If anybody has a problem calculating the mean of the data grouped by Type 1, use this:
    df = pd.read_csv('modified.csv')
    df.groupby(['Type 1']).mean(numeric_only=True)
    instead of this:
    df = pd.read_csv('modified.csv')
    df.groupby(['Type 1']).mean()
    That way, it won't include string-type data in the mean and sum functions.

    • @vissokis
      @vissokis 9 months ago

      Thanks, it helped a lot... I can't understand the error when all the values are numeric already.

    • @llamaland1737
      @llamaland1737 7 months ago

      So it got updated now, since you can only perform the method on int or float columns?

  • @hughjazz8416
    @hughjazz8416 3 years ago +63

    I have bought multiple Udemy courses on pandas and this one blows them all out of the water, and it’s free! I’m definitely subbing!

  • @shayonghoshroy7208
    @shayonghoshroy7208 5 years ago +352

    Best Pandas tutorial on YouTube, especially 24:25

  • @prubin18
    @prubin18 3 years ago +9

    Great video! One of the best pandas tutorials I've seen.
    I have one comment though. When you run (at 40:00)
    df.loc[df['Name'].str.contains('Mega')]
    you are actually including Meganium in this filter, even though it is not a Mega Pokemon. So one needs to include a space after Mega, such as:
    df.loc[df['Name'].str.contains('Mega ')]
    One can see that this makes a difference because when you run
    len(df.loc[df['Name'].str.contains('Mega')]) and len(df.loc[df['Name'].str.contains('Mega ')]) to get the number of rows, there are two distinct outputs (49 and 48, respectively).
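The difference between the two patterns can be reproduced on a handful of names. This is a sketch with a made-up list of four names in the style of the dataset, not the real file, so the counts differ from the 49 and 48 reported above:

```python
import pandas as pd

# Hypothetical sample of names; the real dataset has many more rows
names = pd.Series([
    "Meganium",
    "VenusaurMega Venusaur",
    "CharizardMega Charizard X",
    "Pikachu",
])

loose = names[names.str.contains("Mega")]    # also catches "Meganium"
strict = names[names.str.contains("Mega ")]  # requires the trailing space

print(len(loose), len(strict))
```

The trailing space works here only because Mega forms are written as "Mega " followed by the name; a different naming scheme would need a different pattern.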

  • @dicspringdkz8234
    @dicspringdkz8234 2 years ago +25

    Keith
    You are more than a teacher. Your level of simplicity in explaining Python in detail is out of this world. Keep up the good work. Your videos are always my “go to” at any time.
    Again, thanks a lot for using your skills as a blessing to people around the world.

  • @adedokunagunbiade5324
    @adedokunagunbiade5324 2 years ago +4

    I watched the entire video in 30 minutes and learned more than I did with hours of video content. Amazing work.

  • @crtnnn
    @crtnnn 1 year ago +2

    Started my PhD in hydrogeology and am learning Python from scratch. I love your work, keep it up!

  • @pivo6499
    @pivo6499 5 years ago +601

    I can't believe I watched this for free, thank you so much!

    • @johnwiley1221
      @johnwiley1221 4 years ago +3

      This was pretty good. I would also check udemy or r/learnpython for other free resources. Found a 30 hour FREE pandas course there the other day

    • @johnwiley1221
      @johnwiley1221 4 years ago

      www.udemy.com/course/the-ultimate-pandas-bootcamp-advanced-python-data-analysis/?couponCode=FF041817B54B4BC9EB6B

    • @quartercast
      @quartercast 4 years ago +8

      @@johnwiley1221 It's not free now, unfortunately :(

    • @musclemusic123
      @musclemusic123 3 years ago

      ki

    • @shambhav9534
      @shambhav9534 3 years ago +3

      The documentation is also free.

  • @rutzyco
    @rutzyco 4 years ago +107

    Coming from the R environment, I must say this is an excellent tutorial to learn about Pandas. I'm very happy to learn that the tools I use in R for data management can be implemented in a similar way in Python. Thanks for taking the time to put this together! Great job.

    • @konata_fan
      @konata_fan 2 years ago +1

      Same here

    • @bretfolger631
      @bretfolger631 2 years ago +1

      I agree - coming to Python from RStudio and after looking at videos all day this is definitely the most helpful and intuitive video!

    • @ratansharma8026
      @ratansharma8026 1 year ago +1

      Sometimes the syntax can get confusing between Python and R, right? If you use both.

    • @manan-543
      @manan-543 1 year ago

      Can someone tell me why R is so encouraged in data science/analysis circles when Python can do everything and more, and is so intuitive?

    • @rutzyco
      @rutzyco 1 year ago +1

      @@manan-543 I think Python is far more general and overall can do a lot more, but in my field, packages associated with statistical models are far more abundant in R than in Python. For example, I'm not sure Python comes even close to R for the implementation of Bayesian hierarchical models, GLMMs, GAMMs, etc. Also, methods papers often publish packages in R, so it seems to remain the default for statistics. Until the statisticians start switching in large numbers I'm not sure this is gonna change anytime soon; and when it does, it probably will be Julia, not Python.

  • @Orion3000k
    @Orion3000k 4 years ago +61

    Mannnn you're one of the best Python go-tos PERIOD. Straight to the point and easy to understand. Thanks for teaching us all!

  • @remy0705
    @remy0705 10 months ago +2

    This 1 hour course is all I need for my data analysis course. This is the best video I found on YouTube. Thanks ❤️❤️❤️

  • @garthhorne617
    @garthhorne617 3 years ago +1

    I have been learning Python and using pandas for about 3 months now and have done innumerable searches on the internet with questions regarding the use of specific statements and coding. I wish I had come across your video earlier! You are a born teacher and know how to lay out and explain complex terms and concepts. How can someone who looks so young have such a strong grasp of presentation and user needs? The concepts you explain are the same things I have sought information on for 3 months, but all in one place and succinctly explained. Thank you for all your work.

  • @LureUnitFtw
    @LureUnitFtw 5 years ago +92

    One of the best tutorials that I've ever seen on YouTube! Thumbs UP!

  • @AndrewMann205
    @AndrewMann205 5 years ago +5

    Between jobs for the first time in decades I wanted to learn data science using software other than just Excel and Access. Your video was well explained and frankly better than anything else I have seen so far involving Python and Pandas. Thank you for a job well done.

  • @bentrash7885
    @bentrash7885 4 years ago +26

    Awesome tutorial! One piece of advice I'd have for any Python developer is to get into the practice of working within virtual environments. It really helps to avoid conflicts when one project requires older versions of a library while your other projects require the latest ones, stuff like that.

  • @gustinelimurilo
    @gustinelimurilo 4 years ago +22

    53:30 you can use .size() to get the count of each Pokemon type instead of adding a new column.
    It would look like this:
    df.groupby(['Type 1']).size()
    Great tutorial!!
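A minimal sketch of the `.size()` suggestion, using a made-up four-row DataFrame (the rows are an assumption; only the "Type 1" column name comes from the thread):

```python
import pandas as pd

# Hypothetical sample rows
df = pd.DataFrame({
    "Name": ["Bulbasaur", "Charmander", "Charmeleon", "Squirtle"],
    "Type 1": ["Grass", "Fire", "Fire", "Water"],
})

# size() returns the number of rows in each group, with no helper column needed
counts = df.groupby(["Type 1"]).size()
print(counts)
```

The result is a Series indexed by type, so `counts["Fire"]` gives the number of Fire rows.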

  • @MatBat__
    @MatBat__ 3 years ago +2

    Bro, I started a data science internship at the beginning of the year. We use a lot of pandas and you have been saving my life since day 1.
    Thanks again, you are a godsend! Subbed on both channels, cheers!

  • @klauscheang7063
    @klauscheang7063 5 years ago +105

    Excellent!! I like the way you organize the videos on different topics and functions of working with data. Please make more videos on how to do data science in Python, e.g. statistical analysis (descriptive statistics, t-test, linear regression) or a data processing tutorial (like what we do in SQL).

  • @nimaonta1725
    @nimaonta1725 4 years ago +49

    Dude, you deserve all the subs for this video alone. You explained everything so well. Keep it up :)

  • @RockIT1
    @RockIT1 4 years ago +23

    I like the way he interacts with his viewers

  • @kanstantsinhupalau6337
    @kanstantsinhupalau6337 2 years ago +1

    Saved my day! I started learning Pandas, but then missed several months due to circumstances, and this video about the basics helped me make a quick comeback. Thank you!

  • @MichaelPeterDalsgaard
    @MichaelPeterDalsgaard 4 years ago +2

    I swear this is the most useful Python channel on YouTube. Top stuff.

  • @disagio9517
    @disagio9517 3 years ago +5

    I came for the tutorial, stayed for the cutesy pokemon stuff, really warmed my heart

  • @DavidWhitt
    @DavidWhitt 5 years ago +7

    Dude... you should make more videos... you are a natural born teacher!!

  • @jiangxu3895
    @jiangxu3895 5 years ago +6

    I just went through your numpy tutorial. And that's the reason I come here. Thumb up!

  • @piggeh6465
    @piggeh6465 2 years ago

    Will now recommend this video to anyone who is interested in learning pandas! This video is awesome

  • @Aimad_off
    @Aimad_off 1 month ago

    I just finished your NumPy course, sir, and I'm now moving on to pandas. I just want to thank you for your efforts!

  • @kylieying2
    @kylieying2 6 years ago +9

    Thanks for posting! As an MIT student taking a data analysis class, this video was very helpful, more useful than the other tutorials online!!

    • @kipishism
      @kipishism 5 years ago

      Found it very useful too!

    • @kregg34
      @kregg34 5 years ago +4

      "As an MIT student"
      Weird flex but ok

  • @mdhidayat5706
    @mdhidayat5706 3 years ago +3

    Awesome tutorial Keith, I learnt a lot by following your hour long tutorial.
    Created a new notebook instead of using the GIT version as it doesn't show what happens before you commented the code.

  • @MiguelMusic123
    @MiguelMusic123 4 years ago +11

    This video helped me massively! I've been learning through online Python courses with people trying to act and telling unnatural jokes, but your video felt super natural and easy to watch. Many thanks!

  • @bjbmbc
    @bjbmbc 3 years ago +1

    Gold medal bro, I was searching extensively for a good data science resource and reddit just sent me to random coursera/edx courses that used to be free but don't appear to be anymore. Your content is highly organized, extremely concise, and well thought out. There is a reason that only .01% of the votes are downvotes. THANK YOU!

  • @atraps7882
    @atraps7882 4 years ago +1

    Day 1 on my journey to learn data analysis with python, this vid and kaggle's free pandas course is just what i needed to give me more motivation to keep learning.

  • @nutrathriveyoutube7056
    @nutrathriveyoutube7056 5 years ago +29

    This is an amazing tutorial! Please keep publishing like this. very well explained!
    I would love to see about matplotlib, numpy and if you can get inside machine learning

  • @bharathianjeneya2111
    @bharathianjeneya2111 4 years ago +6

    On point Keith. 5 hrs worth training covered in an hour. Made my day.

  • @brandongarza1366
    @brandongarza1366 3 years ago +3

    I haven't started this yet, but based on your previous videos I know this is going to be great. Thanks Keith, you are a great teacher.

  • @RunyCalmera
    @RunyCalmera 3 years ago

    This was great, man. Even in 2020. One suggestion: do not change a cell when you are elaborating on a new feature. Just add a new cell below and elaborate there, because then the Jupyter notebook will show all the variations. If you change the cell, the earlier version won't be saved in the notebook.
    I'm very happy with this tutorial. You break it down easily. I'm new to pandas and python and this has helped me a lot with pandas.

  • @elwinmentaram6031
    @elwinmentaram6031 4 years ago +1

    2 years after this video was posted, I'm here watching and learning Tons of stuff. Thanks man!!!!

    • @narayangautam6955
      @narayangautam6955 4 years ago +1

      Me tooo today i watched it Comedy 😂😂😂

  • @DennisGorshteyn
    @DennisGorshteyn 4 years ago +10

    You break down all the details in a way that I can't believe this is for free. Very high quality stuff. I was up and running with this library in short order

  • @modernafsolutions3233
    @modernafsolutions3233 4 years ago +7

    Wow man! Holy smokes that was such an amazing breakdown. I came into this knowing nothing about Pandas and now I want to get back to work with my personal data! Thank you so so so so much. I’m off to find the documentation!

    • @KeithGalli
      @KeithGalli  4 years ago +2

      Glad you enjoyed! Your comment made my day :)

  • @philippedid
    @philippedid 1 year ago +2

    A big thanks for your work from France . I have learned a lot about Pandas .

  • @jamesdonly518
    @jamesdonly518 5 years ago +19

    Ok I've been learning Pandas for a while now, over many different sources, and this one video has shown me much more helpful little hints and tips than all of the other material I've looked at previously!!! Thannnnnk you! Please do more Pandas stuff as this has been so awesome =]

  • @nikluz3807
    @nikluz3807 4 years ago +11

    this is an excellent tutorial, especially the filtering/conditional changes section. I have always loved how google sheets has built in queries, and I wanted to be able to do a lot of the same things using pandas. This essentially gave me all of the power I needed! thanks!

  • @bijoysaraf650
    @bijoysaraf650 4 years ago +5

    Very simple yet comprehensive tutorial on Pandas. You had my attention throughout. I do use Pandas for data analytics along with numpy. That said I learnt quite a few tips and tricks.
    Thank you for sharing your knowledge. Way to go Keith!
    Liked and subscribed.

  • @duyanh1823
    @duyanh1823 3 years ago

    I am a Pokemon fan who was randomly watching Python Pandas videos for my project and found this. Such a big help. Thanks KEITH!

  • @joashbrijit
    @joashbrijit 3 years ago +2

    You've just got me 30% of my whole assignment. Thanks dude

  • @DrewLevitt
    @DrewLevitt 3 years ago +7

    In the chunksize section, you pick a well-documented bad practice, namely calling pd.concat inside a for loop. As the loop runs repeatedly, this operation becomes more and more expensive (because new_df gets longer and longer). Per the pandas documentation, the better approach is to append each df to a list and then pd.concat the list elements just once, after the for loop.

    • @terabhaininja9
      @terabhaininja9 2 years ago

      Hello, can you please provide with a tutorial for that? Quite new and clueless here.

    • @terabhaininja9
      @terabhaininja9 2 years ago +3

      dataHere = []
      for chunk in pd.read_csv('modified.csv', chunksize=5):
          dataHere.append(chunk)

      newnew = pd.concat(dataHere)
      Does this look right?
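The list-then-concat pattern discussed in this thread can be tried without the video's file. In this sketch an in-memory CSV stands in for 'modified.csv'; its contents and the variable names are assumptions for illustration:

```python
import io
import pandas as pd

# In-memory stand-in for 'modified.csv' (12 made-up rows)
csv_text = "Name,HP\n" + "\n".join(f"mon{i},{i}" for i in range(12))

chunks = []
for chunk in pd.read_csv(io.StringIO(csv_text), chunksize=5):
    # Appending to a list is cheap; no data is copied here
    chunks.append(chunk)

# A single concatenation after the loop instead of one per iteration
new_df = pd.concat(chunks, ignore_index=True)
print(new_df.shape)
```

In practice each chunk would usually be filtered or aggregated before being appended, so that the final concatenation stays small.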

  • @ProdMGD
    @ProdMGD 2 years ago +3

    Great video to get people up and running. It took me two hours to watch, take notes, and test out some examples. I feel like this was time very well spent. Thank you for this.

  • @xnick_uy
    @xnick_uy 3 years ago +12

    27:15 It seems that the dataframe got scrambled up a bit there, most likely from having the cell running multiple times. Even when there was an error message, it appears that either the Total or the Legendary column was moved to the left of HP. Upon running the cell again (with the corrected version?) it calculated a new Total adding the previous values and generating corrupted results.
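One way to avoid this kind of corruption is to sum columns selected by name, so that re-running the cell cannot pull the previous Total (or a moved column) into the sum. A minimal sketch with one made-up row; the stat column names other than HP and Defense are assumptions:

```python
import pandas as pd

# Hypothetical single-row stand-in for the Pokemon data
df = pd.DataFrame(
    {"Name": ["Bulbasaur"], "HP": [45], "Attack": [49], "Defense": [49]}
)

# Name the stat columns explicitly instead of slicing by position
stat_cols = ["HP", "Attack", "Defense"]

# Simulate running the notebook cell several times
for _ in range(3):
    df["Total"] = df[stat_cols].sum(axis=1)

print(df)
```

Because "Total" is never part of `stat_cols`, the result stays the same no matter how often the cell runs.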

  • @8rameshb
    @8rameshb 3 years ago +1

    The best tutorial I have seen so far on data analytics. I now see how python/pandas helps in data analytics. Thank you very much for making and sharing this video.

  • @philipcoppage3592
    @philipcoppage3592 3 years ago +5

    SQL person w/ limited exposure to Python here. This was useful as hell.

  • @bensondube5646
    @bensondube5646 5 years ago +8

    Excellent Tutorial Keith. Very clear, at the right speed and interesting to learn from. This material is very suitable for a self learner. Keep it up.

  • @Diegtz555
    @Diegtz555 3 years ago +10

    Wow, thanks for this tutorial. I'm starting out with Python and took a course on Udemy, but it was confusing. With your explanations many doubts are cleared up. Thanks Keith :)

  • @indraneel6601
    @indraneel6601 1 year ago

    Day 1 : 18:27
    Day 2 : 55:00
    Day 3 : completed
    Thanks for the brief intro to pandas
    just for tracking my progress.

  • @hhbbhvvbjhbbyjj
    @hhbbhvvbjhbbyjj 3 years ago +1

    The best Python tutorials I've seen

  • @orfeaspapaioannou2755
    @orfeaspapaioannou2755 5 years ago +29

    dude this is an amazing introduction to pandas. Really helpful, thanks a lot

  • @viveknayak9899
    @viveknayak9899 4 years ago +7

    Comprehensive, perfectly paced.... Lovely tutorial!

  • @andyn6053
    @andyn6053 5 years ago +7

    WOW! This was just what I have been looking for! Fantastic tutorial! You explained everything very well and clear from start to finish. Best Pandas tutorial on youtube for sure! Thanks man :)

  • @jesseraines870
    @jesseraines870 1 year ago +1

    "STOP TEXTING ME IM MAKING A VIDEO! WHO HAS THE NERVE!?" 😆😆
    Great video bro. Currently in a term of data science with python and struggling hard. This video has been tremendously helpful! Big thank you!!

  • @hautboisjc
    @hautboisjc 3 years ago

    Came here after watching your "Real World Data Science Tasks with Python" video and I didn't expect there are still things I don't know with Pandas. Thank you! Liked and subscribed.

  • @cindyshaw2485
    @cindyshaw2485 4 years ago +6

    Thank you, Keith, for making this super helpful tutorial. You're a great teacher!

  • @RPGMadnessVX
    @RPGMadnessVX 4 years ago +6

    F for "MEGAnium" that got filtered out while being old school Poke 😂
    Great course! Looking forward to learn more from you!

    • @iftrejom
      @iftrejom 4 years ago

      I fixed it by writing "Mega " in the code.

  • @paulblades2325
    @paulblades2325 3 years ago +4

    Thank you so much for your time and effort. This is the best Python tutorial I have watched. Straightforward and well organized. I appreciate the time stamps.

  • @kostas6915
    @kostas6915 2 years ago

    Apart from maybe silencing your phone while doing the tutorial, you are definitely the man for the job!! Great work and VERY helpful!!!

  • @MikeStallings2023
    @MikeStallings2023 2 years ago

    I am a retired software guy, enjoying your videos, thanks for these! You are making me almost want to go back to work again.

  • @bidhanbhattarai8863
    @bidhanbhattarai8863 4 years ago +6

    Makes me want to play the old Emerald games again, wonderful tutorial, keep them coming

  • @jasonaraosfuentes2130
    @jasonaraosfuentes2130 5 years ago +78

    This is an extremely useful tutorial. You explain so well, bro. Thank you very much. Liked and subscribed. Hugs.

  • @cdgxflower2679
    @cdgxflower2679 5 years ago +12

    I've been looking for a good pandas and Python video for quite some time now. I have to say that this is really amazing. You've explained it so well that a beginner like me could easily understand. Great job and thank you. Can't wait for more videos (if possible, matplotlib).

  • @harshitsarda3297
    @harshitsarda3297 2 years ago

    Literally one of the most useful videos on pandas ever

  • @immanuelsuleiman7550
    @immanuelsuleiman7550 3 years ago

    it's been a year since I first saw this video
    pandas has been the best thing to happen to me and this is where it all started
    thank you Keith

    • @KeithGalli
      @KeithGalli  3 years ago

      That's awesome, really happy to hear that this video had an impact on you. You are very welcome :)

    • @immanuelsuleiman7550
      @immanuelsuleiman7550 3 years ago

      @@KeithGalli I'm seriously impressed that you replied to both of my comments
      Take care and stay safe

  • @micsierra806
    @micsierra806 5 years ago +25

    Excellent tutorial; exactly what I was looking for. Liked and subbed. Thank you for sharing your expertise.

  • @takako230
    @takako230 3 years ago +5

    Awesome video Keith! I'm a beginner programmer but your explanation is super clear! Thanks for the videos:)

  • @skyblue021
    @skyblue021 5 years ago +17

    Thank you Keith for this video, absolutely amazing and valuable for many! THANK YOU!

    • @KeithGalli
      @KeithGalli  5 years ago +3

      Glad you found it helpful! :)

  • @martistarti2374
    @martistarti2374 3 years ago

    Omgeeeeee!!!! Thank you so much!!! I've searched sooooo many videos trying to help with the delimiter problem I've had (i didn't know that was the problem) and you're the ONLY one I've found that even mentions it!!! 🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾

  • @sammmyranaway
    @sammmyranaway 5 months ago

    I think taking the Pokémon data set makes this tutorial so much fun. Loved it.

  • @NoName-fi2ow
    @NoName-fi2ow 4 years ago +6

    What the hell, I was thinking about this topic in the afternoon and the video got recommended only a few hours later. And the shocking part is that I hadn't even searched for this topic in many days.

  • @williambaker4915
    @williambaker4915 2 years ago +4

    52:49
    You can also just write the code: df.groupby(['Type 1']).count()['Name']
    That way, you don't have to add the count column.
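One caveat worth knowing with this approach: `count()` only counts non-missing values, while `size()` counts every row. A small sketch with made-up rows, one of which has a missing name:

```python
import pandas as pd

# Hypothetical rows; the second Fire entry has no name
df = pd.DataFrame({
    "Name": ["Charmander", None, "Squirtle"],
    "Type 1": ["Fire", "Fire", "Water"],
})

by_count = df.groupby(["Type 1"]).count()["Name"]  # non-missing names per type
by_size = df.groupby(["Type 1"]).size()            # all rows per type

print(by_count["Fire"], by_size["Fire"])
```

When the chosen column has no missing values, both give the same numbers.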

  • @stephanierodriguez1035
    @stephanierodriguez1035 5 years ago +5

    This was such a great introduction to pandas and on DataFrame. This is exactly what I was looking for.
    Since I hadn't previously downloaded pandas onto my mac, and didn't feel like installing anaconda either, I was running into some troubles installing pandas with just "pip install pandas" so I thought I would include the instructions as to how I did it.
    simply do:
    pip install pandas --user
    If nose and tornado aren’t downloaded do:
    pip install nose --user then pip install tornado --user (nose needs to be installed first)
    then terminal also suggested I add it to my path, so I did:
    sudo nano /etc/paths
    add the path at the end of the file
    do ^X and then Y then hit enter

  • @tototoysentertainment9483
    @tototoysentertainment9483 4 years ago

    i watched more than 10 different videos about pandas, this is the most easy and understandable one. Worth your time!

  • @krebbikrebkreb
    @krebbikrebkreb 3 years ago

    One of the best programming tutorials ever made. Seriously.

  • @SMFahim-vo5zn
    @SMFahim-vo5zn 4 years ago +8

    When I start making money with this knowledge, I'll give you a share!

  • @pemadechen9901
    @pemadechen9901 4 years ago +5

    I loved the fact you used pokemon as data set it was fun learning I could also check
    my knowledge about pokemon hahha Love love

  • @BrandonS-lk2qc
    @BrandonS-lk2qc 3 years ago +3

    I learned so much, thank you. Then at the end...that music tho. I lost it! LOL! Did not see it coming.

  • @gillesderoo2027
    @gillesderoo2027 2 years ago

    You are the GOAT. Your explanations using Pokemon make so much sense.

  • @TomNeedhamNeDrum
    @TomNeedhamNeDrum 9 months ago

    This was genuinely so helpful, thank you! I am mostly through a data science course and have been struggling to figure out actual applications for the information I have learned. This was excellent!

    • @KeithGalli
      @KeithGalli  9 months ago

      Glad you found it helpful!!

  • @mohitjain4943
    @mohitjain4943 6 years ago +6

    finally.. a new video... I was waiting for a Long Time😍😋

  • @jsaylor525
    @jsaylor525 5 years ago +6

    "It's not super important that you know about a DataFrame"? That's one of the main objects in pandas, I'd say it's highly important.

  • @saurabh-patil
    @saurabh-patil 5 years ago +4

    This tutorial helped me a lot. Thank you so much!

  • @Chuukwudi
    @Chuukwudi 3 years ago

    From the bottom of my heart, Thank you very much. May you never lack. May the elements, forces, and the entire Creation align itself for your own good.

  • @JayJay-ki4mi
    @JayJay-ki4mi 2 years ago +1

    Pandas is incredible. The stuff you can do with it is mind blowing. For example I've got product data in a CSV, and prices in another. The prices depend on values in the product name. Pandas can easily do this without writing for loops.

  • @Esoj.
    @Esoj. 7 months ago

    If you're getting an error in the Groupby section try: df.groupby('Type 1').mean(numeric_only = True).sort_values('Defense', ascending=False)
    Added "numeric_only = True" inside of ".mean()" and it worked 👍

  • @nicoledeasis664
    @nicoledeasis664 3 years ago +11

    "stop texting me! I'm making a video!"
    "who has the nerve" hahahahahahha you explained well, thank you.