Exploratory Data Analysis in Pandas | Python Pandas Tutorials

Поделиться
HTML-код
  • Опубликовано: 21 авг 2024

Комментарии • 182

  • @santiagofajardo4949
    @santiagofajardo4949 Год назад +98

    Hello,
    at minute 24:24, I managed to reverse the range of column names using [5:13][::-1]. The expression [::-1] is used to reverse ranges and it is very useful:
    df2 = df.groupby('Continent')[df.columns[5:13][::-1]].mean(numeric_only=True).sort_values(by='2022 Population', ascending=False)
    df2
    Thank you very much, Mr. Alex, for these tutorials.

    • @user-zq6cp7lh3s
      @user-zq6cp7lh3s 7 месяцев назад +1

      Thank You!

    • @renanz21
      @renanz21 7 месяцев назад +4

      Alternatively, start counting columns backwards,
      df2 = df.groupby("Continent")[df.columns[-5:-13:-1]].mean().sort_values(by='2022 Population', ascending=False)
      df2

  • @satrapech6107
    @satrapech6107 11 месяцев назад +31

    the correction of df.corr() is:
    numeric_columns = df.select_dtypes(include=[np.number])
    correlation_matrix = numeric_columns.corr
    correlation_matrix()

    • @pradiptanugraha6841
      @pradiptanugraha6841 10 месяцев назад +1

      Thanks it works. Why df.corr() not working on me ?

    • @rajkumarjadi7061
      @rajkumarjadi7061 9 месяцев назад

      thanks man.

    • @francescab1413
      @francescab1413 9 месяцев назад +43

      df.corr(numeric_only = True)
      worked for me

    • @arrofifahmi7708
      @arrofifahmi7708 7 месяцев назад +2

      @@francescab1413 me too mate! Thanks a lot!

    • @SDMNKhan
      @SDMNKhan 7 месяцев назад

      name 'np' not defined?

  • @pbp7
    @pbp7 Год назад +28

    Man, “Oceania” was so funny 😂, tks for the class!

  • @JW-pu1uk
    @JW-pu1uk Год назад +30

    This is absolutely top tier content. I can't stress this enough to people new, or going into the DA/DS field: you WILL be exploring and cleaning data sets much more than you will be visualizing and building models.
    Thanks for this, Alex!

  • @rafaelmarques5623
    @rafaelmarques5623 8 месяцев назад +9

    Oceania is one of the 7 Continents (North America, South America, Europe, Asia, Africa, Oceania, Antartica). It's basically Australia and the countries (islands) around it.
    Hope that helps!

  • @AlastorGarcia
    @AlastorGarcia Год назад +8

    Thanks Alex! Right now i'm applying to my first DA Job and you have no idea how useful your videos have been for me!!

    • @ermano5586
      @ermano5586 Год назад +1

      Hey? How is it going? Did you succed in applying for the job you want?

  • @kartikgupta370
    @kartikgupta370 8 месяцев назад +6

    We can also write this to save time writing all the column names in the list "df2 = df.groupby('Continent')[df.columns[12:4:-1]].mean(numeric_only=True).sort_values(by='2022 Population', ascending=False)
    "

  • @shankarmidatala2049
    @shankarmidatala2049 Месяц назад +1

    Namaste! I found your tutorials "Simple, Easy to follow, and To the point". Thanks.

  • @frenamakenson9844
    @frenamakenson9844 6 месяцев назад +17

    Hello,
    100000000 thanks for sharing
    For the Corealtion part at 11mn
    df.corr(numeric_only=True) # pass numeric only param to not having error

  • @MaximKazartsev
    @MaximKazartsev Год назад +4

    Alex, thank you for this great video and everything you do!
    In order to avoid manual ordering of the population years, there is a way to use df.columns method, by adding reversed. The whole construction looks like
    df2 = df.groupby('Continent')[list(reversed(df.columns[5:13]))].mean().sort_values(by='2022 Population', ascending=False)
    And it works )

  • @sj1795
    @sj1795 7 месяцев назад +2

    EXCELLENT SUPERB video!! I can't believe it--I'm 6/7 videos away from the end of your FANTASTIC bootcamp series! Wahoo! I learned a lot in this video. :) As for "ending on a low note", hardly Alex lol All your content is uplifting and rewarding! As always, THANK YOU!

  • @pradiptisimkhada292
    @pradiptisimkhada292 Год назад +4

    I just finished all the videos in you bootcamp playlist few hours ago and I'm excited to do this again..

  • @toygar8699
    @toygar8699 8 месяцев назад +21

    For those get error in heatmap:
    import matplotlib.pyplot as plt
    numeric_columns = df.select_dtypes(include=['float'])
    sns.heatmap(numeric_columns.corr(), annot=True)
    plt.rcParams['figure.figsize'] = (20, 7)
    plt.show()

    • @asmitaupadhyay4656
      @asmitaupadhyay4656 5 месяцев назад

      thank you

    • @nointernetnarwhal7615
      @nointernetnarwhal7615 4 месяца назад

      THANK YOU!!!!!! I almost quit for good.

    • @nassrmohamed278
      @nassrmohamed278 4 месяца назад

      i had that error in corr : " could not convert string to float: 'AFG'"
      do you know how to solve this

    • @user-vy8kk9ob3s
      @user-vy8kk9ob3s 4 месяца назад

      thanks a lot toygar

    • @yanpaucon1043
      @yanpaucon1043 3 месяца назад

      @@nassrmohamed278 df.corr(numeric_only=True)

  • @DuckingDuck-th2lt
    @DuckingDuck-th2lt 7 месяцев назад +10

    Hello, Alex!
    Once again, thanks a lot for all your hard work!
    At 13:10 I got an error ValueError: 'box_aspect' and 'fig_aspect' must be positive"
    Solved it by putting the plt.rcParams BEFORE the sns.heatmap
    The other problem was that some functions didn't work until I added the parameter numeric_only = True, e.g., df.corr (numeric_only=True) or .mean(numeric_only = True)
    Hope, it can help someone!

    • @yanpaucon1043
      @yanpaucon1043 3 месяца назад

      Thank you, You are the Best!

  • @staquatica1607
    @staquatica1607 10 месяцев назад +41

    I got some error's (using pycharm) that I solved by using "mumeric_only=True". For instance: df.corr(numeric_only=True) and df.groupby("Continent").mean(numeric_only=True)

    • @mohammedshadaabkhan3228
      @mohammedshadaabkhan3228 9 месяцев назад +6

      Hey use this code instead
      numeric_df = df.select_dtypes(include='number') # Select only numeric columns
      plt.figure(figsize=(20, 7)) # Set the figure size
      sns.heatmap(numeric_df.corr(), annot=True) # Create the heatmap with annotations
      plt.show()

    • @DevanshAsawa
      @DevanshAsawa 7 месяцев назад +1

      helped a ton thanks

    • @haley2486
      @haley2486 7 месяцев назад +1

      Thanks for posting! I had to do SHIFT+TAB on the corr() function to find out how to get only numeric values.

    • @nassrmohamed278
      @nassrmohamed278 4 месяца назад +1

      thaaaaaaaaaaaaaaank youuuuuuuuuuuuuuuuu

  • @DEDE-ix9lg
    @DEDE-ix9lg Год назад +1

    I always enjoy a video from Alex. Making one of the best videos , while some other channels just can be a real headache

  • @kogureyoeh
    @kogureyoeh Год назад +6

    at 24:00
    you can just simply add ".sort_index()" on the "df3 = df2.transpose()", so that we don't have to manually rearrange the columns.
    df3 = df2.transpose().sort_index() worked on my end, hope on your end too.

  • @abhishekchaudhary7913
    @abhishekchaudhary7913 7 месяцев назад +2

    df4=df3.sort_index(ascending=True)
    df4 at 26:11 as alex is sorting manually you sort the year directly by this command

  • @user-yp1ej5ou6b
    @user-yp1ej5ou6b 8 месяцев назад +1

    Hey, just a quick note here, when we're plotting the populations, it's only related to the numeric values compared to the highest populations, in fact (for example) Oceania's population increased in around 2.5 times
    Anyway, thanks for the content, it's amazing

  • @keluargaindo-timordiuk
    @keluargaindo-timordiuk Год назад +5

    For the grouping data I do df2=df.drop(columns=['CCA3','Country','Capital'])
    df3=df2.groupby('Continent').mean(numeric_only=True).sort_values(by="2022 Population",ascending=False)
    df3
    to get to the same output as seen in the video

    • @danielmariobuchberger
      @danielmariobuchberger 10 месяцев назад

      Me too, this should be explained, because Strings can not get easy a mean...to long is most the problem!

    • @bolajiawofuwa8116
      @bolajiawofuwa8116 8 месяцев назад

      THANK YOU!!!!!!

  • @tranguyen4462
    @tranguyen4462 4 месяца назад

    omg I laughed out loud at the "Oceania" part ;)))) Alex is so funny and brutally honest about things he didn't know ;)))

  • @quotesdiary310
    @quotesdiary310 Год назад +2

    Hi Alex
    Thank you so much for your support for freshers in the field of data analytics.

  • @Inc0gnit030
    @Inc0gnit030 Год назад +1

    I really enjoyed this introduction to Pandas! Keep up the good work!

  • @ngwamalfred8151
    @ngwamalfred8151 11 месяцев назад

    Where would l have been without this video .

  • @SoggyBagelz
    @SoggyBagelz Год назад +3

    Lets goo!

  • @kevindeschepper8140
    @kevindeschepper8140 2 месяца назад

    To exclude rank from being display in the numerice data: columns_to_include = df.select_dtypes(include=['number']).columns.difference(['Rank'])

  • @nadarioferguson6276
    @nadarioferguson6276 4 месяца назад

    Thank you so much for this. I really enjoyed it and learned a lot of what I had forgotten a few years ago.

  • @LaMeeLifestyle
    @LaMeeLifestyle Год назад +4

    Thanks for all you do. I’m loving the bootcamp. Just finished excel project. However, please can you make a video on story telling?

  • @Zenitsu-mq7fq
    @Zenitsu-mq7fq 5 месяцев назад

    24:50
    df2 = df.groupby('Continent').mean(numeric_only=True).iloc[:, -5:-13:-1].sort_values(by = '1970 Population', ascending = False)
    df2 = df2.transpose()
    df2.plot()
    This way we don't use the copypasting and changing columns, just use reversed indexes)

  • @Charlay_Charlay
    @Charlay_Charlay 7 месяцев назад

    Thank you for the Pandas class!

  • @aayushitrivedi3481
    @aayushitrivedi3481 Год назад +2

    love your videos alexx ;)

  • @abdulsami6117
    @abdulsami6117 Год назад

    Love from Pakistan Alex, Really Helpful and Enjoyable.
    I also like the OOPS sound you make 😂😂

  • @aishwaryapattnaik3082
    @aishwaryapattnaik3082 Год назад +2

    Thanks a lot for this clear cut explanation. Can you make something similar for NLP projects end to end ?

  • @sarayusemesta6132
    @sarayusemesta6132 2 месяца назад

    26:00
    you can just add this to inverted columns
    df2 = df.groupby('Continent')[df.columns[5:13]].mean(numeric_only=True).sort_values('2022 Population', ascending=False)
    df2_inverted = df2.iloc[:, ::-1]
    df2_inverted

  • @neildelacruz6059
    @neildelacruz6059 11 месяцев назад +1

    Thank you Alex this is very helpful.

  • @OkallTheAnalyst
    @OkallTheAnalyst 5 месяцев назад +1

    Incase you are running into an error at minute 11:12, add numeric_only = True to the corr. i.e df.corr(numeric_only = True).

  • @kevindeschepper8140
    @kevindeschepper8140 2 месяца назад

    another way to select the columns (think of a big data sets where indicing with numbers would be challeging) columns_to_include_2 = df.select_dtypes(include=['number']).filter(like='population').columns

    • @kevindeschepper8140
      @kevindeschepper8140 2 месяца назад

      columns_to_include_2 = df.select_dtypes(include=['number']).filter(like='Population').columns.difference(["World Population Percentage"]):P

  • @user-dx2hx2rd4g
    @user-dx2hx2rd4g Год назад +1

    Thank you for the useful information!

  • @moniquebrasilbaptista1989
    @moniquebrasilbaptista1989 10 месяцев назад

    I am sure I am going to use some of these tips. Thank you!😍❤

  • @vitorribeirosa
    @vitorribeirosa Год назад +1

    Neat...
    Thanks for sharing this content.
    Cheers

  • @jjsansano
    @jjsansano 3 месяца назад

    This is great! Thank you!

  • @enix492
    @enix492 Год назад +2

    Hello Alex. I read a few reviews on your recommended course on Udemy. People are saying that it is a bit outdated especially the last section. Do you think I should still go for it and the non updated part doesn't matter? Love your content and thanks for everything you do here.

    • @AlexTheAnalyst
      @AlexTheAnalyst  Год назад +2

      I haven't taken it in a while - worth listening to more recent comments. Could be outdated?

  • @elfridhasman4181
    @elfridhasman4181 Год назад +1

    Thank you Alex💯🔥

  • @jeffrey6124
    @jeffrey6124 19 дней назад

    Hope you also make a Pyspark series 🤓

  • @quotesdiary310
    @quotesdiary310 Год назад +1

    Thank you so much alex

  • @TheRobinCreations
    @TheRobinCreations 10 месяцев назад

    Thank you so much it was very informative.

  • @anuarroho2561
    @anuarroho2561 25 дней назад +1

    mean(numeric_only=True)

  • @minasghazaryan9344
    @minasghazaryan9344 Год назад +6

    Hi, Alex. First of all thanks for a great video and explanations in it.
    If you could help out with the issue I get running your exact code I would be more than grateful.
    Running the df.corr() line gives me the following error: ValueError: could not convert string to float: 'AFG' .
    Same comes for the heatmap,etc. What could it be here?
    Thanks a lot in advance.

    • @ReneePieschke
      @ReneePieschke Год назад

      Getting the same errors.

    • @11zaad
      @11zaad Год назад +2

      try this ==> df.corr(numeric_only=True)

    • @dustin3320
      @dustin3320 Год назад +13

      Best to use df.corr(numeric_only=True) to get around this

    • @Batira583
      @Batira583 9 месяцев назад

      you saved my life thanks so much @@dustin3320

    • @fede77
      @fede77 8 месяцев назад +2

      df.corr(numeric_only = True)

  • @innocentnduaguba
    @innocentnduaguba 8 месяцев назад +2

    Thank you so much Alex, truly great content you put out there. I have a question please; when I run df.groupby('Continent').mean() and df.corr() I get errors, please what could be the cause and what can I do to remedy it.

    • @sabithsaqlain1367
      @sabithsaqlain1367 8 месяцев назад +1

      use df.corr(numeric_only = True)

    • @sj1795
      @sj1795 7 месяцев назад +1

      @@sabithsaqlain1367 THANK YOU for this!! This was driving me a little nutty. Really appreciate you sharing this. :)

    • @SDMNKhan
      @SDMNKhan 6 месяцев назад

      I could not fix the mean() issue.

    • @chriscurtis95
      @chriscurtis95 3 месяца назад +1

      df.groupby('Continent').mean(numeric_only=True)

    • @user-ml2qj4fm9x
      @user-ml2qj4fm9x Месяц назад

      @@chriscurtis95 🙏 Thank You!

  • @user-fx9eq7zm2v
    @user-fx9eq7zm2v Год назад

    Again, thank you were much!

  • @youssefbekk4453
    @youssefbekk4453 Год назад

    high level , thanks

  • @Chathur732
    @Chathur732 День назад

    at 11:12 the df.corr() does not work now. Instead use:
    df_numeric = df.select_dtypes(include=[float, int])
    correlation_matrix = df_numeric.corr()
    correlation_matrix

  • @sivasagarchakkarai1687
    @sivasagarchakkarai1687 Месяц назад +1

    If "df.corr()" doesn't work for the same data set were using in this Video. And It throughs an error : could not covert string to float: 'AFG'. Like this, Try : df.corr(numeric_only = True)

    • @user-hr5bm4oe8f
      @user-hr5bm4oe8f Месяц назад

      same

    • @user-hr5bm4oe8f
      @user-hr5bm4oe8f Месяц назад

      numeric_columns = df.select_dtypes(include=[np.number])
      correlation_matrix = numeric_columns.corr
      correlation_matrix()

  • @HarshKumar-ws3wv
    @HarshKumar-ws3wv 5 месяцев назад

    Sir, in your opinion : Jupyter vs Pycharm? Which is better for Exploratory Data Analysis ?

  • @rjk537
    @rjk537 Год назад +1

    I'm a law graduate without any experience or qualifications in data analysis whatsoever but i want to get into data analysis. Will i be able to get a job in this field? and if yes then what possible skills and certifications will help me to achieve the same? please give me some tips and insights it would be really helpful!

    • @ermano5586
      @ermano5586 Год назад

      Yes, you can, from skills I would prefer mostly analytical thinking, learn probability and statistics, other high math stuff.
      From certification mr Alex said that Amazon and Tableau certifications, and others will help, but anyways if it's long-term learning certificate, I think it is ok to have it on CV. But the thing that highlites you it is the projects that you have done mostly for your job and I mean not only portfolio projects but another ones to show your uniqueness.

  • @philiprhome3824
    @philiprhome3824 Год назад +1

    as R user, the syntax of pandas is just weird in compare to tidyverse (dplyr and tidyr)

  • @haithammontaser7769
    @haithammontaser7769 Год назад

    Hello Alex. Thanks for the video and content. Is there any video for data per-processing?

  • @iqraasif3783
    @iqraasif3783 9 месяцев назад +1

    Hi, can someone help. When I plot figures that have been grouped, it doesn't show the figure, just says .

    • @user-tm7uw4os1n
      @user-tm7uw4os1n 6 месяцев назад

      21:09 I just figured it out. Simply add another line after the plot, like:
      df2.plot()
      plt.show()

  • @diegomartins7214
    @diegomartins7214 9 месяцев назад

    Thank you!

  • @orlumbuseuw5646
    @orlumbuseuw5646 Год назад +19

    Was there here an adult ignorant of what Oceania is or is this some inner joke in the channel?

    • @octaverius762
      @octaverius762 Год назад +2

      I can't believe this

    • @litoavila.
      @litoavila. Год назад +1

      Also FYI America is just one continent, in case you doubt it

    • @MatthewBreithaupt
      @MatthewBreithaupt 10 месяцев назад +2

      OceanEeeA

    • @MatthewBreithaupt
      @MatthewBreithaupt 10 месяцев назад +1

      FYI Australia is not a *small* island. Oceania doesn't "mean" anything, it's the name of a continent containing the countries listed right in front of you since you already filtered the data 😂😂

  • @gauravpunera3256
    @gauravpunera3256 Год назад +1

    Alex please make video on how to get international remote data analyst job

  • @Marcusram
    @Marcusram Год назад

    we can do df3=df3.iloc[::-1] to solve the problem with the date order

  • @octaverius762
    @octaverius762 Год назад +3

    Alex which continent do you think Australia is in 😮

    • @AlexTheAnalyst
      @AlexTheAnalyst  Год назад

      :D

    • @chefernandez563
      @chefernandez563 Год назад

      Australia is also a continent tho😂 sometimes ppl will also refere to NZ ans Aus as the "Australias" but Oceania includes the other surrounding islands

    • @octaverius762
      @octaverius762 Год назад +1

      @@chefernandez563 Oceania is a continent, Australia is a country. How people often speak is not relevant

    • @dragoneer121
      @dragoneer121 Год назад

      @@octaverius762 Actually it is relevant. Though different countries do have different models and its entirely up to convention. Australia the continent is usually considered the 3 islands of mainland Australia, Tasmania and Papua New Guinea

  • @truthgaming2296
    @truthgaming2296 7 месяцев назад

    its spells 'O-Ce-A-Nia' btw
    btw thank for this guidance SIr Alex :)

  • @adminravi
    @adminravi 10 месяцев назад +1

    Is it ok if I use:
    pd.set_option('display.float_format', '{:.2f}'.format) instead of
    pd.set_option('display.float_format', lambda x: '%.2f' % x)

    • @rohallav
      @rohallav 10 месяцев назад

      or even better you can do lambda x: f"{x:.2f}"

  • @ayoubchouket
    @ayoubchouket 5 месяцев назад

    thank you

  • @r10053506
    @r10053506 2 месяца назад

    why is my program when running corr() is not automatically detecting numbers and runs into an error

  • @arpitmaheshwari122
    @arpitmaheshwari122 8 месяцев назад +1

    hey, can anyone tell if the correlation command is working in vs code?
    I'm getting a value error in this part.
    please share the solution if you have one
    thanks :)

  • @karanvaghela4668
    @karanvaghela4668 Год назад

    Hey alex why we should use python instead of SQl Because SQl is easy

  • @l7932
    @l7932 2 месяца назад

    thanks sir

  • @OazadOMER
    @OazadOMER 11 месяцев назад +1

    Thank you very much Alex I'm shifting from Ph to Data Analyst with your bootcamp I had an issue with plt.show() AttributeError: module 'matplotlib' has no attribute 'show' i's deprecated and I counldn't find something sameller and also my chart not showing numbers 14:10
    Best regards

    • @dishanbhandari
      @dishanbhandari 4 месяца назад +1

      Hi there, did u find the solution to your problem of not showing numbers? I ran into the same problem too.

    • @olaleyeboluwatife949
      @olaleyeboluwatife949 Месяц назад

      @@dishanbhandari hey mate, you found the solution?

  • @rnjesus9950
    @rnjesus9950 7 месяцев назад

    This worked for me where df.corr() did not:
    # Select numeric columns (excluding any non-numeric columns)
    numeric_columns = df.select_dtypes(include=['float64', 'int64'])
    # Calculate the correlation matrix
    correlation_matrix = numeric_columns.corr()
    correlation_matrix

  • @akademy_performance_digital
    @akademy_performance_digital 7 месяцев назад

    great

  • @osiomogieasekome8799
    @osiomogieasekome8799 Год назад

    I couldn't get seaborn to import... I tried online solutions about installation but it didn't work

  • @user-re4ip5ms9w
    @user-re4ip5ms9w 3 месяца назад

    my heatmap is broken its not showing all the values even if I wrote the annot = True anyone have a fix? i tried almost everything when I hit shift+tab

  • @donvious
    @donvious 4 месяца назад

    hi, where is the link for the csv format document?

  • @meredithleonor5035
    @meredithleonor5035 Год назад

    why use anaconda instead of google collab, just curious looking forward in visual tutorial at python and statistics thanks i really need this type of tutorial i am studying cohort analysis and RFM analysis

    • @peaceandlove8862
      @peaceandlove8862 10 месяцев назад

      Oceania is the continent that includes Australian and New Zealand.

  • @ermano5586
    @ermano5586 Год назад

    I have one problem, which is that the table does not display columns starting from "area (km^2)" when we call "df" to view the table, I mean there is no scrollbar for horizontal data, can anyone help for this, please?

    • @ruchirmittal9207
      @ruchirmittal9207 8 месяцев назад +1

      Try another browser. Some browsers doesn't support that feature.

  • @dragoneer121
    @dragoneer121 Год назад +1

    Continents are mostly a social convention. The english spekaing countries tend to use 7, while spanish speaking countries have a 6 continent model where it uses Oceania and combines North and south America.
    Australia is the continent but Oceania is a geopolitical convenience. If it was not included most of the pacific isalnd countries would not be associated with a continent. North and South America are another convenience and Central america is only a region by American standards.
    As an example of how ridiculous it is as a continent, Hawaii would be included if it was independant.

  • @user-zk3be2uh8l
    @user-zk3be2uh8l 2 месяца назад

    corr_matrix = df.select_dtypes(include='number').corr()
    # Then proceed with creating the heatmap
    sns.heatmap(corr_matrix, annot=True)
    plt.rcParams['figure.figsize'] = (20, 7)
    plt.show()
    I have used this code for heatmap but the notebook doesn't populate the heatmap with individual correlation values rather colored tiles only. please anyone can help?

    • @ramanjeetkaur8969
      @ramanjeetkaur8969 2 месяца назад

      pip install --upgrade seaborn matplotlib
      Update seaborn and matplotlib. It worked for me

  • @srijanrawat4014
    @srijanrawat4014 Год назад

    i am having problem in downloading the file , can anyone help me out

  • @dishanbhandari
    @dishanbhandari 4 месяца назад

    My heatmap doesn’t contain the data values inside them as in 14:18 instead it just shows a heatmap with column values as in the top most band. I have written the code just as shown above df.corr(numeric_only=True) as well as that ‘annot’ but still no data values. Pls Anyone help

    • @NyeinHtutSwe
      @NyeinHtutSwe 3 месяца назад

      i am also run into same problem :). I still cant find the solution

    • @jDub997D
      @jDub997D 2 месяца назад +1

      upgrade your seaborn package
      pip install seaborn --upgrade
      restart your kernel and rerun all the boxes

    • @olaleyeboluwatife949
      @olaleyeboluwatife949 Месяц назад

      @@jDub997D 1000 thanks bruv... bless you

  • @chefernandez563
    @chefernandez563 Год назад +1

    Am I the only one who knew Oceania was Australia, New Zealand, Samoa and those places😂😂

  • @taroge5464
    @taroge5464 11 месяцев назад +1

    no explanation.................pd.set_option('display.float_format',lambda x : '%.2f' % x)

  • @naagarhive6581
    @naagarhive6581 6 месяцев назад

    OOPs

  • @Ben-qe8ju
    @Ben-qe8ju Год назад

    O-she-ana

  • @roshandhumal1193
    @roshandhumal1193 Год назад

    Sir Alex.
    I am Roshan Dattaram Dhumal
    I live in India from Mumbai.
    I want to start my career in data analysis but I don't know how to start and I want to know what steps you have to take to become Data analytics.
    I would like to request you to please explain to us and give us some steps. Please sir I will definitely do hard work.

    • @hammadahmed7192
      @hammadahmed7192 Год назад

      try passing numeric only argument. In recent version, default value of this argument has changed to false so it tries to correlate string values as well.
      df.corr(numeric_only = True)

  • @DatabaseAdministration
    @DatabaseAdministration 5 месяцев назад +2

    It's funny american don't know the continent of australia.

  • @alikoohi8265
    @alikoohi8265 Год назад

    informative video thanks.Just found an easier way to reverse order of rows:
    df3 = df2.transpose().loc[::-1] 😉

  • @gogor8017
    @gogor8017 7 месяцев назад

    You said 'Oceania' so many times, now it sounds like meaningless word.

  • @aayushitrivedi3481
    @aayushitrivedi3481 Год назад +2

    first
    pin me

  • @RaihanRisad
    @RaihanRisad 5 месяцев назад

    i couldnt able to do df.corr() because it was saying some columns are not numeric so that case i had to use numeric_df; numeric_df = df[['2022 Population', '2020 Population', '2015 Population', '2010 Population', '2000 Population', '1990 Population', '1980 Population', 'Area (km²)', 'Density (per km²)', 'Growth Rate', 'World Population Percentage' ]]
    numeric_df.corr()

    • @dwbrow3
      @dwbrow3 4 месяца назад +1

      Try df.corr(numeric_only=True)

  • @marypazcuessy3004
    @marypazcuessy3004 3 месяца назад

    Can anyone help me? My heatmap wont load all the numbers, just the Rank row starting at 1
    I used
    df.corr(numeric_only = True)
    sns.heatmap(df.corr(numeric_only = True), annot = True)
    plt.show()

  • @csaracho2009
    @csaracho2009 Год назад

    (Minute 9:30)... So, in the Continent America there are 'two" Continents, "NorthAmerica"and "SouthAmerica"/ Ha Ha Ha, Americans...