Practical Statistics for Data Scientists - Chapter 1 - Exploratory Data Analysis

Поделиться
HTML-код
  • Опубликовано: 21 авг 2024
  • PRACTICE STATS, SQL, AND PYTHON HERE: stratascratch....
    Patreon: / shashankkalanithi
    This is an overview of Chapter 1 of Practical Statistics for Data Scientists. I'll be going over the first couple of chapters of this book because the later chapters cover similar materials from my Hands on Machine Learning book overview videos.
    FREE Python Course: • Python for Data Analys...
    Link to Book: amzn.to/3ufeoYQ
    ----------------------------
    MX Master 3 amzn.to/3sTroBW
    LG 35in Curved Monitor amzn.to/39pPzR3
    USB-C Hub amzn.to/31Ip8Sl
    MacBook Pro Retina 16 Inch amzn.to/2PSwZde
    Twitter: / kalamari95
    LinkedIn: / shashankkalanithi

Комментарии • 219

  • @emmanuelczarpascua6721
    @emmanuelczarpascua6721 Год назад +23

    Thank you Shashank, I followed your data analyst roadmap and I was able to get a job as a data analyst even before completing all the recommended topics to go through. I forgot about it in the last couple of months. I am currently learning machine learning but felt that I don't really knew much of statistics and it became a limiting factor in learning ml. thank your for this video on stats. :)

  • @larryflores3849
    @larryflores3849 2 года назад +43

    This is what I need! I'm going over stats with Khan Academy right now and this is going to be a great supplement to that. Thanks again for these videos as always!

  • @HarpreetSingh-zy2gj
    @HarpreetSingh-zy2gj 2 года назад +2

    This is success when some foreigners as well as Indians want to learn from you and yours channel

  • @yochillfelix
    @yochillfelix 2 года назад +18

    why would you take the time to do this? You are such a kind soul!!! Thank you so much! I can't believe that your teaching this book! Patreon subscribed!!!

  • @Scoville95
    @Scoville95 2 года назад +43

    Technically not a strenuous subject but deriving meaning and reasoning from EDA is another thing! Great video!

  • @steveea6744
    @steveea6744 2 года назад +20

    I recently discovered your channel and I wish I would have found it a while ago.
    I wish to become a data analyst, soon. I don’t have experience (which I know and hope to change that soon) but I’m trying to obtain as much knowledge, tips and resources as I can and your channel has been helpful. I thank you for that.

    • @George-jz7qy
      @George-jz7qy 2 года назад

      Maybe we can work together because we on the same page

    • @nijagunadarshan2529
      @nijagunadarshan2529 2 года назад +2

      Hi Steve and George, are you interested in making a study group? We can discuss and learn from each other.

    • @George-jz7qy
      @George-jz7qy 2 года назад +2

      @@nijagunadarshan2529 that would be a great idea i think if we learn from each other it will quicken the process and will keep us motivated

    • @Bhatt4924
      @Bhatt4924 2 года назад +1

      pls make a study group

    • @George-jz7qy
      @George-jz7qy 2 года назад +1

      @@Bhatt4924 ok i will make one just give me by end of day and will send you link

  • @mizzchoc10
    @mizzchoc10 2 года назад +19

    Statistics is my weakness. So happy you are covering this! I always get motivated when I watch your videos, you have a way of explaining complicating things in a simple way.

  • @mr.r_r7199
    @mr.r_r7199 2 года назад +13

    this book has been on my to read list for a while, glad to see someone covering and explaining it

    • @confidential303
      @confidential303 2 года назад +1

      it is a such a low quallity book , I dont consider it as a book more as a annotation.

  • @syusyu28
    @syusyu28 2 года назад +4

    I have the same book! Recently I started learning. I will keep watching your channel.🙂 please continue this series 🙏

  • @bluelantern5241
    @bluelantern5241 Год назад +3

    You're really good at explaining things and speaking clearly. I appreciate this content very much

  • @davidyolchuyev2905
    @davidyolchuyev2905 2 года назад

    i was gonna pay a statistician for explaining to me the data interpretation of the difference between the average and the trimmed mean 60$ per hour, and then i found your video. The best thing is that I am also reading this book. What a great source. Thank you.

  • @tolulopeadetola1070
    @tolulopeadetola1070 2 года назад +3

    I just started following you few hours ago, I can tell that your channel is very practical
    Thank you for sharing hands on knowledge with us, I value and appreciate it.
    Keep it up

    • @d4tset785
      @d4tset785 2 года назад

      Hi Tolulope, would you like to join a study group?, I’m starting on Data engineering.

    • @confidential303
      @confidential303 2 года назад

      @@d4tset785 what does data engineering do?

  • @ResilientFighter
    @ResilientFighter 2 года назад +6

    love your channel and love how your going through the different books.

  • @thefighterjett2614
    @thefighterjett2614 2 года назад +1

    I need to comment more on your videos, they help me so much. It slips my mind but I’m working on it. Take that algorithm!!! Also thanks Shashank for pushing these videos out so quickly and sticking to your word. I remember you said you’d start these in a week and you posted Chapter 1 within days. Very Inspiring brother

  • @mr_amit_bhat
    @mr_amit_bhat 2 года назад +1

    Thanks. Wanted exactly this kind of a video to actually grasp what we read.

  • @tishakozlov9065
    @tishakozlov9065 2 года назад +2

    Hello Shashank, I am a student from Russia, currently on my first year of university. I discovered your channel couple mohtns ago and literally every video i have seen since then was awesome! Thank you so much! My dream of becoming a data analyst is becoming more and more real. Keep going!

    • @nijagunadarshan2529
      @nijagunadarshan2529 2 года назад +1

      Hello there, are you a beginner if so can make a study group?

  • @allandavid8684
    @allandavid8684 Год назад

    I haven't watched your video yet but I'm just commenting for the algorithm because you're such a life saver. I'm currently going through your road map. I'll comment again once I've seen the whole video though!

  • @americovaldazo4441
    @americovaldazo4441 2 года назад

    This video is pure gold for every data scientist. Thank you.

  • @Debatom
    @Debatom 2 года назад +11

    Where can I find a link to the notes? I can't find them in the description below the video. Great video and thanks for the help!

  • @mattdone3094
    @mattdone3094 2 года назад +1

    Awesome vid Shashank. I took an elementary statistics class in my community college but I finished the first chapter and it covers some very useful metrics and visualizations. Thank you so much for spreading the knowledge at no cost.

    • @ShashankData
      @ShashankData  2 года назад +4

      Of course! I spent way too much on my college education so want to help spread knowledge as cheaply as possible

    • @shanc3734
      @shanc3734 Год назад

      God bless you! What a sweet heart you are! Thank you Very,very much!

  • @opiquez
    @opiquez 2 года назад +3

    Hi Shashank, per the book: you can run the .value_counts() method on the pd.cut() result and it'll get you the proper Frequency table.

  • @IAmVik01
    @IAmVik01 7 месяцев назад +2

    Suggestion, Please keep your explanations simple and clean and please slow down little bit. Every learner here have patience if you are not too fast/slow but good in explaining. sometimes I lost you. :) I found that the explanation in the boot much better, Also when you are giving a reference of other web site, please provide URL. btw thanks for the video and it really helpful at some level.

  • @remlatzargonix1329
    @remlatzargonix1329 2 года назад

    Indices IS the correct form of the word....."Indexes" iq an Americanization.......not typically used in English-speaking countries, outside the USA. (Also, maybe in Canada as they use a combination of English and American English)

  • @priyendupant5941
    @priyendupant5941 2 года назад +1

    Awesome stuff, No hesitation whatsoever in becoming a patreon

    • @ShashankData
      @ShashankData  2 года назад

      Thank you so much for the support!

  • @kanishkadubey6765
    @kanishkadubey6765 2 года назад +1

    Thank You for such a useful video. It do helped me alot. Moving ahead to Chapter2 now with the help of your video.
    Thanks a ton !!

  • @AdarshKumar-sj5dn
    @AdarshKumar-sj5dn 2 года назад

    Very useful video Shashank Kalanithi. Thank you sharing your knowledge.

  • @khushiagarwal5249
    @khushiagarwal5249 2 года назад

    I really like the way you've made notes on the book!

  • @the0golden0men
    @the0golden0men 2 года назад

    46:28 correction an outlier is 8 or more not 10 because the box plot went to 7,5 and the first outlier was 10 (that's why he made the mistake). Since you can't get half a gold medal, when a country gets 8 medals, the country starts to become an outlier.

  • @jetjet6560
    @jetjet6560 2 года назад

    Damn I could not have gotten a better time to find this channel! The quality and content is really good dude :)

  • @bergkampthagoat
    @bergkampthagoat Год назад

    For the weighted median you can use the weightedstats library btw

  • @jayanthinathanmohanrajan713
    @jayanthinathanmohanrajan713 2 года назад +3

    Could you create a statistics playlist and cover these things under it??

  • @niceday2015
    @niceday2015 2 года назад

    High quality tutorial as always! Thank you. Keep the good job!

  • @online.hustle
    @online.hustle 6 месяцев назад

    Thank you!!! So much better than reading the book.

  • @ludgerderyce7312
    @ludgerderyce7312 2 года назад

    Great Content Can't wait for the subsequent chapters... ALSO thanks for going over your search methods online very useful...

    • @ShashankData
      @ShashankData  2 года назад

      Thanks so much, the next video is out now :)

  • @ChrisMao_708
    @ChrisMao_708 2 года назад +4

    3:34 my weight is down after watching this video, thanks

  • @shubhamdandekar20
    @shubhamdandekar20 2 года назад

    I love your videos and the way you explain things is so good. Keep it up.

  • @alisquest
    @alisquest 2 года назад +1

    Are we going to have the rest of book??? Doing something like this really helps us newbies.....do you intend to do the rest of the book?

  • @Kngdmio
    @Kngdmio 7 месяцев назад

    At 59:26 youre like... "oh.. interesting..." -- and then a moment of relief when you saw it was C++ not Python, haha

  • @julio1148
    @julio1148 2 года назад +4

    been using pandas for a while, just learned what it stands for lol

  • @madnecessity
    @madnecessity 2 года назад

    Exactly what I need right now
    Thank you so much!

  • @mohdhammadkhan5570
    @mohdhammadkhan5570 2 года назад

    Great work!
    As you have made video playlist on machine learning by O'reilly publication and currently working on another book of their publication i.e " practical stats for Data Scientists"
    Kindly start series on:
    Book: Python for Data Analysis
    Same O'reilly publication

  • @seamusugochukwu5711
    @seamusugochukwu5711 2 года назад +1

    Hey Shashank, any difference between a data analyst and a data scientist?

  • @kirand.4122
    @kirand.4122 2 года назад

    Not a crib comment but 1:01:40 in a bar chart you exchanged ylabel and xlabel.

  • @DeepakSharma-vx3ee
    @DeepakSharma-vx3ee 2 года назад +2

    Subscribed, please cover Multivariate Statistics after this 😊

  • @tjbroussard3524
    @tjbroussard3524 2 года назад

    Please keep this going!

  • @jppbkm
    @jppbkm 2 года назад

    Great video! Hope you're doing well settling in to the new place in Seattle. I am also a big Seaborn fan. I will have to check out plotly after how highly you recommend it!

  • @mohamedqani682
    @mohamedqani682 3 месяца назад

    Practical Statistics for Data Scientists, Do you prepare only chapter one or there's other chapters that you prepare? Continue the good work.

  • @kunalharia8524
    @kunalharia8524 2 года назад

    at 46:44 when looking at boxplot - you used "outlier" - perhaps better to say "suspected outlier"?

  • @ShivamTiwari-on2kl
    @ShivamTiwari-on2kl 2 года назад +1

    I'm on the third chapter of this book

  • @SOORYAPRAKASHKBML
    @SOORYAPRAKASHKBML 2 года назад

    Eagerly waiting for the video on next chapter Shashank!

  • @openyard
    @openyard 2 года назад +33

    I had to change the video speed to 0.75x. I think learning can be improved by instructing at natural human conversation speed.

    • @MC-8
      @MC-8 2 года назад +2

      He probably sped his video up a bit

    • @Juanp082413
      @Juanp082413 Год назад

      Jajaj

    • @towfiq2266
      @towfiq2266 Год назад +5

      Here's me watching it at 1.75x

    • @openyard
      @openyard Год назад

      @@towfiq2266 You may be having some knowledge on this matter. If you are a freshman in this field and can absorb the information at the speed you have indicated, kudos 👏 to you.

    • @joshuabretana
      @joshuabretana Год назад

      Thats what happens when indians move to the west theyre that guud

  • @mazakeral725
    @mazakeral725 2 года назад

    Do I need to become proficient in Python before exploring Data Analytics or Data Science?

  • @maharsajjad7184
    @maharsajjad7184 2 года назад

    Thanks for making a video on this book ..... I am thinking of starting to read this book.

    • @ShashankData
      @ShashankData  2 года назад +1

      It's a great book! Thanks for checking out my video Mahar!

  • @HeshamAliSalem
    @HeshamAliSalem 2 года назад

    I want to thank you for this video, and I have a Question and order:
    Q: I understand how to calculate the Weighted average but I want to know what is number 46.83 refers I think that the normal AVG is better in this situation I know that you make the example to explain the function but I want to go deep and ask when to use weighted AVG or only normal AVG.
    my order: can you share your notes.

  • @chipile
    @chipile 2 года назад

    Excellent explanation! subscribed for life!

    • @ShashankData
      @ShashankData  2 года назад

      Thank you so much for the kind works Abisai!

  • @ganeshhegde4049
    @ganeshhegde4049 2 года назад

    Thanks for sharing this , great job !

  • @mohamedjelassi9672
    @mohamedjelassi9672 2 года назад +1

    I have a question... you speak about statistics in data analytics... why not using R much stronger than the general python language

    • @ShashankData
      @ShashankData  2 года назад +2

      Great question! While you're right out-the-box R has much stronger stats abilities than Python, a lot of that gap is closed through commonly used third party libraries in Python. Because I an building my Data Science career on Python, my channel usually focuses on the use of Python to accomplish tasks. That being said, I always say R is a tremendous and widely accepted language in the Data Science community.

  • @yourfinancialanalyst
    @yourfinancialanalyst 2 года назад

    I am going to cover this playlist...please keep bringing

  • @ronyandrade7040
    @ronyandrade7040 2 года назад

    just to say, Hello from Brazil!

  • @jasonsykes4199
    @jasonsykes4199 2 года назад

    I know it is pretty hard to change the way one speaks. I am not talking so much about your voice or the majority of your speech. I am only focused on your saying, um, uhh, and the likes of the space holders. You can just mute these in your edits. This will make your videos near perfect. You have great content. I thought I would just add my 2 cents. Keep up the great work.

  • @aqibrehmanpirzada4552
    @aqibrehmanpirzada4552 2 года назад

    Excellent Work Sir
    Great Work Please Make more videos on this book

  • @amoghamaresh6702
    @amoghamaresh6702 2 года назад +1

    Hey Shanshank, great video. Thanks for uploading. Keep making these videos mate.
    Can I ask you to make a video on spectral data analysis. Thanks

  • @minhnguyen6508
    @minhnguyen6508 2 года назад

    Thank you for another book learning!

  • @GraceXIEgx
    @GraceXIEgx 6 месяцев назад

    Thank you for these videos!

  • @meetnagadia7853
    @meetnagadia7853 2 года назад

    Hey Shashank
    Great Work 👍

  • @navneetsahu4625
    @navneetsahu4625 2 года назад

    Great initiative 👍👍

  • @Drganguli
    @Drganguli 2 года назад

    Great video and thanks for putting up

  • @samfamily9459
    @samfamily9459 2 года назад

    It's really helpful.. nice work.. please share us more on statistics for data science, AIML..

  • @SA-ie2sp
    @SA-ie2sp 2 года назад

    There videos are the best.

  • @sarim9574
    @sarim9574 2 года назад

    I'm thinking about buying this book...but I heard the python code isnt that good, although the stat concepts are explained really well

  • @thegoldenagelegendz4425
    @thegoldenagelegendz4425 2 года назад

    I really want to join this series because i am looking for this and your explanation is well n good in just staring of video .

  • @dorothysilverman7660
    @dorothysilverman7660 2 года назад

    This sound silly, but did you make those notes from the book? I like them better than the book! If they are available, where would I find them?

  • @vishuuuu2547
    @vishuuuu2547 Год назад

    this is what i need

  • @georgezambrano5166
    @georgezambrano5166 2 года назад

    Awesome video!!

  • @lillogadget
    @lillogadget 2 года назад

    Just used your link to buy 3 books off of Amazon, thank you so much for sharing your content!

    • @ShashankData
      @ShashankData  2 года назад

      Thanks so much for the support Billie! Hope you find them useful

  • @Meristem968
    @Meristem968 2 года назад

    Thank you. You are wonderful

  • @Coffee_is_all_you_need
    @Coffee_is_all_you_need 2 года назад

    Thank you dude !

  • @viktoriiauntilova7695
    @viktoriiauntilova7695 2 года назад +2

    Dude I just bought this book and was pretty surprised it was in R ! Your video is of great importance !! Thank you so much ! Awesome job!
    Why did you use such a complex function to determine weighted median if we could do it simply:
    fraction = medal_count.Gold/medal_count.Total*100
    medal_count[fraction == np.median(fraction)]['Total'].iloc[0]
    this give the same result of 40 :)
    Please make the rest of chapters !! IT WAS AWESOME!!!! 😍

  • @nikhilshrestha4711
    @nikhilshrestha4711 2 года назад

    thank you sir 🙏. Great video 🙂

  • @charlizamon1
    @charlizamon1 2 года назад

    Great vid bud

  • @moroccangamereviews8824
    @moroccangamereviews8824 2 года назад

    what a beautiful content

  • @countryboy9695
    @countryboy9695 2 года назад

    very useful. pls continue

  • @lajaya101
    @lajaya101 2 года назад +1

    Thank You,
    Glad to find your channel.
    by the way, what apps you use to make the note?

    • @ShashankData
      @ShashankData  2 года назад +1

      Notion! It’s a great note taking tool

  • @kokiinho1
    @kokiinho1 2 года назад

    Do you have any video that say why you decided it to become a data scientist?

  • @slrahman8723
    @slrahman8723 2 месяца назад

    Just stunning!
    Could you make video about a book "story telling with data" by Cole Nassbaumer Knaflic

  • @vadivelan4228
    @vadivelan4228 2 года назад

    Good one.. Thank you....

  • @JeffersonCanedo
    @JeffersonCanedo 2 года назад

    Great video good job 👍

  • @axelrasmussen5365
    @axelrasmussen5365 2 года назад

    Great video

  • @piyushgopal44
    @piyushgopal44 Год назад

    notes are really helpful and the video is very informative. can u tell me what software you are using on the left side of the screen, it will be help me making my personal notes too. thank you

  • @johnmathew3141
    @johnmathew3141 2 года назад

    Amazing class

  • @Younessss_
    @Younessss_ 2 года назад

    dude you're amazing :)

  • @jarupbg
    @jarupbg 6 месяцев назад

    New here!! Hope I can learn something

  • @AbhishekSingh-lp9oq
    @AbhishekSingh-lp9oq 2 года назад +1

    Can you please share your notes

  • @hasnainshaukat4824
    @hasnainshaukat4824 2 года назад

    Hi really great video but if you can kindly add time steps across the video with topic names that would be really helpful

  • @moncefkarimaitbelkacem1918
    @moncefkarimaitbelkacem1918 2 года назад

    👌 quality content

  • @anto1756
    @anto1756 2 года назад

    I needed this 1 semester ago :((

  • @ben-cb5er
    @ben-cb5er 2 года назад +1

    Thank you for your amazing videos! Can you please do some videos and just tell us what to study as a beginner and how to study and practice. I'm new to data analyst and your advice and tips will truly help new people like me that just getting started :) thank you

    • @neneaytheenti4327
      @neneaytheenti4327 2 года назад

      Start with excel sheets, then master Sql and select one visualisation tool among Tableau, powerbi or quicksight. These skills are more than enough to open doors for plethora of jobs around you as an entry level data analyst. Once you get in job...upskill yourself by learning Python and its libraries like pandas. Then slowly transition to Data scientist role by learning advanced mathematics. All the best !!

  • @zoobia5554
    @zoobia5554 2 года назад

    Can anyone tell me why the variance differs when I use numpy and then when I use Shashank's method.

  • @kalyanreddy6260
    @kalyanreddy6260 2 года назад

    What is alternative of power pivot in excel for data analysis. Because excel 2016 home student doesn’t provide power pivots and for to buy it is much cost, is there any alternate to do analysis ?

    • @ShashankData
      @ShashankData  2 года назад +1

      R and Python are great alternatives that are widely used in industry. Personally I'm in favor of Python.

  • @behrouzhosseini1934
    @behrouzhosseini1934 2 года назад

    Hi Shashank, thanks for video, can you please let me know what application you are using for Note? Thanks