Practical Statistics for Data Scientists - Chapter 2 - Data and Sampling Distributions

Поделиться
HTML-код
  • Опубликовано: 28 дек 2024

Комментарии • 46

  • @yasmeenkarachiwala9612
    @yasmeenkarachiwala9612 11 месяцев назад

    Hello Shashank, Thank you so much! Eagerly waiting for the remaining chapters.

  • @syusyu28
    @syusyu28 3 года назад

    ohhh Chapter2!!! I have been waiting fro that!

  • @Kinnoshachi
    @Kinnoshachi 3 года назад +3

    Bro sir, for the first time in my life I feel like I’m in a book club, thank you, I can’t wait for chapter 5

  • @axeltessai1359
    @axeltessai1359 3 года назад

    33:38 sklearn.utils.resample() returns all 2732 rows from the data if n_samples is not specified. just adding n_samples=1000 to resample() will fix this. Otherwise you are just grabbing the whole same data every iteration which is why you keep getting the same mean and the same median.

  • @Prasadh-p9e
    @Prasadh-p9e 15 дней назад

    Taught really well . Thank you so much . Are these notes available ?

  • @viktoriiauntilova7695
    @viktoriiauntilova7695 3 года назад

    Thanks Shashank ! we do like your videos ! 😍We can learn together like that ! It is very helpfull ! Waiting for more videos for this book

  • @ShahzadHassanBangash
    @ShahzadHassanBangash 3 года назад

    Please make more of these loving these series

  • @ludgerderyce7312
    @ludgerderyce7312 3 года назад

    Great work & the Patreon page is top notch

    • @ShashankData
      @ShashankData  3 года назад

      Thank you so much! There’s much more content where that came from

  • @richardlinsley-hood7149
    @richardlinsley-hood7149 3 года назад

    So what would you consider the best way to analyze a U shaped distribution? Such as hourly or yearly temperatures. These form a quasi-sinusoidal pattern which gives a strongly U-shaped distribution to the data.
    A lot of stuff is about normal distributions but U shaped is covered much less often.

  • @bhavinmoriya9216
    @bhavinmoriya9216 2 года назад

    You use notion for writing notes? Thanks for wonderful video.

  • @RN-er7mz
    @RN-er7mz 3 года назад

    A big thanks for the series, also please make series on "naked statistics"

  • @lokeshkumar1365
    @lokeshkumar1365 3 года назад +2

    Could you please use of applied statistics in model building and AB testing for modal validation?Thanks

  • @harshasamudrala2115
    @harshasamudrala2115 3 года назад

    thank you for the well presented video brother !!!!

  • @517127
    @517127 3 года назад

    Your videos are really useful

  • @dimpushah3712
    @dimpushah3712 3 года назад

    How Central limit theorem works for unsymmetrical data?

  • @VishalSoni-te7hy
    @VishalSoni-te7hy Год назад

    How can i access notes sir?

  • @konstantinosfasoulakos5038
    @konstantinosfasoulakos5038 2 года назад

    Nice video, generally in Europe we use cm for measuring heights, but it is well known that the USA use foot (btw I am from Greece).

  • @josecarrasco9840
    @josecarrasco9840 3 года назад

    thank you for this video!

  • @moroccangamereviews8824
    @moroccangamereviews8824 2 года назад

    please do more videos like this :)

  • @rameshkannan1075
    @rameshkannan1075 2 года назад

    Hi Sir I need guidance on Predicting Default on mortgage loan project using R. What are the perquisites I need to enhance my skills..

  • @hescajeda
    @hescajeda 3 года назад +1

    Plis make the video with option traductor for everypeople.

  • @AbdElrahman_Mohamed_faheem
    @AbdElrahman_Mohamed_faheem 3 года назад

    Can you share a link of notion work space of the chapter ?

  • @ba-en1io
    @ba-en1io 3 года назад

    love the series!

  • @d3v487
    @d3v487 3 года назад

    Thanks for Amazing Guide. Please upload the next chapter guide 🙏.

    • @ShashankData
      @ShashankData  3 года назад

      Thank you! It’s already uploaded

    • @d3v487
      @d3v487 3 года назад

      @@ShashankData sorry but I can't see the chapter 3.

    • @ShashankData
      @ShashankData  3 года назад +1

      @@d3v487 Sorry I thought this comment was on the Chapter 1 video. Chapter 3 will be coming out soon

  • @wilsvenleong96
    @wilsvenleong96 3 года назад

    would be better to abstract the cleaning function for the height column out into its own function.

  • @content_king3012
    @content_king3012 2 года назад

    I have done it exactly the same in the video but this error keeps showing up. Can anyone help me? I have followed every details in the video. TIA.
    # Central Limit Theorem
    import pandas as pd
    import matplotlib.pyplot as plt
    sample_amounts = range(10, 100, 20)
    for x in sample_amounts:
    samples = []
    for i in range(1000):
    samples.append(player_data_import["height_inches"].sample(x).mean())
    plt.hist(samples)
    plt.legend([str(z) + " Samples" for z in sample_amounts])
    plt.title("Central Limit Theorem")
    TypeError: Traceback (most recent call last)
    Cell In [14], line 11
    9 samples = []
    10 for i in range(1000):
    ---> 11 samples.append(player_data_import["height_inches"].sample(x).mean())
    12 plt.hist(samples)
    13 plt.legend([str(z) + " Samples" for z in sample_amounts])
    TypeError: unsupported operand type(s) for +: 'int' and 'str'

  • @willosullivan1986
    @willosullivan1986 3 года назад

    Not common knowledge for me in Australia that 12 inches = 1 foot, but I'm not representative of the population!.
    Thanks for your contribution to data science, I love your video walkthroughs of text books.

    • @ShashankData
      @ShashankData  3 года назад +1

      Haha I do have to tell myself that only we use those measurements every once and a while

    • @willosullivan1986
      @willosullivan1986 3 года назад

      @@ShashankData haha you guys are off doing you're own thing! So I'm considering joining Patreon to support you (first person ever) because I find your content really valuable.. in the Discord do we have an opportunity to talk to you directly and what perks come with joining?

    • @ShashankData
      @ShashankData  3 года назад

      @@willosullivan1986 Thanks so much for the consideration Will, yes you can talk to me directly using Discord or Patreon, and you get access to a bunch of notes I have on machine learning, Statistics and future topics

    • @willosullivan1986
      @willosullivan1986 3 года назад

      @@ShashankData joined :)

  • @sandipansarkar9211
    @sandipansarkar9211 3 года назад

    finished watching

  • @siddheshdhanawade3709
    @siddheshdhanawade3709 Год назад +2

    how can i acces this notes

  • @kli9005
    @kli9005 3 года назад

    Hello,
    What times are your live streams? I am in the CST zone.

    • @ShashankData
      @ShashankData  3 года назад

      I’m CST too :). 10 AM CST on thursdays and 7 PM CST on Friday’s

  • @djblaccs
    @djblaccs 5 месяцев назад

    ANYONE SHARE THE LINK TO THE DATASET...? 😅 PLEASE.

    • @djblaccs
      @djblaccs 5 месяцев назад

      PLEASE I STILL CAN NOT ACCESS THE DATASET CAN I GET HELP PLEASE ?

  • @OceanAlves23
    @OceanAlves23 2 года назад

    Olá, quando teremos novos capítulos? Abraços, do Brazil/Nordeste/PI/Teresina.

  • @xXHelsingGamingXx
    @xXHelsingGamingXx 2 года назад

    🤩

  • @tigerzero9939
    @tigerzero9939 2 года назад +1

    Are you trying to learn the material by teaching others while monetizing at the same time? I would not advise learning probability theory, statistics, or programming by watching these videos.

  • @Younessss_
    @Younessss_ 2 года назад

    bro im from italy nobody even knows what a foot is ahahaha, i just tell people i'm 6'4" cause thats what my height in cm translates to.

    • @ShashankData
      @ShashankData  2 года назад

      Hahahaha yeah I need to be a bit more cognizant about the global audience we have in this channel. I wish we used metric here