Hello Shashank, Thank you so much! Eagerly waiting for the remaining chapters.
Ohhh, Chapter 2!!! I have been waiting for that!
Bro sir, for the first time in my life I feel like I’m in a book club, thank you, I can’t wait for chapter 5
33:38 sklearn.utils.resample() returns all 2732 rows from the data if n_samples is not specified. Just adding n_samples=1000 to resample() will fix this. Otherwise you are grabbing the whole dataset on every iteration, which is why you keep getting the same mean and the same median.
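A minimal sketch of the point above, not the code from the video. The data is synthetic, and replace=False is an assumption about how resample() was called; with the default replace=True the result still has 2732 rows, but they are drawn with replacement, so the statistics would vary slightly between calls.

```python
import numpy as np
from sklearn.utils import resample

# Synthetic stand-in for the 2732 rows mentioned above (values are made up).
rng = np.random.default_rng(0)
heights = rng.normal(loc=79.0, scale=3.5, size=2732)

# n_samples omitted: the result has as many rows as the input.
# With replace=False (an assumption) this is only a shuffle of the full data,
# so the mean and median cannot change from one iteration to the next.
full_draw = resample(heights, replace=False, random_state=1)

# n_samples=1000: each call returns a different 1000-row subset.
subset_a = resample(heights, replace=False, n_samples=1000, random_state=1)
subset_b = resample(heights, replace=False, n_samples=1000, random_state=2)

print(len(full_draw), np.isclose(full_draw.mean(), heights.mean()))
print(len(subset_a), subset_a.mean(), subset_b.mean())
```

Inside a loop you would leave random_state unset (or change it each iteration) so that every pass draws a new subset.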
Taught really well. Thank you so much. Are these notes available?
Thanks Shashank! We do like your videos! 😍 We can learn together like that! It is very helpful! Waiting for more videos on this book.
Please make more of these, loving this series.
Great work & the Patreon page is top notch
Thank you so much! There’s much more content where that came from
So what would you consider the best way to analyze a U-shaped distribution, such as hourly or yearly temperatures? These follow a quasi-sinusoidal pattern, which gives the data a strongly U-shaped distribution.
A lot of material covers normal distributions, but U-shaped ones are covered much less often.
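To illustrate why a sinusoidal signal gives a U-shaped histogram, here is a small sketch using an idealized, noise-free sine wave as a hypothetical stand-in for temperature readings. A sinusoid changes slowest near its peaks and troughs, so evenly spaced readings pile up near the extremes.

```python
import numpy as np

# One full cycle sampled at evenly spaced times (idealized, no noise).
t = np.linspace(0.0, 2.0 * np.pi, 100_000, endpoint=False)
temperature = np.sin(t)  # hypothetical stand-in for real temperature data

# Ten equal-width bins across the full range of the signal.
counts, edges = np.histogram(temperature, bins=10, range=(-1.0, 1.0))
share = counts / counts.sum()

# Print the fraction of readings that falls into each bin.
for lo, hi, s in zip(edges[:-1], edges[1:], share):
    print(f"{lo:+.1f} to {hi:+.1f}: {s:.3f}")
```

The outermost bins hold the largest share of readings and the middle bins the smallest, which is the U shape described above.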
Do you use Notion for writing notes? Thanks for the wonderful video.
A big thanks for the series. Please also make a series on "Naked Statistics".
Could you please cover the use of applied statistics in model building and A/B testing for model validation? Thanks
thank you for the well presented video brother !!!!
Your videos are really useful
How does the central limit theorem work for asymmetrical data?
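One way to explore this question is a quick simulation. The sketch below is only an illustration, using an exponential distribution as an assumed example of strongly skewed data; it measures how the skewness of the sample means shrinks as the sample size grows.

```python
import numpy as np

rng = np.random.default_rng(42)

def skewness(values):
    """Sample skewness: third central moment over the cubed standard deviation."""
    centered = values - values.mean()
    return (centered ** 3).mean() / values.std() ** 3

# For each sample size n, draw 20,000 samples and take the mean of each one.
results = {}
for n in (2, 10, 50):
    means = rng.exponential(scale=1.0, size=(20_000, n)).mean(axis=1)
    results[n] = skewness(means)

# Print the skewness of the sample means for each sample size.
for n, skew in results.items():
    print(f"n={n}: skewness of sample means = {skew:.2f}")
```

The skewness of the means moves toward zero as n increases, so the distribution of sample means becomes more symmetric even though the underlying data is not.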
How can I access the notes, sir?
Nice video. In Europe we generally use cm for measuring heights, but it is well known that the USA uses feet (btw I am from Greece).
thank you for this video!
please do more videos like this :)
Hi Sir, I need guidance on a project predicting default on mortgage loans using R. What prerequisites do I need to build my skills?
Please add a translation option to the videos so everyone can follow them.
Can you share a link to the Notion workspace for the chapter?
love the series!
Thanks for the amazing guide. Please upload the next chapter guide 🙏.
Thank you! It’s already uploaded
@@ShashankData sorry but I can't see the chapter 3.
@@d3v487 Sorry I thought this comment was on the Chapter 1 video. Chapter 3 will be coming out soon
It would be better to abstract the cleaning of the height column out into its own function.
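A rough sketch of what such a helper could look like. The raw format is an assumption ("feet-inches" strings such as "6-10"), and the name height_to_inches is hypothetical, not taken from the video.

```python
from typing import Optional

INCHES_PER_FOOT = 12

def height_to_inches(raw: object) -> Optional[float]:
    """Convert a raw height value to inches, or return None if it cannot be parsed.

    Assumes the raw value is either a "feet-inches" string such as "6-10"
    or something that already looks like a number of inches.
    """
    if raw is None:
        return None
    text = str(raw).strip()
    if not text:
        return None
    parts = text.split("-")
    try:
        if len(parts) == 2:
            # "6-10" means 6 feet 10 inches.
            return int(parts[0]) * INCHES_PER_FOOT + float(parts[1])
        # Otherwise treat the value as inches already.
        return float(text)
    except ValueError:
        return None

# Parse a few example values, including an empty and an unparseable one.
cleaned = [height_to_inches(v) for v in ["6-10", "7-0", "", "unknown", 79]]
print(cleaned)
```

With pandas, a helper like this could be passed to the column's apply() method, which keeps the parsing rules in one place and makes them easy to test.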
I did exactly what was done in the video, but this error keeps showing up. Can anyone help me? I have followed every detail in the video. TIA.
# Central Limit Theorem
import pandas as pd
import matplotlib.pyplot as plt
sample_amounts = range(10, 100, 20)
for x in sample_amounts:
    samples = []
    for i in range(1000):
        samples.append(player_data_import["height_inches"].sample(x).mean())
    plt.hist(samples)
plt.legend([str(z) + " Samples" for z in sample_amounts])
plt.title("Central Limit Theorem")
TypeError: Traceback (most recent call last)
Cell In [14], line 11
9 samples = []
10 for i in range(1000):
---> 11 samples.append(player_data_import["height_inches"].sample(x).mean())
12 plt.hist(samples)
13 plt.legend([str(z) + " Samples" for z in sample_amounts])
TypeError: unsupported operand type(s) for +: 'int' and 'str'
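A possible cause, offered as a guess since the dataset is not shown here: the error is raised inside .mean(), which suggests the height_inches column holds a mix of numbers and text rather than a purely numeric dtype. The sketch below uses a made-up stand-in for player_data_import and converts the column with pd.to_numeric before sampling.

```python
import pandas as pd

# Made-up stand-in for player_data_import: mostly numbers, plus two text values.
raw_heights = [70 + i % 15 for i in range(200)] + ["6-10", ""]
player_data_import = pd.DataFrame({"height_inches": raw_heights})

# Coerce anything non-numeric to NaN, then drop those rows before sampling.
heights = pd.to_numeric(player_data_import["height_inches"], errors="coerce").dropna()

sample_amounts = range(10, 100, 20)
sample_means = {}
for x in sample_amounts:
    # 1000 sample means for each sample size, as in the snippet above.
    sample_means[x] = [heights.sample(x).mean() for _ in range(1000)]

print(heights.dtype, {x: len(m) for x, m in sample_means.items()})
```

Checking player_data_import["height_inches"].dtype in the notebook would confirm whether this applies; an object dtype means the cleaning step from the video did not run or did not convert every row.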
It's not common knowledge for me in Australia that 12 inches = 1 foot, but I'm not representative of the population!
Thanks for your contribution to data science, I love your video walkthroughs of textbooks.
Haha, I do have to remind myself every once in a while that we're the only ones who use those measurements
@@ShashankData haha you guys are off doing your own thing! So I'm considering joining Patreon to support you (first person ever) because I find your content really valuable. In the Discord, do we have an opportunity to talk to you directly, and what perks come with joining?
@@willosullivan1986 Thanks so much for the consideration, Will. Yes, you can talk to me directly using Discord or Patreon, and you get access to a bunch of notes I have on machine learning, statistics and future topics
@@ShashankData joined :)
finished watching
How can I access these notes?
Hello,
What times are your live streams? I am in the CST zone.
I’m CST too :). 10 AM CST on Thursdays and 7 PM CST on Fridays
Can anyone share the link to the dataset? 😅 Please.
Please, I still cannot access the dataset. Can I get help?
Hello, when will we have new chapters? Hugs from Brazil/Nordeste/PI/Teresina.
🤩
Are you trying to learn the material by teaching others while monetizing at the same time? I would not advise learning probability theory, statistics, or programming by watching these videos.
Bro, I'm from Italy, nobody even knows what a foot is ahahaha. I just tell people I'm 6'4" because that's what my height in cm translates to.
Hahahaha yeah, I need to be a bit more cognizant of the global audience we have on this channel. I wish we used metric here