Bootstrapping Main Ideas!!!

Поделиться
HTML-код
  • Опубликовано: 27 сен 2024

Комментарии • 654

  • @statquest
    @statquest  3 года назад +121

    NOTE: A lot of people ask "What happens when the original collection of measurements is not representative of the underlying distribution?" It's important to remember that a confidence interval is not guaranteed to overlap the true, population mean. A 95% CI means that if we make a ton of CIs using the same method, 95% of them will overlap the true mean. This tells us that 5% of the time we'll be off. So yes, a sample that is totally bonkers is possible, but rare. Understanding this risk of making the wrong decision, and managing it, is what statistics is all about.
    Also, at 5:55 I say there are up to 8^8 combinations of observed values and possible means, but this assumes that order matters, and it doesn't. So 8^8 over counts the total number of useful combinations and the true number is 15 choose 8, which is 6435 (for details on this math, see: en.wikipedia.org/wiki/Multiset#Counting_multisets )
    Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/

    • @natasgestel6873
      @natasgestel6873 2 года назад +1

      We take for granted all that went behind that idea of 95% CI that you stated - it was Jerzy Neyman's who came up with that definition. Have you read "The Lady Tasting Tea"? A bit of a history of some incredible mathematicians, including Ronald Fisher and Jerzy Neyman. The 95% comes up on page 123. Thanks for all your valuable statistics videos!

    • @statquest
      @statquest  2 года назад +3

      @@natasgestel6873 Yes, I've read the book. Those dues were pretty smart.

    • @PastryDonut
      @PastryDonut 2 года назад +1

      Thank you for explaining that order doesn't matter. I was looking for the clarification on this everywhere.

    • @alexandersmith6140
      @alexandersmith6140 11 месяцев назад

      So, if we take our sample of 8 observations, and we calculate a 95% confidence interval around the sample mean by bootstrapping, and then a genie appears and tells us that the true population mean lies outside of that confidence interval, that's the same as saying that our original 8-observation sample's mean actually wouldn't appear 95% of the time if we repeated the experiment infinitely many times, each experiment being an 8-observation sampling of the population?

    • @statquest
      @statquest  11 месяцев назад +2

      @@alexandersmith6140 The definition is of a 95% CI is that if we repeated the process of creating the 95% CI a ton of times, 95% of the CIs created that way would overlap the true mean. Thus, if collected 8 measurements and used Bootstrapping to calculate a 95%, then that if we repeated that process of creating the 95% CI a ton of times (collected 8 measurements, then calculated the CI with bootstrapping), then 95% of those CIs will overlap the true mean.
      In other words, it doesn't matter if we use bootstrapping, or some formula to calculate the CI, in both cases we have to collect 8 measurements a ton of times.

  • @NeuralNine
    @NeuralNine Год назад +39

    There is nobody on RUclips that explains statistics better or in a more entertaining way than you! Keep it up!

  • @gamuchiraindawana2827
    @gamuchiraindawana2827 9 месяцев назад +7

    What I love about you is that you explain the big picture first. You help me understand why we should care in the first place, or the motivation behind the concept. Then you dive into the details afterwards, you make the information more accessible without compromising the technical integrity of the information. A very rare skill indeed, I'm reading Introduction To Statistical Learning in R ( ISLR ) and some chapters aren't intuitive, whenever I read a chapter that doesn't make sense I just watch your videos. That's how I know you're not compromising the technical integrity of the information, because what you say doesn't contradict what I read in academic papers, it's just easier to understand than what I read in academic papers. You are one of a kind!

    • @statquest
      @statquest  9 месяцев назад +1

      Thank you very much!

    • @gamuchiraindawana2827
      @gamuchiraindawana2827 9 месяцев назад +1

      @@statquest No, thank YOU Josh!

    • @CaptainFeatherSwordz
      @CaptainFeatherSwordz 6 месяцев назад +3

      I think you summed up the value of these videos really well. Starting with the big picture and then zooming into the details is so much more beneficial for learning and I think this is one of the things Josh nails!

    • @gamuchiraindawana2827
      @gamuchiraindawana2827 6 месяцев назад

      @@CaptainFeatherSwordzIt's worlds apart from what the education system has conditioned us to right?

  • @yannivdp1600
    @yannivdp1600 Год назад +146

    I have done a master's in stats and a course in data analysis, and the only reason I've passed these things is that after a long and confusing lecture I can just come and watch you explain it in simple terms. Bam!
    Thank you so much!!

    • @statquest
      @statquest  Год назад +1

      Thanks! I'm glad my videos are helpful! :)

    • @BheezHandle
      @BheezHandle 11 месяцев назад

      I am presently in your shoes, taking a Data Science Course but thanks to @statquest. giving him Double Bam!!

    • @PunmasterSTP
      @PunmasterSTP 6 месяцев назад

      That's cool! How are your studies/career going?

  • @monoarul_islam_3
    @monoarul_islam_3 2 года назад +16

    You're probably the best guy for this job. Even though I don't know where I'm gonna apply all these. I just keep going through all of your videos. After finishing up this playlist I'll watch the ML playlist. Keep amazing us. Thank you JOSH

  • @Hemzees
    @Hemzees 3 года назад +7

    I can never get over how your videos make me love statistics when all my professors and recommended texts made me run away from it. Super grateful!! Also, I think I asked when this video was coming about a year ago.

    • @statquest
      @statquest  3 года назад +2

      Glad it finally came out! :) Sorry it takes me so long to make videos.

  • @vtphan2012
    @vtphan2012 2 года назад +33

    Another great video. This video explains how to do bootstrap, which is the easy part. The more difficult part is to understand why bootstrap works. The conceptual challenge is that bootstrapping assumes that if we were to repeat an experiment, it would produce one of the outcomes we had observed. This could be a huge assumption, depending on the applications. Boot strapping does not add any new information to what has been observed.

    • @statquest
      @statquest  2 года назад +1

      Noted

    • @sgpleasure
      @sgpleasure Год назад +1

      "The reason why this works is because the histogram of the sample tends to look very similar to the histogram of the population. That's really the key idea behind the bootstrap, and we will see how this idea can be used in all kinds of complicated situations. "
      Taking an online course on bootstrap regression and came here to try to understand why bootstrap works when it does not generate any new information.

    • @vtphan2012
      @vtphan2012 Год назад +5

      @@sgpleasure When you sample from a population, it’s unsurprising that the distribution of the sample resembles the distribution of the population. So, you’re not really obtaining any new information. In essence, we’re only pretending it’s new information, when in fact, it’s just reconfirming existing information.

  • @aishah8398
    @aishah8398 3 года назад +11

    I wonder n feel so much regard for the institution and teachers, who taught you... no doubt, you are doing an incredible job...stay blessed always

  • @meanyapickles
    @meanyapickles Год назад +1

    All semester long I have been floundering through my statistics class, no thanks to my professors' boring and quite difficult-to-follow lectures on the materials. I've felt so dumb all semester, so when the next section called for "bootstrapping" I finally decided to throw her lecture videos aside and see if someone could explain the concepts better on RUclips. Boy am I glad I stumbled upon this. The visuals are straight to the point and the way you talk through everything very slowly and clearly is SOOO helpful. The enthusiasm and goofiness helps me keep my attention, which is a pain for me with ADHD. I could rewatch my prof's videos 5 times and retain nothing. Makes me wanna just burst into tears from frustration. But I felt like I could actually keep up with this video and _understand_ it!
    TL;DR thank you for making this, it was a HUGE improvement over my professor's teaching style and I will DEFINITELY be consulting you for future topics. You're a peach

    • @statquest
      @statquest  Год назад +1

      Hooray! Thank you very much. Just for reference, here's a list of all of my videos: statquest.org/video-index/

    • @meanyapickles
      @meanyapickles Год назад +1

      @@statquest thank you very much

  • @rhexieleelafuente4271
    @rhexieleelafuente4271 2 года назад +19

    What a comprehensive and fun discussion! I really had trouble understanding the concept of bootstrapping by myself but your lecture helped me a great deal :> Kudos!

    • @statquest
      @statquest  2 года назад +1

      Glad it was helpful!

  • @잠자기장인-x3r
    @잠자기장인-x3r 5 месяцев назад +1

    You sir are an absolute legend. Really helping me getting through my course, because my professor explains the same concept in a method that is 100 times harder to understand

    • @statquest
      @statquest  5 месяцев назад +1

      Happy to help!

  • @aneishajay7949
    @aneishajay7949 Год назад +1

    I read a section on bootstrapping countless times and only understood it finally after watching your video! All I have to say to that is: BAM! (and thanks a bunch)

  • @bubblebath2892
    @bubblebath2892 Месяц назад +1

    I am justt speechless at - how can you mae something so complicated so simple , hats off to you and thanks a ton

  • @quorrexnoway9127
    @quorrexnoway9127 3 года назад +38

    This kinda feels illegal xD Really nice explained!

  • @diegobuenovillafane869
    @diegobuenovillafane869 Год назад +3

    BEST explanation EVER of bootstrap. Thanks for your dedication!

  • @qi2li2n_prance
    @qi2li2n_prance 10 месяцев назад +1

    You made this concept so much easier to understand than what I was supposed to be learning it from. Thank you so much!!

    • @statquest
      @statquest  10 месяцев назад

      Glad it was helpful!

  • @travelthetropics6190
    @travelthetropics6190 3 года назад +6

    Wow! this is the first time I learned this. awesome!

  • @witchaponkitthaworn5998
    @witchaponkitthaworn5998 Год назад +1

    I am learning machine learning and came to this term , this videos explain it very clear, thank you.

  • @ayenisedaghat4667
    @ayenisedaghat4667 6 месяцев назад +1

    You are a legend my friend! A legend. I am doing my masters in Data Science this fall and this is amazing

    • @statquest
      @statquest  6 месяцев назад

      You can do it!

  • @AvatardSwag
    @AvatardSwag 3 года назад +3

    This is the most amazing video I've seen on bootstrapping, thank you! Quadruple Bam!

  • @luko141
    @luko141 2 года назад +1

    Mr. Josh - u are amazing. World needs more ppl like u. Its like education on another level. Thank you

  • @AliasNichtVerfügbarDrölf
    @AliasNichtVerfügbarDrölf 2 года назад +1

    Not the information I was looking for but i couldn't stop myself from watching it to the end. It was quite entertaining :)

  • @alessandrobucci951
    @alessandrobucci951 2 года назад +1

    One of the most useful video on this topic on youtube, thanks!

  • @dutonic
    @dutonic Год назад +1

    Wow this is so good. The intro made me laugh so hard, it wasn't even that funny I just didn't expect it.

  • @tsuchiko_7155
    @tsuchiko_7155 2 года назад +4

    OMG, it was a truly easy-to-understand video! Both the animation, narration, and explanation!!!! I wanna give a billion likes!!!

  • @PunmasterSTP
    @PunmasterSTP 6 месяцев назад +1

    Bootstrapping? More like "Bro, it's awesome knowledge you're dropping!" 👍

  • @ignasialemany7738
    @ignasialemany7738 2 года назад +1

    The triple BAM was amazing, thank you!

  • @oscarrosalescorzo
    @oscarrosalescorzo Год назад +2

    The work you do is awesome!! Love it.

  • @andreyele3464
    @andreyele3464 Год назад +1

    Josh, you're sent to us from heaven, thanks

  • @w157-p5x
    @w157-p5x 7 месяцев назад +1

    blud just dropped one of the best explanatory videos out there and thought we wouldnt notice☠☠☠

  • @everyhinghere1377
    @everyhinghere1377 Год назад +1

    this is better than college-level advanced course !!! thank you

  • @adityarazpokhrel7626
    @adityarazpokhrel7626 2 года назад +1

    So smoothly explained.
    Thank you sir.

  • @GeerathBhat-vm1vp
    @GeerathBhat-vm1vp Год назад +1

    Thank you so much for your wonderful videos. I have a small request to provide a lecture on FLDA, GMM, EM Algorithm, MLE estimation, MAP estimation. Also, there are some lectures which are not in the book, please also include those lectures too. Thank you so much again!!!. I want to learn more and more from your lectures.

    • @statquest
      @statquest  Год назад

      Thanks! I'll keep those topics in mind.

  • @davusieonus
    @davusieonus 2 года назад +1

    i loved your bams and the illustrations for the steps and your explanation helped a lot

  • @keepcontinue
    @keepcontinue Год назад +1

    Excellent explanation as always by StatQuest!!! Thx a lot!!!

  • @sevenindigo5945
    @sevenindigo5945 Год назад +1

    I can only say one thing: BAM!!! you are the best teacher BAM!!!

  • @amirmusavi3141
    @amirmusavi3141 2 года назад +1

    i read 30 pages of a book , almost get sth,watch 10 min of statquest, fully understand the subject, you're the best bro

  • @alaminmustafa8902
    @alaminmustafa8902 2 года назад +2

    I love the terminology alert😂
    quadruple bam !😂

  • @НосокНесуществующий

    Wow, that was super easy to understand. Thank you very much

  • @Observe411
    @Observe411 2 года назад +1

    Great video!
    Btw, you could probably do a really good Solid Snake voice. Would love to get an Easter egg in one of the next videos!!

    • @statquest
      @statquest  2 года назад +1

      That would be funny. :)

  • @dashsingh30095
    @dashsingh30095 3 года назад +1

    Statquest is the netflix for data science concepts.

  • @annaspitczok9471
    @annaspitczok9471 2 года назад +1

    Big BAM for so much statistic knowledge in such little time

  • @petercourt
    @petercourt 3 года назад +6

    Awesome work! Thanks Josh! It seems weird to me that this creates useful/meaningful results. You're just reusing the same data again and again, so even though we can generate new means by bootstrapping, are those means actually correct? In other words, has anyone compared bootstrapping (and calculating the mean) to repeating the experiment a bunch of times (and calculating the mean)?

    • @statquest
      @statquest  3 года назад +4

      Yes. Bootstrapping has been around for about 40 years and has tons of theoretical justification etc. It's the real deal.

    • @jeanf6295
      @jeanf6295 2 года назад +1

      You can simulate bootstrapping techniques easily.
      The bootstrapped distribution has the same mean as the statistical mean of the sample, and the more points you have in your original sample, the closer its width will be to the spread of the means you would get if you redid the experiment a bunch of time.
      Bootstrapping is not the same however as doing the experiment a bunch of times, as the latter would give a bigger dataset with a narrower confidence interval. This is a confidence interval estimation method that does not make assumptions about the underlying data distribution.

  • @williamss4277
    @williamss4277 5 месяцев назад +1

    Thank you for the teaching 🎉

  • @SP-qk6vd
    @SP-qk6vd 7 месяцев назад +1

    In 8:23 the notation on the x-axis should be median values not mean values since we are using median as statistic measurement for bootstrapping in this case...pls look into it

    • @statquest
      @statquest  7 месяцев назад +1

      Yep. That's a typo.

  • @Tozziz
    @Tozziz 2 года назад +1

    Dear Josh I bought a few study guides :) Thanks so much for your videos

    • @statquest
      @statquest  2 года назад

      Awesome! Thank you so much for your support!!

  • @thepresistence5935
    @thepresistence5935 2 года назад +1

    9 th wonder I learned bootstrapping and confidence intervals! hurray!

  • @thomasdelossantosverrijp8117
    @thomasdelossantosverrijp8117 Год назад +1

    thank you, this was very helpful

  • @YK-jn2kp
    @YK-jn2kp 3 года назад +1

    I loved the idea of shameless self promotion idea lol. Thanks for your time and effort.

  • @lahaale5840
    @lahaale5840 2 года назад +1

    Thank you so much, your videos are always so helpful to me

  • @musefaredi5540
    @musefaredi5540 Год назад +1

    Nice way of explanation!! BAM!!!

  • @ElSamyto
    @ElSamyto 11 месяцев назад +1

    Okay you are the best thank you for doing this video !

  • @verafan6962
    @verafan6962 Год назад

    Create video! One small flaw I would like to point out the lingo should be fail to reject the null hypothesis instead of cannot reject the hypothesis.

  • @Mesye_bober
    @Mesye_bober 2 месяца назад +1

    Nice work , man

  • @darnelb1912
    @darnelb1912 3 года назад +2

    Great video! I learn so much with you!

    • @statquest
      @statquest  3 года назад +1

      Awesome! Thank you!

  • @yingyanhuang1688
    @yingyanhuang1688 Год назад +1

    You are amazing! Thank you!

  • @DurjoyRoy-ll4zt
    @DurjoyRoy-ll4zt 2 месяца назад +1

    Loved it... Big BAM!

  • @utkarshsingh2675
    @utkarshsingh2675 2 года назад +1

    easy to understand....thanks josh!

  • @fgfanta
    @fgfanta 6 месяцев назад +1

    University professor explained it in a confused and insufficient way (to put it politely), then I came to StatQuest.

  • @arifmemovic3383
    @arifmemovic3383 2 года назад +1

    You rock Josh. Thanks for making this video!

  • @gitaecookiemonster2892
    @gitaecookiemonster2892 3 года назад +1

    Love all of your videos!! Thanks a lot!

  • @vig9737
    @vig9737 2 года назад +1

    Josh is on Spotify! BAM

  • @mahdiranjbar3569
    @mahdiranjbar3569 2 года назад +1

    Ur videos are just so cool, tnx a lot

  • @rajatbadade20
    @rajatbadade20 3 года назад +1

    Please upload videos on monte carlo simulation and integration

    • @statquest
      @statquest  3 года назад

      I'll keep that in mind.

  • @dylanspodcast
    @dylanspodcast Год назад +1

    This guy is great!

  • @AnalyticsAlchemy-nq3kq
    @AnalyticsAlchemy-nq3kq 6 месяцев назад +1

    Nice explanation...awesome

  • @bushraw66
    @bushraw66 3 месяца назад +1

    Thank you

    • @statquest
      @statquest  3 месяца назад

      TRIPLE BAM!!! Thank you so much for supporting StatQuest!!! :)

  • @Tweeteketje
    @Tweeteketje Год назад

    Thank you so much, very clear again! Not planning to make some videos about Fisher information, Jackknife, and Delta method, by any chance? 😬

    • @statquest
      @statquest  Год назад +1

      No time soon, but I'll keep them in mind.

    • @tripp8833
      @tripp8833 Год назад +1

      @@statquest Jackknife would be cool!

  • @uyenvu3920
    @uyenvu3920 5 месяцев назад

    Thank you for this informative video; it really clarified my understanding of bootstrapping! However, I'm curious about the choice between sampling with replacement versus without replacement in the bootstrap method. How do we determine the most appropriate method for our specific dataset? Are there particular scenarios or types of data where one method is preferred over the other? Any additional insights would be greatly appreciated. Thanks a lot!

    • @statquest
      @statquest  5 месяцев назад

      Bootstrap always uses sampling with replacement.

  • @Cuddpo
    @Cuddpo 9 месяцев назад +1

    Thank you 🙏

    • @statquest
      @statquest  9 месяцев назад

      You’re welcome 😊!

  • @anakagung7613
    @anakagung7613 2 года назад +1

    Awesome song!

  • @GlorifiedTruth
    @GlorifiedTruth 3 года назад +1

    🎵
    Boot World
    Workin', playin'
    Boot World
    It's number one in boots.
    🎵

  • @suheyla655
    @suheyla655 Год назад +1

    You are the best!

  • @RobertaBraga_
    @RobertaBraga_ Год назад +1

    Thanks Josh, you are the one!

    • @statquest
      @statquest  Год назад

      Thank you and congratulations again. I'm so glad I was helpful. BAM! :)

  • @brothermanbill7338
    @brothermanbill7338 3 года назад +3

    I love how cute was 'small bam'

  • @tripp8833
    @tripp8833 Год назад +1

    Thanks!

    • @statquest
      @statquest  Год назад

      Thank you so much for supporting StatQuest!!! BAM! :)

  • @leonelemiliolereboursnadal6966
    @leonelemiliolereboursnadal6966 2 года назад +1

    ¡Gracias!

    • @statquest
      @statquest  2 года назад

      Hooray!!! Muchas Grasias for supporting StatQuest!!! BAM! :)

  • @marlocosta5531
    @marlocosta5531 2 года назад +1

    Thank you JOSH!

  • @etwasverfehlt2397
    @etwasverfehlt2397 2 года назад +2

    Josh if you need someone who cleans your room or makes the dishes, just give me a call. I own you that

  • @brettclark3885
    @brettclark3885 2 года назад

    At 6:40 when you start to discuss the 95% CI; I think there will be a lot of people who wont understand the subtlety of this distribution. You have created a distribtuion of 'statsitics'; in this case the mean. So, as you would appreciate you have derived the "sampling distribution' of the mean, from which the standard deviation = the standard error of the mean and the 95% CI calcaution is trivial. The uninitated might not appreciate how this is different from a distribution of a single data set; whereby the standard error = the standard devation / sqrt(n).

  • @datascience_azamat
    @datascience_azamat 2 года назад +1

    wow, great content!

  • @tuchevera
    @tuchevera 10 месяцев назад +1

    Thank you very much!! One question: When a need to calculate de standard error, I just need to calculate de standard deviation of the resamples? Or a need to calculate the standard deviation divided by the square root of n?

    • @statquest
      @statquest  10 месяцев назад +1

      Remember what the standard error is - it the standard deviation of the means we would get from collecting a lot of different samples and calculating the mean for each one. So, if we use bootstrapping to create a bunch of means, all we need to do is calculate the standard deviation of those means..

    • @tuchevera
      @tuchevera 10 месяцев назад +1

      @@statquest Ooh I got it! Thank you very much for your answer and for being generous enough to explain! :)

  • @theburtmacklin9615
    @theburtmacklin9615 3 года назад

    Nice - “Part 1”, gonna have another nice solid multi-part set.

    • @statquest
      @statquest  3 года назад +1

      At least 2 parts. Part 2 will be out soon.

  • @PythonArms
    @PythonArms Год назад +1

    great video

  • @dsanjoy
    @dsanjoy 3 года назад +1

    Instantly subscribed!!

  • @hughsignoriello
    @hughsignoriello Год назад +1

    Explained better than my professor! Amazing work Josh

  • @mmm-me4kk
    @mmm-me4kk Месяц назад

    Thank you for the great explanation, but just a question, the bootstrap histogram will always be similar to the original data distribution if you repeat the bootstrap a lot of times. So i'm not sure if i get the added value, especially because the bootstrapped dataset is a based on the means (in this example) but the mean of 0.5 is a bit misleading since the standard deviation is very high, based on the bootstrap and confidence interval we cannot reject the null hypothesis, as it include 0, but isn't that a bit.. like, I mean of course statistically it's correct, but in reality the drug has a very sign. effect (either positive or negative), but since both occur, the mean implies that the drug has no effect, so do the bootstrapped histograms. Thanks again.

    • @statquest
      @statquest  Месяц назад

      I'm not sure I fully understand your question. The idea is that the boostrap represents what would happen if we could repeat the experiment a lot more times.

  • @garyboy7135
    @garyboy7135 2 года назад

    Can you also use bootstrapping method to calculate standard error for medians?

  • @SwapperTheFirst
    @SwapperTheFirst 3 года назад +2

    Thanks! Couple of questions - could someone please clarify this for me, please:
    1) At 8:40 we should see "median values" at the bottom distribution instead of "mean"? 2) also, at the same time mark, why confidence levels moved to the left this far? they cover mostly "feeling worse" data points.
    More general question - is Bootstrapping theoretically or conceptually linked to the Central Limit Theorem?

    • @statquest
      @statquest  3 года назад +3

      1) Oops! That's a typo. It should say "median".
      2) The CI was found by identifying the 2.5% and 97.5% quantiles, which were shifted as seen in the video.
      3) I do not think so.

    • @SwapperTheFirst
      @SwapperTheFirst 3 года назад

      @@statquestThanks, Josh! Could you please elaborate on the CL for medians. _Why_ it is so shifted to the left, compared to CL for mean values. I'm so sorry to bother, but it seemed that I _get_ it, while in reality I cannot understand why the CL for median values is so, so different from CL for means.
      I've purchased your PCA guide. Pure awesomeness!

    • @statquest
      @statquest  3 года назад

      @@SwapperTheFirst Thank you for supporting StatQuest!!! As I wrote earlier, the CI was found by identifying the 2.5% and 97.5% quantiles (95% of the quantiles are between 2.5 and 97.5). If that doesn't make sense to you, consider watching the StatQuest on quantiles: ruclips.net/video/IFKQLDmRK0Y/видео.html

    • @phelipe2587
      @phelipe2587 3 года назад

      I was thinking about Central Limit Theorem.. The sample data comes from some unknow distribution, so if we generate a new dataset and calculate the mean over and over again.. the histogram of these means will be like a normal distribution? If I'm not wrong, that's what central limit theorem is about, right? Unless it doesn't work when you repeat bootstrap like 1,000 or 10,000 times.. i don't know, I'm confusing

    • @SwapperTheFirst
      @SwapperTheFirst 3 года назад

      @@phelipe2587 This is my thought exactly. Using bootstrapping (random process) we get a normalized distribution (for example, of means), even when initial distribution is not normalized.
      I want to make a small experiment, though. I will get data from Josh deck (8 datapoints) and will run the bootstrap, say 10K, using highly random data (say, from random.org). Then I will get 8 datapoints from some other distribution, which is not normal (say, wealth distribution in US) and again, compare with bootstrap distro after 10K.
      Also want to check the median CL in bootstrapped distro, since I (alas) still don't get it.
      But when you play with actual data, instead of endless theories - sometimes you may have an insight.

  • @MrWater2
    @MrWater2 9 месяцев назад

    How do you calculate the confidence interval in the bootstrap?

    • @statquest
      @statquest  9 месяцев назад

      This is explained at 6:40

  • @drachenschlachter6946
    @drachenschlachter6946 2 года назад

    Can you please do a video of the Monte carlo simulation?

    • @statquest
      @statquest  2 года назад

      I'll keep that in mind.

  • @ishangrotra7265
    @ishangrotra7265 Год назад +1

    Hello i had a question when you said that the confidence interval contain 0 in it shouldn't it be 0.5 since that is the mean ?

    • @statquest
      @statquest  Год назад

      The purpose of the 95%CI is to tell us whether or not the observed mean, 0.5, is statistically different from 0, and, in this context, when a 95%CI contains 0, we fail to reject the hypothesis that there is a statistically significant difference between the observed mean and 0.

    • @ishangrotra7265
      @ishangrotra7265 Год назад

      @@statquest hello josh thank you for replying just one more question so whenever the CI contains 0( or the mean we are trying to differentiate from) in it we will fail to reject the null hypothesis correct ?

    • @statquest
      @statquest  Год назад

      @@ishangrotra7265 That's the idea, however, I believe the null specifically refers to 0.

    • @ishangrotra7265
      @ishangrotra7265 Год назад +1

      @@statquest thank you josh please keep up the good work you have of a really great help !

  • @75hilmar
    @75hilmar 2 года назад

    0:40 I bet this is the official thought process of pharmaceutical companies

  • @aditya4974
    @aditya4974 3 года назад +2

    Bam!! Thank you!

  • @vandanakannan6883
    @vandanakannan6883 2 года назад

    Thank you so much for this video!! Are there any cases where bootstrapping must NOT be used?

    • @statquest
      @statquest  2 года назад

      Not that I know of off the top of my head. Maybe if you only have 3 or 4 measurements, maybe then bootstrapping will not be very useful.

  • @alexlee3511
    @alexlee3511 11 месяцев назад

    Nice video! but can I understand that this method is similar to permutation but differs to which permutation create a new dataset by blending the data together? do bootstrapping involve any blending of sample?

    • @statquest
      @statquest  11 месяцев назад +1

      To be honest, I'm not sure if "blending" is a technical term or not. Bootstrapping is related to permutation methods, but fundamentally different since it allows, and actually requires, sampling with replacement.

  • @THEMATT222
    @THEMATT222 2 года назад +3

    Before this video, I thought bootstrapping was a way of tying your shoe laces, lol

  • @arunkennedy9267
    @arunkennedy9267 7 месяцев назад

    How is it different from drawing a normal distribution and getting the probability from that??

    • @statquest
      @statquest  7 месяцев назад

      That question is answered at 8:13

  • @shadowguo5431
    @shadowguo5431 2 года назад

    This indifference voice is damn attractive

  • @mitsuhafan1543
    @mitsuhafan1543 2 года назад +1

    this video is so great :3 bam!