Introduction to Bayesian Statistics - A Beginner's Guide

  • Published: 1 Nov 2021
  • Bayesian statistics is used in many different areas, from machine learning, to data analysis, to sports betting and more. It's even been used by bounty hunters to track down shipwrecks full of gold!
    This beginner's course introduces Bayesian statistics from scratch. It is appropriate both for those just beginning their adventures in Bayesian statistics as well as those with experience who want to understand it more deeply.
    If you enjoyed this and want to support me make more videos, you can make a one-off small contribution here: buymeacoffee.com/woodylewenstein

Comments • 157

  • @joedixon06
    @joedixon06 2 months ago +4

    Excellent. In general I leave comments 0.5% of the time. But when I think something is really superb I always leave a comment. What is the chance I thought Woody’s tutorial was really superb?

    • @woodyrow
      @woodyrow  2 months ago

      Thanks! Glad my video made the cut!

  • @acm...
    @acm... 2 years ago +2

    Excellent breakdown of the topic! The final parts about simulating Bayes' theorem in Excel drove the whole idea home really well.

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 2 years ago +4

    This is excellent! Clear, concise and systematic. Best explanation I have seen of Bayes thus far.

  • @KazaaModo
    @KazaaModo 2 years ago +18

    What a greatly structured session. Learnt so much new stuff. Thanks.

  • @reigak6599
    @reigak6599 2 years ago +5

    Everything was explained perfectly. This video deserves more viewers and comments. Thank you so much for sharing with us.

  • @subzero4579
    @subzero4579 1 year ago +1

    This is the best description of Bayesian statistics I've seen

  • @jomath5790
    @jomath5790 14 days ago

    This video deserves more likes.

  • @nicolasabboud8577
    @nicolasabboud8577 9 months ago +1

    The best tutorial on YouTube that explains Bayesian statistics so far! 🌷

  • @sherrifmarty00
    @sherrifmarty00 2 years ago +1

    Thanks for educating us, Mr. Lewenstein!

  • @nickdagostino1844
    @nickdagostino1844 8 months ago +2

    Great explanation, I like how you gradually introduced the different concepts:
    from probability -> conditional probability -> baby Bayes -> Bayes

  • @kaskilelr3
    @kaskilelr3 1 year ago +1

    Thank you so much for this tutorial. Very clear and with very interesting examples. I am so glad I found this channel.

  • @epilepsyawareness
    @epilepsyawareness 1 year ago +2

    Outstanding explanations.
    Thank you.

  • @jomath5790
    @jomath5790 14 days ago

    I just love the flow of the lecture.

    • @woodyrow
      @woodyrow  14 days ago +1

      Thank you so much! Lovely to hear that!

  • @harryhirsch2024
    @harryhirsch2024 8 months ago

    This is the best explanation I ever saw.

  • @pravinhonmute6608
    @pravinhonmute6608 10 months ago +1

    Wonderful teaching! Earlier I found probability difficult to understand, but now it seems easy. Thanks, prof.

  • @cristinamorenodelcastillo8527
    @cristinamorenodelcastillo8527 9 months ago +1

    The first lesson is everything! I finally understand the fundamental difference between Bayesian and frequentist statistics. 🎉 beautifully explained, thank you!

    • @woodyrow
      @woodyrow  8 months ago

      Thank you so much!

  • @unpatel1
    @unpatel1 2 years ago +1

    This was a great intro and I enjoyed it! Thank you.

  • @TonyLMai
    @TonyLMai 2 years ago +1

    Well done Woody!

  • @Sergei-ld1iv
    @Sergei-ld1iv 1 year ago +2

    This is a really great class!!! Many thanks! You are a great teacher, as you can put yourself in the shoes of a student and highlight the least clear connections!!! The graphics are awesome as well: very clear, down to the detail, not redundant!!! It's a real pity, and weird, that this channel hasn't attracted more subscribers!

    • @woodyrow
      @woodyrow  1 year ago

      Thanks so much!

    • @codewithme2day
      @codewithme2day 8 months ago

      @woodyrow Why are we using the conditional formulas instead of the Bayes formula that I know, which is P(B|A) = P(A|B) × P(B) / P(A)? I am new to statistics. Please explain.

  • @mikolajwojnicki2169
    @mikolajwojnicki2169 5 months ago +1

    Excellent video! I needed to refresh my Bayesian statistics knowledge and this was a perfect start.

    • @woodyrow
      @woodyrow  4 months ago

      Glad it helped!

  • @SM-xn9bv
    @SM-xn9bv 9 months ago +2

    As I wrote before, this is the best explanation of Bayesian statistics ever - THANK YOU VERY MUCH!!! I keep coming back to it to refresh the concept and vocabulary :)
    When we talk about the numerator ( P(a=5, b=3 | Bob wins) * P(Bob wins) ) of the probability equation (that Bob wins given a=5 and b=3) at 1:07:20: basically we cannot operate with these terms separately (P(a=5, b=3 | Bob wins) * 0.057) and have to merge them together into the distribution y = (8 choose 3) x^6 (1-x)^5. I am just trying to catch the moment when it pivots from the frequentist to the Bayesian view :). In other words, if we look at this formula as a pure math expression, we could cancel out the y = (8 choose 3) x^3 (1-x)^5 part and be left only with (3/8)^3, which would then be a purely frequentist estimate of the probability of Bob winning (I do understand that in fact we have the areas under the graphs for the respective distributions in the numerator and denominator). If you were patient enough to read to this point :) - what is confusing is the P(a=5, b=3 | Bob wins) part of the numerator, which is hard to imagine...

  • @abhishekojha6382
    @abhishekojha6382 1 year ago

    Amazing video! Such clarity & presentation! Thank you! Learned a lot!

    • @woodyrow
      @woodyrow  1 year ago

      Thanks so much Abhishek! So nice to hear that.

  • @barttrudeau9237
    @barttrudeau9237 1 year ago +2

    I have been interested in Bayesian analysis for a few years and seen dozens of videos. This is the best video I have seen to learn the concepts. Thank you so much for producing and sharing this knowledge!

    • @woodyrow
      @woodyrow  1 year ago

      I'm thrilled to hear this! So glad you enjoyed it. Woody


  • @ABurns-lk5lp
    @ABurns-lk5lp 2 years ago +1

    Great post mate, keep it going.

  • @aniket1983
    @aniket1983 2 years ago +1

    This is not just one of the best but THE best video on Bayesian theory... thank you so much for doing this...

    • @woodyrow
      @woodyrow  2 years ago

      Thank you so much!

  • @swap_sid
    @swap_sid 2 years ago

    thanks for the lecture! well explained!

  • @nelsondelarosa5490
    @nelsondelarosa5490 5 months ago +1

    Indeed your video is FANTASTIC and IMMENSELY helpful. Thanks!!!

    • @woodyrow
      @woodyrow  4 months ago

      Thanks! Glad you liked it!

  • @kartikeyaagrawal7301
    @kartikeyaagrawal7301 2 years ago

    This is so underrated

  • @CanDoSo_org
    @CanDoSo_org 1 year ago

    Great tutorial. Thanks, man.

  • @JeanOfmArc
    @JeanOfmArc 4 days ago

    This was an awesome video; really appreciate it!

  • @kunwar_divyanshu
    @kunwar_divyanshu 1 year ago

    Please make some more videos like this on stats topics. Thank you, Sir, for this wonderful explanation.

  • @ProfessorSaadExplains
    @ProfessorSaadExplains 6 months ago

    Great lesson! Thank you. Keep up the good work.

  • @justinjoseph4746
    @justinjoseph4746 2 years ago +2

    Great! Really enjoyed learning this. Thank you

    • @woodyrow
      @woodyrow  2 years ago

      Thanks so much Justin!

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 2 years ago +1

    Woody,
    Thanks for an outstanding high-quality video. I was quite happy until the very end, when you did your simulation in Excel. That is not reproducible (even though it is pseudo-random) as it would have been in a program like R (or Python). There are indeed many packages in R that do Bayes, but the recent addition "bayesrules" takes the cake. It is a simple and user-friendly package that is worth taking a look at.

  • @kazisabbirahmadnahin2814
    @kazisabbirahmadnahin2814 2 years ago +1

    Great effort💗
    Keep making more videos on this topic.

    • @woodyrow
      @woodyrow  2 years ago

      Thanks! I will try to do more!

  • @sakkariyaibrahim2650
    @sakkariyaibrahim2650 9 months ago

    excellent lecture

  • @marianelamarino4915
    @marianelamarino4915 10 months ago +2

    So clear, excellent video! It was very helpful 👏🏼👏🏼 Thank you!

    • @woodyrow
      @woodyrow  10 months ago

      Thanks so much! Really glad you enjoyed it!

  • @ASMM1981EGY
    @ASMM1981EGY 1 year ago +1

    Much better explanations than the famous channel of StatQuest. Thanks a lot.

    • @woodyrow
      @woodyrow  1 year ago +1

      That's very kind of you to say, though I think in general StatQuest is really good!

  • @katiedaubney674
    @katiedaubney674 2 years ago

    This is so clear. Thank you so much!

    • @woodyrow
      @woodyrow  2 years ago

      So glad you enjoyed it!

  • @joshuabrown9898
    @joshuabrown9898 2 years ago +1

    This is so good!

  • @kylat5496
    @kylat5496 2 years ago +15

    Thank you for teaching us trash goblins! We are forever in your debt

  • @widheg
    @widheg 1 year ago

    Excellent!

  • @shlomopreiss-bloom7577
    @shlomopreiss-bloom7577 1 year ago

    Excellent lesson - thanks!

  • @HuseyinOmerErgen
    @HuseyinOmerErgen 1 year ago

    32:30 minor nitpick but I would say it is higher than 45% because running faster would make someone more likely to be on the running team. Good lecture so far!

  • @gregoryzoro6063
    @gregoryzoro6063 1 year ago

    Excellent simulation with Excel functions for Bayesian estimates !!

    • @woodyrow
      @woodyrow  1 year ago

      Thank you! Glad you liked it.

  • @lpt369
    @lpt369 3 months ago

    Thank You. So helpful!

    • @woodyrow
      @woodyrow  3 months ago +1

      Thanks! Glad you found it helpful!

  • @paulowusu-sekyere87
    @paulowusu-sekyere87 1 year ago

    well explained

  • @evanskip1
    @evanskip1 2 years ago +6

    Well done! Among the best of what I have watched so far on Bayes' theorem. I suggest you delve further into Bayesian stats with these lessons in future videos.

    • @woodyrow
      @woodyrow  2 years ago +1

      Thanks so much! Yes I'd love to do a second part where I go deeper into this.

    • @evanskip1
      @evanskip1 2 years ago

      @@woodyrow Great, I am looking forward to it and will share it with my colleagues and students as well.

  • @princedawodu4236
    @princedawodu4236 1 year ago

    Wow, thank you.

  • @joserobertopacheco298
    @joserobertopacheco298 2 years ago +1

    I am from Brazil. Excellent explanation, very good job. Thank you and congratulations.

    • @woodyrow
      @woodyrow  2 years ago

      Thanks so much! Glad you enjoyed it.

    • @joserobertopacheco298
      @joserobertopacheco298 2 years ago

      @@woodyrow Thank you for your answer. I just subscribed to your channel.

  • @RohanPharmaceutics
    @RohanPharmaceutics 1 year ago

    This is my best class ever

    • @woodyrow
      @woodyrow  1 year ago

      Thanks Rohan! Glad you like it!

  • @elliehomes
    @elliehomes 4 months ago

    Thank you!

    • @woodyrow
      @woodyrow  4 months ago

      My pleasure!

  • @Vanadium404
    @Vanadium404 5 months ago

    Woah! Quality stuff, and your examples made it much easier to grasp the intuition underlying these Bayesian concepts. Regards from Pakistan

    • @woodyrow
      @woodyrow  5 months ago

      Thanks so much!

  • @sakkariyaibrahim2650
    @sakkariyaibrahim2650 9 months ago

    really enjoyed

    • @woodyrow
      @woodyrow  9 months ago

      Thanks so much! Really glad you enjoyed it!

  • @rocknroll909
    @rocknroll909 1 year ago

    Incredibly cool stuff. You're a great teacher, thank you so much for this

    • @woodyrow
      @woodyrow  1 year ago

      Thank you so much!! Glad you liked it!

    • @nassersaed4993
      @nassersaed4993 1 year ago

      @woodyrow great video 👍 the only thing not clear for me is how you derived the 1/9 denominator at 1:05, how did you calculate it? Can you please refer to a resource exploring the formula in detail 🙏

    • @woodyrow
      @woodyrow  1 year ago

      @@nassersaed4993 glad you like it! The 1/9 is calculated using integration around 1:05. Watch that section and see if it makes sense. You'll need to know about integration though, which is a pretty big topic.

  • @samherman1437
    @samherman1437 5 months ago +1

    Great !!!

  • @TheRookieNerds
    @TheRookieNerds 2 years ago +1

    Wow that was very clear and engaging

  • @McSwey
    @McSwey 2 years ago

    very cool

  • @fernandojackson7207
    @fernandojackson7207 4 months ago +1

    Excellent presentation. Wouldn't the Monty Hall problem be an example of where using Bayes would be helpful? The update info would be that the host, Monty Hall, will switch to a door with a goat. The setup is this: Monty Hall is the host of a TV show, where a contestant must choose one of three doors, where there are goats behind two of the doors and a car behind the other door. If the contestant chooses the door with the car, they get to keep it. If they choose a door with a goat behind it, they're out. The additional info is that once the contestant selects one of the doors, Monty will stop the show, open up one of the doors containing a goat, and proceed to ask the contestant if they'd prefer to switch to another door. Then the question is whether it is a good idea for the contestant to switch. The answer is yes: by choosing at random, uniformly, the contestant will have initially chosen the car only 1/3 of the time, and one of the two goats 2/3 of the time. So, 2/3 of the time, the contestant will have made the wrong choice and will improve the odds by switching. I hope this isn't too confusing.

    • @woodyrow
      @woodyrow  4 months ago +1

      Exactly! Yes, the Monty Hall problem is a very good example of Bayes' theorem in practice!
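The switching argument described in this thread can be checked with a short simulation (a sketch of my own, not from the video):

```python
import random

def monty_hall(trials=100_000, seed=0):
    """Estimate win rates for the 'stay' and 'switch' strategies."""
    rng = random.Random(seed)
    stay_wins = switch_wins = 0
    for _ in range(trials):
        car = rng.randrange(3)   # door hiding the car
        pick = rng.randrange(3)  # contestant's initial choice
        # Monty opens a door that is neither the pick nor the car
        opened = next(d for d in range(3) if d != pick and d != car)
        # Switching means taking the one remaining unopened door
        switched = next(d for d in range(3) if d != pick and d != opened)
        stay_wins += (pick == car)
        switch_wins += (switched == car)
    return stay_wins / trials, switch_wins / trials

stay, switch = monty_hall()  # stay lands near 1/3, switch near 2/3
```

The simulation conditions on exactly the evidence Bayes' theorem would use: which door Monty opened.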

    • @fernandojackson7207
      @fernandojackson7207 4 months ago

      @@woodyrow Hi again, if I may ask a more general question in statistics: Once a hypothesis test has been set up (difference of means, proportion, ANOVA, etc.), is there a "natural" way of defining the statistic to be used to determine whether we reject or don't reject the null hypothesis?

    • @woodyrow
      @woodyrow  4 months ago +1

      Great question @@fernandojackson7207. I suppose overall I'd say there is not a general natural way of doing this. In practice, you learn when and where to use each one through practice.
      I think the best I could do is maybe to suggest thinking through the following:
      1. Type of Data: The nature of the data (e.g., categorical vs. continuous, paired vs. independent samples) significantly influences the choice of the test statistic. For example, a t-test is appropriate for comparing the means of two independent samples of continuous data, whereas a chi-square test is used for categorical data.
      2. Hypothesis Being Tested: The hypothesis itself (difference of means, proportion, variance, etc.) guides the choice of the statistic. For instance: For testing differences between means, you might use a t-statistic in a t-test.
      For proportions, a z-statistic might be used in a z-test.
      For comparing variances, an F-statistic is used in ANOVA (Analysis of Variance).
      3. Assumptions Underlying the Statistical Test: Each statistical test comes with its own set of assumptions (e.g., normality, homogeneity of variances, independence). The choice of statistic is contingent upon whether these assumptions are met. For example: A t-test assumes normally distributed differences, but if this assumption is violated, a non-parametric test like the Mann-Whitney U test might be more appropriate.
      ANOVA assumes homogeneity of variances among groups; if this is not met, you might use a Welch's ANOVA instead.
      4. Design of the Study: The study design (e.g., matched pairs, blocked designs) also influences the choice. For matched pairs, a paired t-test uses the differences within each pair as the data for analysis.
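As a tiny illustration of the points above (with hypothetical numbers, not from the video), the Welch t-statistic behind an unequal-variances t-test is only a few lines:

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t-statistic for two independent samples (unequal variances allowed)."""
    va, vb = variance(a), variance(b)  # sample variances (n-1 denominator)
    return (mean(a) - mean(b)) / sqrt(va / len(a) + vb / len(b))

# Hypothetical samples with equal spread and means differing by 1:
t = welch_t([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])  # t = -1.0
```

In practice one would hand this job to a statistics library, but seeing the formula makes clear why the type of data and the hypothesis drive the choice of statistic.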

    • @woodyrow
      @woodyrow  4 months ago +1

      But beyond these sorts of observations, it's often just the case that through experience you begin to recognise what is appropriate in which situations. Hope that helps a little!

    • @fernandojackson7207
      @fernandojackson7207 4 months ago

      @@woodyrow Thank you so much for the explanation, Woody.

  • @ghostiekos
    @ghostiekos 2 years ago +4

    notsoErudite has sent her trash goblins. Ty sir woody

  • @krgonline
    @krgonline 2 years ago +1

    Great session. A doubt: in calculating the Bayesian probability (~1:07), the probability of E|H (the numerator) has not been multiplied by the prior probability (presumably 0.5). Am I missing something here?

    • @woodyrow
      @woodyrow  2 years ago

      Hi. Excellent question. You are right to wonder about this; to keep things simple I COMPLETELY ignored the subtle details. Firstly, the prior for this shouldn't be 0.5, since we just don't have any evidence at all before the game starts. So assuming 0.5 at the beginning artificially skews results. In technical terms, this is known as a "nuisance parameter", and we essentially get to ignore it and just integrate. Check out this article for a full explanation of why: jakevdp.github.io/blog/2014/06/06/frequentism-and-bayesianism-2-when-results-differ/

  • @kealebogaappolos6934
    @kealebogaappolos6934 4 months ago

    Thank you for this video. What I missed is how you calculated the probability in your first normal distribution example. You just said to look at the area under the curve, then you calculated and got the value.

    • @woodyrow
      @woodyrow  4 months ago

      Calculating probabilities with the normal distribution essentially involves using a computer or a calculator to find the area. While there is a (complicated) function for the normal distribution, there is no closed-form formula for the area under the curve; you have to use a computer or calculator to find it. Before computers, it just had to be approximated by hand!

  • @benjamindilorenzo
    @benjamindilorenzo 2 years ago

    That's a great video, bro! I am writing an essay right now comparing frequentist and Bayesian approaches. In the example with the genius and the red pill, what would a frequentist approach be? Is there any?

    • @woodyrow
      @woodyrow  2 years ago +1

      Thanks! Glad you liked it. Frequentists would come to exactly the same conclusion for the Genius/Pill example. They agree with Bayes' theorem in all theoretical settings such as this. The disagreement is in two places: 1) a philosophical disagreement about what probability means, and 2) In situations where the parameters are not known, like the billiards example at the end. In situations where the parameters are known, like when dealing with a known normal distribution, Bayesians and Frequentists agree. These examples are there to show how to work with conditional probability in a range of cases. Thanks for the question!

  • @4767039
    @4767039 1 year ago

    Best explanation! Finally I get it😂

    • @woodyrow
      @woodyrow  1 year ago

      Thanks so much! So pleased to hear this.

  • @user-rf9ll6we5t
    @user-rf9ll6we5t 3 months ago

    Do you also have videos on binomial probability? Or perhaps you know of a good introductory course or book?

  • @hemman7931
    @hemman7931 2 months ago +1

    41:28 similar to IQ distributions for men and women

  • @trb4847
    @trb4847 1 year ago

    For the running question, with probabilities represented as areas, how did you compute P(T < 14) as 0.106? (32:10 mark)

    • @woodyrow
      @woodyrow  1 year ago

      I used a normal distribution calculator, setting the mean as 15.5 and the standard deviation as 1.2, and calculating the probability of a result less than 14. I did it either using Excel's NORM.DIST function or using my calculator. Can't remember! You can also use websites like this: onlinestatbook.com/2/calculators/normal_dist.html
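For anyone following along in Python rather than Excel, the same area pops out of the standard library (a sketch using the mean 15.5 and standard deviation 1.2 quoted in this thread):

```python
from statistics import NormalDist

# P(T < 14) when finishing times T ~ Normal(mean=15.5, sd=1.2)
p = NormalDist(mu=15.5, sigma=1.2).cdf(14)
print(round(p, 3))  # 0.106, the figure asked about above
```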

  • @sandeepkarmarkar8813
    @sandeepkarmarkar8813 5 months ago

    Great session sir! Just one question: why did you take rand() < a particular column... why not >?

    • @woodyrow
      @woodyrow  4 months ago

      Hi. Sorry for the slow reply! It's just a handy way to get a random variable based on a specific probability. If I say rand()
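A sketch of the rand() trick (my illustration, not from the video): a Uniform(0,1) draw falls below p exactly p of the time, so `rand() < p` simulates an event with probability p, while using `>` would simulate the complement, probability 1 − p:

```python
import random

def bernoulli(p, rng):
    """True with probability p: a Uniform(0,1) draw is below p exactly p of the time."""
    return rng.random() < p

rng = random.Random(42)
n = 100_000
freq = sum(bernoulli(0.3, rng) for _ in range(n)) / n  # lands close to 0.3
```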

  • @JessicaVeraBermudez
    @JessicaVeraBermudez 8 months ago

    Hi Woody, thanks for this lesson! It is very useful. It's quite challenging to get rid of the frequentist mindset after spending my entire life as one, though. I just have a question: in the simulation, why did you calculate the last probability by using the number of rounds won by either Alice or Bob and not by the number of rounds (10000)? That's how simulations usually work. So I'll be more than grateful if you could help me out here with this doubt.

    • @woodyrow
      @woodyrow  8 months ago

      Hi Jessica. We are trying to work out the probability of someone winning from a position where they are losing (by 5 points to 3). So we simulate and find all the situations where someone was losing by this score, and then see (out of these!) how many times they go on to win. If we divided by 10000 we would be working out the probability that someone plays the game, falls 5-3 behind and then goes on to win, which is a different question. Our question was: IF someone is already 5-3 behind, what is the probability that they win. To use the lingo, we want to probability they win "given" that they are 5-3 behind. Hope that makes sense!
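The conditioning step described in this reply can be sketched in code (a Python sketch of the billiards simulation as I understand the setup: the line's position p is uniform on (0,1), Alice wins each round with probability p, and the first to six points wins; these modelling details are assumptions on my part):

```python
import random

def p_bob_wins_from_5_3_down(games=300_000, seed=1):
    """Among simulated games that reach Alice 5 - Bob 3, how often does Bob win?"""
    rng = random.Random(seed)
    reached = bob_wins = 0
    for _ in range(games):
        p = rng.random()  # unknown chance that Alice wins any given round
        alice_points = sum(rng.random() < p for _ in range(8))
        if alice_points != 5:  # keep only games sitting at 5-3 after eight rounds
            continue
        reached += 1
        # From 5-3 down, Bob must take the next three rounds to reach six first
        bob_wins += all(rng.random() >= p for _ in range(3))
    return bob_wins / reached  # divide by the filtered count, not by `games`

est = p_bob_wins_from_5_3_down()  # lands near 0.09, not the frequentist (3/8)^3 ≈ 0.05
```

Dividing by `reached` rather than `games` is precisely the "given that they are 5-3 behind" in the reply above.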

  • @homerf5054
    @homerf5054 10 months ago

    Great video, thank you. I feel like the pill example calculation is not correct. If there is a very tiny change in mean, like 100 -> 100.0000000001, the calculation made in the video will yield almost 50% that the genius has taken the pill, which seems incorrect. I feel like it should be [P(102)-P(100)]/P(100). I may be wrong, but the original calculation seems wrong to me. Also, the number of zeros for the calculated probability should be 5 instead of 6. Thanks

  • @stefenleung
    @stefenleung 2 years ago +1

    I have some doubt about the simulation. You just assume the line and ball are placed randomly with a normal distribution.
    What if Bob is really bad at throwing balls, so his balls are always on the side of the table, and that's how the score got to 5-3?

    • @HuseyinOmerErgen
      @HuseyinOmerErgen 1 year ago

      Then you should use a different prior probability distribution- one in which the odds between Alice-wins-the-round vs Bob-wins-the-round is always greater than 1. And to be fair he didn't assume that, it was given as random in the question itself.

  • @karannchew2534
    @karannchew2534 2 years ago +3

    34:16 "How much more likely is it that your child will be a genius if they take this pill?"
    Probability ( Took a Pill | Is a Genius )
    Why is it Probability ( Pill | Genius ) and not Probability ( Genius | Pill )?
    i.e. Probability ( Took the red pill as a child | Was going to be a genius )?

  • @unochepassa2259
    @unochepassa2259 4 months ago

    Thanks for the interesting and clear video! I have a doubt about the Amira and Jane problem: why do you assume that Jane being late has an impact on Amira being late? From what we know, one could come from the Moon and the other from Mars. Did I miss something?

    • @woodyrow
      @woodyrow  4 months ago

      Good question. I don't assume that Jane being late had an impact on Amira (in the sense that one caused the other), but we have proved that the events are not independent. This means there is some relationship between one being late and the other being late. If they were truly independent of one another, we would see the property that P(A and J)=P(A)P(J). Since we don't see this, they are not independent. So this could be that Jane being late caused Amira to be late, it could be that Amira being late caused Jane to be late, or it could be some other thing caused both to be late. Eg one is coming from the moon and one is coming from Mars, and a meteor shower caused both rockets to do an extra loop before landing!

  • @mehulajax21
    @mehulajax21 1 year ago

    How can I simulate the billiards example in R?

    • @mehulajax21
      @mehulajax21 1 year ago +1

      I did it myself. Actually, I was doing it correctly. I was getting the distribution of the Bernoulli probability of x as 0.4, which is correct based on the solution given in BDA, i.e. (k+1)/(n+2) = (3+1)/(8+2) = 0.4. Also, the MAP estimate was 3/8. My mistake was that I was taking the point estimate and cubing it. In reality I had to play 3 Bernoulli trials with the obtained value of theta and see which one actually wins, using the uniform RNG like you did in Excel. After doing that, I can see the mean of the generated quantity is 0.09 with an SD of 0.29.

  • @onchang6020
    @onchang6020 2 years ago

    Thank you for the video. I started to see the real power of Bayesian statistics only after watching this video. In the final example (56:45), the problem was first solved the frequentist way, yielding an incorrect probability of 0.05. Then the problem was solved the Bayesian way, yielding a correct probability of 0.09.
    I think the frequentist logic is wrong because it considers Bob's chance of winning a point to always be 3/8. In reality, Bob's chance of winning a point follows a distribution with a mean of 3/8. It is possible to reach the evidence from many values of Bob's chance of winning. It does not have to be 3/8. Is this a valid explanation?

  • @hmingthansangavangchhia4913
    @hmingthansangavangchhia4913 1 year ago

    38:44 I don't understand how P(Taking pill given they are genius) = 0.66 translates to the number who took the pill being double the number who didn't. Please explain this part.

  • @matthewjames7513
    @matthewjames7513 6 months ago

    I'm not sure I agree with your solution at 20:13. It works assuming conditional dependence (i.e. Jane being late is influenced by Amira). But if these were independent events, then the answer would just be 20%. If you know nothing about Jane and Amira, is it rational to assume dependence or independence? Lastly, does causal reasoning play a part here? Thanks :)

    • @woodyrow
      @woodyrow  6 months ago +1

      You are absolutely right that if the events were independent then the probability of Amira being late would still just be 20%. However, the information given in the question actually proves they are not independent. If we knew nothing about them, it would be hard to say whether it's rational to assume independence. But we DO know something about them: the information given in the question. So this shows their lateness is not independent. But this does not indicate causality. It might be that Jane's lateness causes Amira's, or the other way around. Or maybe something else causes both (e.g. maybe they catch the same bus, and when this arrives late they are often both late).

    • @matthewjames7513
      @matthewjames7513 6 months ago

      @@woodyrow Thanks for your reply! I only just discovered your channel and I really like it. I've been going through examples slowly to build intuition.
      What if Jane's lateness was always caused by bus delays, and Amira's lateness was always caused by bad weather? In this case, they would both be independent, because there's no way Jane's lateness could depend on Amira's lateness and vice versa. Why does the information in the question prove that they're dependent on each other? Is it because of the "70% neither of them is late" fact?

    • @woodyrow
      @woodyrow  6 months ago

      @@matthewjames7513 Sorry for the slow reply! If Jane's lateness was always caused by the bus, and Amira's was caused by bad weather, and these didn't interact, then we might expect the events to be independent. However, the probabilities absolutely prove they are not! Remember, the formula P(A&B) = P(A)P(B) is the necessary and sufficient condition for events to be independent.
      Look at the conditional calculation for P(Amira late given Jane is late). In general, Amira is late 20% of the time. But when Jane is late, Amira is late 60% of the time! Here's what this means: If I just asked you "what is the probability that Amira is late?", you should say "20%". However, if I also tell you that Jane is late, you should say "60%". That is, if you learn that Jane is late, you change your view about the likelihood of Amira being late. This is what it means for events to be not independent. If one happens, you adjust your view on the other. These figures (20% vs 60%) emerge mathematically from the Venn diagram. And they tell us the events aren't independent.
      I hope that all makes sense! I'm really glad you enjoyed this video by the way. Thanks for the questions!
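The figures quoted in this thread can be tied together with a little arithmetic (my reconstruction from the numbers mentioned here: P(Amira late) = 0.2, P(neither late) = 0.7, P(Amira late | Jane late) = 0.6; the derived values are mine, not from the video):

```python
p_a = 0.2          # P(Amira late)
p_neither = 0.7    # P(neither late)
p_a_given_j = 0.6  # P(Amira late | Jane late)

p_union = 1 - p_neither  # P(at least one late) = 0.3
# P(A or J) = P(A) + P(J) - P(A and J), and P(A and J) = P(A|J) * P(J):
# 0.3 = 0.2 + P(J) - 0.6 * P(J)  =>  P(J) = 0.25
p_j = (p_union - p_a) / (1 - p_a_given_j)
p_both = p_a_given_j * p_j  # P(A and J) = 0.15

# Independence would require P(A and J) = P(A) * P(J) = 0.05; 0.15 != 0.05,
# so the two latenesses are not independent.
independent = abs(p_both - p_a * p_j) < 1e-9
```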

  • @TheColorField
    @TheColorField 1 year ago

    Wouldn't a frequentist use the races THAT ALREADY OCCURRED to estimate BayesCamp's chances of winning? That seems both practical and intuitive.

    • @woodyrow
      @woodyrow  1 year ago

      Yes, but those are different races. The strict frequentist approach is to consider the very same event occurring many times. Makes sense for a dice roll, which can essentially be the same every time. Makes no sense with a horse race, since races on previous days with different horses/weather/ground are too different.
      But good question. And yes, in reality this is what people do. But this is people being bayesians!

  • @benjamindilorenzo
    @benjamindilorenzo 2 years ago

    Also, a question regarding your simulation: it's pretty evident, but WHY is the Bayesian answer more precise? Isn't it that the frequentist approach also argues from the law of large numbers?

    • @woodyrow
      @woodyrow  2 years ago

      Not just more precise - it's the only correct answer in this case! The frequentist approach goes wrong when the underlying parameters (e.g. the probability of Bob winning any given game) are not known. Frequentists assume that there is a fixed answer to what this is, and use the available data to determine what they think it is. In this case, they assume it is 3/8. Bayesians don't assume this is fixed, but think there is a distribution of different probabilities that could have explained the data. Depending on how technical you want to get with this, you could check out this article: www.countbayesie.com/blog/2021/4/27/technically-wrong-when-bayesian-and-frequentist-methods-differ. There's also a good discussion here: stats.stackexchange.com/questions/22/bayesian-and-frequentist-reasoning-in-plain-english. These get quite technical though. Thanks for the question.

  • @acharjyaarijit
    @acharjyaarijit 1 year ago

    If it were possible I would give it 100 likes. Thank you very much.

    • @woodyrow
      @woodyrow  1 year ago

      Thank you so much! A hundred times!

  • @datdao6982
    @datdao6982 2 months ago

    Question: at 34:18 the question is, if a child takes the pill, how likely is it that they will be a genius. Why, at 35:25, is being a genius the given? Shouldn't it be P(genius | pill)?

  • @zdenkogarasevic1477
    @zdenkogarasevic1477 2 months ago

    🎯 Key Takeaways for quick navigation:
    00:00 *🎓 Introduction to Bayesian Statistics*
    - Exploring Bayesian statistics from scratch.
    - Suitable for anyone interested in probability and statistics, from students to professionals.
    - Starting with fundamental questions about probability and its applications.
    01:10 *🎲 Objective vs. Subjective Views on Probability*
    - Contrasting objective (frequentist) and subjective (Bayesian) views on probability.
    - Highlighting limitations of frequentist approach, especially for one-off events like horse races.
    - Illustrating subjective Bayesian model's flexibility and rationality in handling uncertainty.
    09:33 *📊 Degrees of Belief in Bayesian Probability*
    - Bayesian probability as degrees of belief or uncertainty measures.
    - Illustrating subjective probabilities through scenarios involving pregnancy and gender prediction.
    - Emphasizing rationality in adjusting beliefs based on available evidence.
    10:01 *🧠 Conditional Probability Basics*
    - Introduction to conditional probability using simple visual examples.
    - Building intuition for conditional probability through visualizations.
    - Setting the stage for understanding Bayes' theorem.
    13:19 *📝 Formulating Baby Bayes Theorem*
    - Deriving a simplified version of Bayes' theorem using visual probability representations.
    - Demonstrating application of the theorem in simple probability problems.
    - Introducing notation and terminology for hypothesis and evidence probabilities.
    20:20 *🌳 Bayes Theorem Application with Tree Diagrams*
    - Applying Bayes' theorem to complex scenarios using tree diagrams.
    - Solving probability problems involving multiple events and conditional probabilities.
    - Demonstrating how evidence updates prior probabilities to yield posterior probabilities.
    23:34 *📈 Bayesian statistics application example: Updating probability with evidence*
    - Bayes' theorem updates the probability of an event given new evidence.
    - Example: Given the probability of sunny weather and playing tennis, Bayes' theorem helps update the probability of sunny weather given that tennis was played.
    - Demonstrates how prior beliefs are adjusted based on new information.
    24:45 *📊 Bayesian statistics application example: Probabilistic analysis in economics*
    - Scenario: Analyzing the probability of a recession given job loss using Bayes' theorem.
    - Demonstrates the use of prior probabilities and conditional probabilities in economic analysis.
    - Shows how Bayesian statistics can be applied to decision-making in economic forecasting.
    29:22 *🏃‍♀️ Bayesian statistics application example: Probability distributions in sports*
    - Example: Analyzing the probability of a girl running 100 meters in a certain time frame using normal distribution.
    - Shows how Bayesian statistics is used to update probabilities based on additional information (e.g., being in the school running team).
    - Illustrates how conditional probability influences the assessment of outcomes in sports.
    33:19 *🧠 Bayesian statistics application example: Counter-intuitive results*
    - Examines counter-intuitive outcomes using conditional probability in IQ distribution scenarios.
    - Demonstrates how small changes in distributions can lead to significant shifts in probabilities.
    - Highlights the importance of understanding conditional probability in interpreting statistical results.
    41:42 *🦠 Bayesian statistics application example: Medical diagnosis*
    - Examines a medical diagnosis scenario using Bayes' theorem.
    - Illustrates how prior beliefs are updated based on diagnostic test results.
    - Emphasizes the significance of understanding conditional probability in medical decision-making.
    48:11 *📊 Understanding Bayes Theorem through an Example*
    - Explains the application of Bayes Theorem using an example involving Steve, a shy individual, to illustrate how prior probabilities and evidence combine.
    - Demonstrates how intuition can be misleading when prior probabilities and evidence are not considered.
    - Breaks down the calculation process step by step, showing the application of Bayes Theorem in determining the probability of Steve being a librarian given certain traits.
    51:42 *📈 Formal Naming and Components of Bayes Theorem*
    - Defines the formal components of Bayes Theorem: prior, posterior, likelihood, and evidence.
    - Illustrates the terminology used in relation to each component, such as "prior" for the initial probability, "posterior" for the updated probability, "likelihood" for the probability of evidence given a hypothesis, and "evidence" for the total probability of the observed evidence.
    - Provides insights into the significance of each component in Bayesian inference and decision-making processes.
    56:36 *🔍 Exploring a Complex Example: Bayesian Approach vs. Frequentist Approach*
    - Introduces a more complex example involving a game between Alice and Bob to compare Bayesian and frequentist approaches.
    - Contrasts the frequentist method, which relies on straightforward calculations, with the Bayesian method, which involves applying Bayes Theorem to update probabilities based on evidence.
    - Demonstrates how Bayesian inference can provide more accurate predictions by considering prior probabilities and updating them with observed evidence, even in complex scenarios.
    Made with HARPA AI

  • @lalasalalasa2428
    @lalasalalasa2428 Год назад +1

    I got stuck at the pill problem

  • @lalasalalasa2428
    @lalasalalasa2428 Год назад

    What does "first to 6 wins" in the game mean?

    • @woodyrow
      @woodyrow  Год назад

      They keep playing this game and recording who wins each time. Once a player has won 6 times they win the whole game. Think of it like tennis! The first player to win 6 games wins the set. It's a bit like that.
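      The scoring rule in that reply can be sketched in a few lines (the function name and the coin-flip mechanics are mine, for illustration only):

```python
import random

def play_match(p_alice, rng):
    """Play points until one player reaches 6 wins; return the overall winner."""
    alice = bob = 0
    while alice < 6 and bob < 6:
        if rng.random() < p_alice:  # each point goes to Alice with probability p_alice
            alice += 1
        else:
            bob += 1
    return "Alice" if alice == 6 else "Bob"

print(play_match(0.5, random.Random(42)))  # either "Alice" or "Bob"
```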

  • @jamilkhan-dx6vo
    @jamilkhan-dx6vo Год назад

    I think there is a mistake in calculation at 28:58 P(E) = 0.445 and not 0.085 (In question related to recession and losing job)

    • @woodyrow
      @woodyrow  Год назад +1

      I'm not seeing the error. P(E) = P(he loses his job). We calculate this by saying either there is a recession and he loses it, or there's not a recession and he loses it. The calculation for that is: 0.1x0.4 (recession and he loses his job) + 0.9x0.05 (no recession and he loses his job) = 0.085. Let me know if that makes sense or if you think there is an error with that.
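      The arithmetic in that reply can be checked in a few lines (variable names are mine; the probabilities are the ones quoted above):

```python
# Figures from the video's example: P(recession) = 0.1, P(lose job | recession) = 0.4,
# P(lose job | no recession) = 0.05.
p_recession = 0.10
p_lose_given_recession = 0.40
p_lose_given_no_recession = 0.05

# Total probability of the evidence, P(E) = P(he loses his job):
p_lose = (p_recession * p_lose_given_recession
          + (1 - p_recession) * p_lose_given_no_recession)
print(round(p_lose, 3))  # 0.085

# Posterior via Bayes' theorem: P(recession | he loses his job)
p_recession_given_lose = p_recession * p_lose_given_recession / p_lose
print(round(p_recession_given_lose, 3))  # 0.471
```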

    • @jamilkhan-dx6vo
      @jamilkhan-dx6vo Год назад +1

      @@woodyrow Understood sorry my mistake Thanks for the reply

  • @Shkib0y
    @Shkib0y Год назад

    I have a slight gripe with the disease example: false negatives are independent of false positives, and that should be made explicitly clear. Even if it's just stating the false positive probability rather than inferring it without comment.

    • @woodyrow
      @woodyrow  Год назад +1

      Absolutely right. False positives and false negatives are almost never equally likely. And fair point, it would have been better to just mention this, even if I kept things as they are to keep the maths simple.
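      To make the point concrete, here is a small sketch where the two error rates differ (the numbers are illustrative, not from the video, and the function name is mine):

```python
def posterior_positive(prevalence, sensitivity, false_positive_rate):
    """P(disease | positive test) via Bayes' theorem, with the false-positive
    rate specified separately from the false-negative rate (1 - sensitivity)."""
    p_positive = (prevalence * sensitivity
                  + (1 - prevalence) * false_positive_rate)
    return prevalence * sensitivity / p_positive

# 1% prevalence, 1% false negatives (99% sensitivity), but 5% false positives:
print(round(posterior_positive(0.01, 0.99, 0.05), 3))  # 0.167
```

      Keeping the two rates as separate parameters forces the distinction the comment asks for, instead of silently reusing one number for both.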

  • @nazeerahmed1509
    @nazeerahmed1509 6 месяцев назад

    Correction @26:49 'we therefore know that there's a 95% chance that he will NOT lose his job ' instead of 'we therefore know that there's a 95% chance that he will lose his job '

    • @woodyrow
      @woodyrow  6 месяцев назад

      Good spot! 4 years and I never noticed this!

  • @julianmichaelis1745
    @julianmichaelis1745 4 месяца назад

    But aren't you assuming that P(Bob wins) is 100%? So you start with a very strong prior. Or where am I wrong? ;-)

    • @woodyrow
      @woodyrow  4 месяца назад

      I don't think I ever assume this, but can you let me know at which point you think I might have?

    • @zilyuji3143
      @zilyuji3143 3 месяца назад

      I guess what you are referring to is the part where he derives the equation for P(a = 5 ∧ b = 3 | Bob wins)·P(Bob wins) at 1:07:49?
      This probability simplifies to P((a = 5 ∧ b = 3) ∧ Bob wins), which is equivalent to P(a = 5 ∧ b = 6): that is the only state under which Bob could win.

  • @robwin0072
    @robwin0072 2 года назад

    I am only 4:30 (min/sec) into the video - I will complete the session; however, going forward, "die" is the singular of "dice". It's a little unsettling calling a die, dice. ☺️☺️☺️
    I will return to this comment with my overall opinion, although, to date, I am a Frequentist.

    • @woodyrow
      @woodyrow  2 года назад

      I'm a modern man - "dice" can be both plural and singular (grammarist.com/usage/dice-die/) 😉 "Die" sounds unnatural and unsettling to me!
      Hope you enjoy the rest of the video. Let's see if I can convince you bayesianism is the way to go!

    • @robwin0072
      @robwin0072 2 года назад +1

      @@woodyrow Touché
      😀😀😀
      I made it safely through it.

  • @simonmasters3295
    @simonmasters3295 2 месяца назад

    @ 38 minutes you say "1 in a Million" and represent it as 0.00000011. "That's 1.1 in 10 million"
    You are getting this from a normal distribution and I think you have misinterpreted the statistic, rather than used the wrong figure.
    0.1 is "1 in 10", 0.01 "1 in 100", 0.001 "1 in 1,000" etc

    • @woodyrow
      @woodyrow  2 месяца назад

      Good spot that there's an issue here. I'd never noticed! I just re-did the calculation, and in fact I said the correct thing but wrote down the incorrect number. The actual probabilities are 1.1 x 10^-6 and 2.1 x 10^-6. Roughly 1 in a million and 2 in a million. But I seem to have put extra zeros in my decimals. How annoying! Thanks for letting me know though!
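      Since the mix-up above is between 1.1 × 10^-6 (about 1 in a million) and 0.00000011 (1.1 × 10^-7, about 1 in 10 million), a tiny helper that converts a probability into "1 in N" wording makes the reading unambiguous (the function is mine):

```python
def one_in(p):
    """Express a small probability p as '1 in N'."""
    return f"1 in {round(1 / p):,}"

print(one_in(1.1e-6))      # 1 in 909,091 -- roughly 1 in a million
print(one_in(0.00000011))  # 1 in 9,090,909 -- roughly 1 in 10 million
```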

  • @jeffreyjones7038
    @jeffreyjones7038 2 года назад

    "Ignore biology and assume either a boy or girl"? Biology says those are the only options...

    • @woodyrow
      @woodyrow  2 года назад +6

      Hi Jeffrey. A small % of people are intersex (I don't know what that % is), so it keeps things simpler just to ignore that. And the proportion of males to females fluctuates (it isn't always 1:1). I'm not a biologist, so ignoring these issues makes the maths easier!

  • @LC-px9pb
    @LC-px9pb 6 месяцев назад

    “Ignore biology and assume everyone is male or female”? Huh?

    • @LC-px9pb
      @LC-px9pb 6 месяцев назад

      Better guide to Bayes ruclips.net/video/cqTwHnNbc8g/видео.htmlsi=6s0zOwifUtggbolo

    • @woodyrow
      @woodyrow  6 месяцев назад +1

      A small % of people are intersex. I have no idea what this % is, but just acknowledging that it's not exactly 50% male, 50% female. Pretending it is makes the probability much easier!