5 Probability Distributions you should know as a Data Scientist

Поделиться
HTML-код
  • Опубликовано: 27 окт 2024

Комментарии • 46

  • @rishisharma8311
    @rishisharma8311 3 года назад +11

    The real life examples for each of the distribution were amazing !!

    • @CodeEmporium
      @CodeEmporium  3 года назад

      Glad you liked them. Many thanks :)

  • @hkumar7340
    @hkumar7340 2 года назад +12

    1:00 Normal Distribution
    5:37 Log-normal Distribution
    7:30 Uniform Distribution
    8:48 Beta Distribution
    10:33 Chi-squared Distribution.

  • @McMurchie
    @McMurchie 3 года назад +5

    Ahhh, there are like a million Data/ML channels but this is still the only one I subscribe to (after being burned a bit by Siraj). Love this guys ability to articulate complex phenomena in a way that makes sense.

    • @CodeEmporium
      @CodeEmporium  3 года назад

      Thanks for being a part of the community 🙂

    • @McMurchie
      @McMurchie 3 года назад

      @@CodeEmporium pleasure!

  • @harry8175ritchie
    @harry8175ritchie 2 года назад

    Counts are often distributed under a Poisson. The domain is very important to mention when selecting a distribution, and the discrepancy between probability mass functions and density functions. Keep it up man, love your stuff.

    • @CodeEmporium
      @CodeEmporium  2 года назад

      Thank you! More math videos to come!

  • @ronin2963
    @ronin2963 3 года назад

    Nice summary of five different topics that could be their own lessons

    • @CodeEmporium
      @CodeEmporium  3 года назад

      Thank you. Will def dive into these topics in thier videos in some consumable form. I just need to think of the best way to deliver this content

  • @mohammadrezaghiasy6618
    @mohammadrezaghiasy6618 3 года назад

    Hey buddy. Awesome as always. THANK YOU 💓

  • @bipinkapri9986
    @bipinkapri9986 3 года назад +1

    That was really helpful! Amazing content!

    • @CodeEmporium
      @CodeEmporium  3 года назад

      Many thanks and very glad you enjoyed it :)

  • @scott7948
    @scott7948 2 года назад

    You missed tweedie distribution which is used in insurance modelling

  • @hamzadata
    @hamzadata 9 месяцев назад

    Man you are awesome!

  • @doristhebrowndog
    @doristhebrowndog Год назад

    how are y’all so smart… i left everything i learned about statistics back at where it started, at Uni :(

    • @CodeEmporium
      @CodeEmporium  Год назад +3

      Honestly I did the same. But the more you work with this stuff on applications, the better you’ll remember it. :)

  • @yensteel
    @yensteel 3 года назад +2

    Is there a way to create a custom probability distribution from a sample dataset? It can then generate new data with similar characteristics while remaining completely continuous?

    • @CodeEmporium
      @CodeEmporium  3 года назад +2

      Yes that is possible. In python, scipy has distributions where you call a "fit" function and pass in sample data. For example, check out scipy.beta.fit.

    • @yensteel
      @yensteel 3 года назад

      @@CodeEmporium Thank you so much for the reply!

    • @harry8175ritchie
      @harry8175ritchie 2 года назад +1

      @@CodeEmporium Man, I feel like I'm really annoying here. I'm sorry! Be careful with this. Understand your data first: if there's any domain expertise you can throw into this, the data may be enforced to be a certain distribution, despite it not looking like it yet.
      For example: counting the number of times you see cars drive past your house within one hour blocks. Maybe you collect a handful of data. You notice a small tail at 2-5 cars, a peak at 7 cars, and a tail at 10 cars. You might think this is normal, but from the definition of the experiment, this is indeed a Poisson distribution: counting within set intervals.

  • @timz2917
    @timz2917 8 месяцев назад

    The sample means can still be normal even if the samples arent

  • @monkyebrain
    @monkyebrain 3 года назад +1

    Weibull gang stand up!

  • @shaelanderchauhan1963
    @shaelanderchauhan1963 2 года назад

    Data is is just a game of giving 100 different fancy names for the same concept to make it Extremely confusing for learners

  • @k.alipardhan6957
    @k.alipardhan6957 3 года назад

    start at 1:00

    • @k.alipardhan6957
      @k.alipardhan6957 3 года назад

      i think 4 & 5 needed much more details, as much as we got for 1. but good video, thank you

  • @SiyaMedia
    @SiyaMedia 3 года назад

    poison ooops we need to talk about the poisson distribution as well

  • @gokulkurup1584
    @gokulkurup1584 3 года назад

    Really good content

  • @erickballesteros4531
    @erickballesteros4531 2 года назад

    good vid :)

  • @lucio8794
    @lucio8794 3 года назад

    My man, I love your videos, but the audio is often out of sync, just a heads up

    • @CodeEmporium
      @CodeEmporium  3 года назад

      Yep. Thanks for the heads up. I'm trying to get better with this for future videos :)

  • @dragonman101
    @dragonman101 3 года назад

    does anyone else see a lag between audio and video?

    • @CodeEmporium
      @CodeEmporium  3 года назад

      Sorry about that. It happens a couple of times through the video. Will try to correct for future videos

    • @dragonman101
      @dragonman101 3 года назад

      @@CodeEmporium no worries! :) I just couldn't tell if the issue was my computer or the video itself hahaha

  • @larrybird3729
    @larrybird3729 2 года назад

    no gamma :(

  • @tusharbedse9523
    @tusharbedse9523 2 года назад

    R u lipsing bro

    • @CodeEmporium
      @CodeEmporium  2 года назад +1

      Nah. It's your imagination

    • @tusharbedse9523
      @tusharbedse9523 2 года назад

      @@CodeEmporium thanks for replying.... Was watching sm of ur videos ...awseome stuff...thanks!!

  • @ssshukla26
    @ssshukla26 3 года назад

    One those videos where it's implicitly assumes that you know stats before hand and explicitly follow that assumption throughout the video...

    • @CodeEmporium
      @CodeEmporium  3 года назад

      I think only the normal distribution is technical here. The other 4 are a lot easier to pick up. Looking back, maybe could have easier explained the normal distribution. But I'll keep this mind for other videos

  • @kushagrachaturvedi2144
    @kushagrachaturvedi2144 2 года назад

    when i hear u first time its very weird u r voice does not match u. means don't know why its feels like that u r lisping and someone else is talking