Handling Imbalanced Dataset in Machine Learning: Easy Explanation for Data Science Interviews

Поделиться
HTML-код
  • Опубликовано: 7 сен 2024

Комментарии • 41

  • @emma_ding
    @emma_ding  Год назад +5

    Many of you have asked me to share my presentation notes, and now… I have them for you! Download all the PDFs of my Notion pages at www.emmading.com/get-all-my-free-resources. Enjoy!

    • @jerrywang1550
      @jerrywang1550 Год назад

      is it possible to share your notion file? Thank you

    • @emma_ding
      @emma_ding  Год назад +1

      @@jerrywang1550 You can download all the PDFs of my Notion pages at emmading.com/resources by navigating to the individual posts. Enjoy!

    • @jerrywang1550
      @jerrywang1550 Год назад

      @@emma_ding I mean your notion files, not PDF. Thank you

  • @michaeldarmanis8477
    @michaeldarmanis8477 6 месяцев назад +1

    To my view, imbalance of data does not pose a problem. During classification one ought to model class membership distributions, and these may be small. As long as they are correct, there is no problem. One should, of course, use proper scoring rules (i.e. not accuracy) to maximize the classification problem.
    Tetlock's Superforecasting serves as a wonderful and very readable introduction to predicting unbalanced classes.

  • @elonchan9675
    @elonchan9675 Год назад +7

    Hi Emma, it is a really good summary videos on the matter of imbalanced dataset. Thank you and keep up the good work!

  • @AnkurSingh-mk9rc
    @AnkurSingh-mk9rc Год назад +3

    Thanks Emma , these short videos come in handy when preparing for interview

  • @dle3528
    @dle3528 Год назад

    This video is amazing. It was easy to understand and summarized different possibilities for dealing with unbalanced data. Congratulations! Keep helping people. I am very grateful for your explanation!

  • @ankgup87
    @ankgup87 4 месяца назад

    This video helped me clear an interview. Subscribed. Thank you.

  • @psg9278
    @psg9278 7 месяцев назад

    Best Video on ML, I understood very clearly. Thank You

  • @sanyam5685
    @sanyam5685 Год назад +2

    Thanks Emma, Can we also have a series of videos on deploying ML models in production?

    • @emma_ding
      @emma_ding  Год назад

      Thanks for your comment, Sanyam! 😊 I've added your idea to my list of content suggestions.

  • @Itsdanielpeng
    @Itsdanielpeng Год назад

    This is really helpful. thank you so much for putting out these videos!

    • @emma_ding
      @emma_ding  Год назад

      So glad you find them helpful, Daniel! Thanks for watching. 😊

  • @thedislikebutton163
    @thedislikebutton163 Год назад +4

    Checkout this paper on Gumbel loss/activation for LVIS long tailed dataset, interesting method for imbalanced datasets

  • @user-qr4pi4ow7b
    @user-qr4pi4ow7b 7 месяцев назад

    Emma,great explanation and to the point.

  • @ayambavictorndoma5672
    @ayambavictorndoma5672 7 месяцев назад

    I enjoyed this video. Thanks for this Emma

  • @SonuKumar-gt5xs
    @SonuKumar-gt5xs Год назад

    Hi Emma,
    these videos are really good.
    can you make a video on time series analysis

  • @jasonswift7468
    @jasonswift7468 Год назад +2

    Hi Emma. Could you talk about chatGPT (including its model, dataset, algorithms, system design, etc) for the next video? Thank you.

    • @emma_ding
      @emma_ding  Год назад

      Thanks for your comment! 😊 I've added your idea to my list of content suggestions.

  • @spikeydude114
    @spikeydude114 Год назад

    Great topic! Thanks for covering

  • @faisalmahmud5794
    @faisalmahmud5794 17 дней назад

    I have data with class 0: 150 and only two data from class 1.
    is there any way to do classification with this data?

  • @ATN_AI
    @ATN_AI Год назад

    Hi! Is there a way you can share this notion document! Thank you!! Great content

  • @sambidpradhan32
    @sambidpradhan32 Год назад

    Hey Emma..big fan of your work😀,looking for series in model deployment.. if you can add things like processing(batch/stream), serving(batch/realtime) and learning(offline/online) part in production. sorry if it is a big ask🥲

    • @emma_ding
      @emma_ding  Год назад

      Thanks for your comment! I've added your suggestions to my list of content ideas. 😊

  • @kevinpoisson713
    @kevinpoisson713 Год назад

    In the ‘why imbalance is important’ part, the accuracy for rare event predicting model can be solved by relying on other evaluating metric such as precision and recall, isn’t that right?. It’s not explaining the why

  • @mihretdesta9153
    @mihretdesta9153 Год назад

    hey Emma please send me the code for imbalanced image datasets

  • @Aria-ow4cl
    @Aria-ow4cl Год назад

    Hi, Emma! Thanks for sharing. Very helpful materials. But i got a probleme when downloading the presentation notes, somehow the notes for imbalanced dataset is missing, when I click the imbalanced dataset notes, it actually opens the notes for encoding categorical data, could you please help with this?

    • @emma_ding
      @emma_ding  Год назад

      Thank you so much for letting me know! I apologize for the mix-up, and have corrected the issue. Thanks for your patience. 💛

  • @machinelearning6817
    @machinelearning6817 Год назад

    Subscribed !!

  • @srhrsh100
    @srhrsh100 7 месяцев назад

    You are just reading the text written in the book, try to explain with examples and further in detail, apart from what is already mentioned in the book.

  • @efeincir3254
    @efeincir3254 Год назад

    Wonderfull!

  • @owdengodson2990
    @owdengodson2990 Месяц назад

    7:02 **in the minority class

  • @pixelmasque
    @pixelmasque Год назад

    A gorgeous ML scientist

  • @pacesferry
    @pacesferry Год назад

    Hi, audio clipping detected..

  • @DSlayer007
    @DSlayer007 Год назад

    is 75:25 imbalanced dataset

  • @mihretdesta9153
    @mihretdesta9153 Год назад

    please reply me

  • @zbynekba
    @zbynekba 3 месяца назад

    Your content is good, but your strong accent needs improvement.

  • @mohamedsamy2895
    @mohamedsamy2895 3 месяца назад

    So bad