Reinforcement Learning: Thompson Sampling & The Multi Armed Bandit Problem - Part 02

  • Published: 23 Oct 2024

Comments • 14

  • @gznqtl 4 years ago +4

    Hello Daniel, I just can't believe it: thanks to your series I'm working with Python and Jupyter, running AI exercises, and it's working!!! I studied mathematics and computer science 35 years ago, and I just remembered why I loved my career. Just the best course! (Greetings from México)

  • @hessamjamalkhah9781 3 years ago +2

    Excellent example and explanation, thank you
    I hope you decide to continue your videos, they are just perfect

  • @danalex2991 2 years ago +1

    What an amazing video! You are the best!

  • @sergeshirokov6064 1 year ago +1

    Hello Daniel! Thank you so much for these videos! They are amazing and really helpful! You are a great one

  • @ksriniva 4 years ago +1

    Great video and explanation of Thompson Sampling and its practical application through the multi-armed bandit scenario.

  • @EustaquioSantimano 3 years ago +1

    Thank you for the clear explanation.

  • @LionelMessi-fu6wn 4 years ago +2

    Thank you so much! Could you please suggest a good reference book that focuses mainly on reinforcement learning? I would prefer that it starts from scratch.

  • @jorgerios4091 1 year ago

    Just found this video, and it is the best I've ever seen on this topic. What would the procedure be if the "conversion rate" changes over time? My guess is to use only the last "n" observations for the sampling (last 10, last 20, etc.), but in that case, what is the minimum value of "n" that can be used with the beta distribution?
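The "last n observations" idea in this question can be sketched as follows. This is a minimal illustration, not code from the video: the `WindowedArm` class, the `choose_arm` helper, the window size of 20, and the conversion rates are all hypothetical. It assumes a uniform Beta(1, 1) prior, so both Beta parameters are always at least 1 and the draw stays valid for any window length, even an empty one.

```python
import random
from collections import deque


class WindowedArm:
    """Remembers only the most recent `window` outcomes (0 or 1) of one machine."""

    def __init__(self, window):
        # A deque with maxlen silently drops the oldest outcome when full.
        self.recent = deque(maxlen=window)

    def record(self, outcome):
        self.recent.append(outcome)

    def sample(self, rng):
        wins = sum(self.recent)
        losses = len(self.recent) - wins
        # Assumed Beta(1, 1) prior: parameters are 1 + wins and 1 + losses.
        return rng.betavariate(1 + wins, 1 + losses)


def choose_arm(arms, rng):
    """Thompson step: draw one sample per arm and pick the largest."""
    samples = [arm.sample(rng) for arm in arms]
    return samples.index(max(samples))


rng = random.Random(0)              # fixed seed, for repeatability only
window = 20                         # hypothetical window size
arms = [WindowedArm(window) for _ in range(3)]
rates_before = [0.10, 0.50, 0.20]   # hypothetical rates, first half
rates_after = [0.60, 0.10, 0.20]    # hypothetical rates, second half

for turn in range(2000):
    rates = rates_before if turn < 1000 else rates_after
    chosen = choose_arm(arms, rng)
    outcome = 1 if rng.random() < rates[chosen] else 0
    arms[chosen].record(outcome)
```

Under that prior there is no mathematical minimum for "n". The choice is a trade-off: a short window reacts faster to a change but gives noisier estimates, while a long window is steadier but slower to notice the change.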

  • @swetapatra 4 years ago +2

    Why is the outcome 1 if the random number is less than the conversion rate? Shouldn't it be the other way round?
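A short sketch may help with this question. It assumes the random number is drawn uniformly from [0, 1); the function name `simulate_outcome` and the rate 0.15 are illustrative and not taken from the video. A uniform draw falls below a rate p in a fraction p of cases, so "less than" is the comparison that makes a win happen with probability p.

```python
import random


def simulate_outcome(conversion_rate, rng):
    """Return 1 with probability `conversion_rate`, otherwise 0."""
    # rng.random() is uniform on [0, 1), so it lands below
    # conversion_rate in a fraction conversion_rate of the draws.
    return 1 if rng.random() < conversion_rate else 0


rng = random.Random(0)    # fixed seed, for repeatability only
conversion_rate = 0.15    # hypothetical value
n_draws = 100_000
wins = sum(simulate_outcome(conversion_rate, rng) for _ in range(n_draws))
win_fraction = wins / n_draws
print(win_fraction)       # should land close to conversion_rate
```

Flipping the comparison to "greater than" would give a win with probability 1 - p instead, so a machine with a 15% conversion rate would pay out about 85% of the time.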

  • @veramentegina 3 years ago

    thank you. you are the best!!

  • @jesuslopez3306 4 years ago

    That is great! Thank you so much!!

  • @iovistypsanelli7974 4 years ago

    Nice music!

  • @yigitsevim7741 1 year ago

    You said that we have $1000 initially. However, we played 1000 turns, and in each turn we used each of the 6 machines. So didn't we spend $6000?

    • @yigitsevim7741 1 year ago

      Also, when indexing outcomes we use [turn_index], so does that mean the outcome depends on which turn we are in? I thought every spin was independent. Why does the turn index affect the outcome?
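One possible reading of the two questions above, sketched under an explicit assumption: the simulation pre-generates a table with one independent draw per (turn, machine) cell and then plays a single machine per turn. Whether the video's code does exactly this is not confirmed here. The rates are hypothetical, and the random machine choice is only a placeholder policy, not Thompson sampling.

```python
import random

rng = random.Random(0)   # fixed seed, for repeatability only
conversion_rates = [0.15, 0.04, 0.13, 0.11, 0.05, 0.08]  # hypothetical
n_turns = 1000
n_machines = len(conversion_rates)

# Pre-draw one independent outcome for every (turn, machine) cell.
# Each cell comes from its own random draw, so rows do not influence
# one another.
outcomes = [
    [1 if rng.random() < rate else 0 for rate in conversion_rates]
    for _ in range(n_turns)
]

plays = 0
total_reward = 0
for turn_index in range(n_turns):
    # Placeholder policy: pick one machine at random for this turn.
    machine = rng.randrange(n_machines)
    # turn_index only selects which pre-drawn row to read; it does
    # not change the probability of a win.
    total_reward += outcomes[turn_index][machine]
    plays += 1

print(plays)  # one play per turn
```

In a setup like this, the table holds an outcome for all six machines on every turn, but only one cell per turn is actually read. If each play costs one unit, 1000 turns cost 1000 units rather than 6000, and the turn index acts as a lookup key for an already independent draw.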