Bellman Equation Derived In Excruciatingly Baby Steps

  • Published: 15 Nov 2024

Comments • 9

  • @bharathhegde4665 1 year ago +1

    I found the derivations quite brief too and was looking for a more rigorous explanation, so this was useful.
    An important point at 19:39 that I think should be mentioned is that you get E(G_(t+1) | s', r, s, a), and since it's a Markov decision process, the rewards obtained from state s' onward are conditionally independent, given s', of what action you took at s and what reward you got before arriving at s'. So this equals E(G_(t+1) | s'), which is what you have written.

    • @alex-ai7517 1 year ago +1

      Yep. I suppose there are some other places in my derivation where I haven't been totally explicit about the conditions. For example, I often drop the pi once I have pinned down an action. But that's just because I know I'm not going to need to talk about it again and it's implicit. Thanks for raising this.
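
The step discussed in this thread can be written out as a worked equation. This is only a sketch: the symbols v_π, π(a|s), p(s', r | s, a) and γ follow the common Sutton and Barto convention, which is an assumption here, since the thread does not spell out the video's own notation.

```latex
% Assumed notation (not taken from the video):
%   v_\pi(s) = E_\pi[G_t | S_t = s],  G_t = R_{t+1} + \gamma G_{t+1}
\begin{align*}
v_\pi(s)
  &= \mathbb{E}_\pi\!\left[ R_{t+1} + \gamma G_{t+1} \mid S_t = s \right] \\
  &= \sum_{a} \pi(a \mid s) \sum_{s',\, r} p(s', r \mid s, a)
     \Big( r + \gamma\, \mathbb{E}_\pi\!\left[ G_{t+1} \mid S_{t+1} = s',\, R_{t+1} = r,\, S_t = s,\, A_t = a \right] \Big) \\
  % Markov property: given S_{t+1} = s', the future return does not depend on (s, a, r)
  &= \sum_{a} \pi(a \mid s) \sum_{s',\, r} p(s', r \mid s, a)
     \Big( r + \gamma\, \mathbb{E}_\pi\!\left[ G_{t+1} \mid S_{t+1} = s' \right] \Big) \\
  &= \sum_{a} \pi(a \mid s) \sum_{s',\, r} p(s', r \mid s, a)
     \big( r + \gamma\, v_\pi(s') \big)
\end{align*}
```

The only place the Markov property is used is the move from the second line to the third, where the conditioning on s, a and r is dropped.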

  • @ResidualSkill 3 months ago

    Thank you, I thought I was going crazy seeing those two lines in the DeepMind lecture.

  • @matthewprestifilippo7673 1 year ago +1

    lol, cat stool journal. i have a dog and i know the importance of my dog's poo schedule, too.
    great video, man!

    • @swazza9999 1 year ago

      hahaha. Yeah my Russian Blue had diarrhea for like a year but we finally solved it.

  • @MilesHatler 4 months ago

    Video: Excruciating baby steps
    Me watching in 0.5X struggling to keep up:

  • @patiwatatayagul8738 1 year ago

    Damn, this is good. Thank you for the good lecture -- keep doing it!

  • @ssshukla26 3 years ago +1

    Hey nice work man ... Keep such videos coming...

  • @jacekwojcieszynski8368 8 months ago

    The Bellman equation is so profound.