DeepMind x UCL RL Lecture Series - Model-free Prediction [5/13]

  • Published: Feb 4, 2025

Comments • 14

  • @intuitivej9327
    @intuitivej9327 3 years ago +5

    How lucky I am... It is a great lecture. It is fun and so understandable since it is well explained.
    Thank you for sharing it with all of us.

  • @antoniomanjavacas1466
    @antoniomanjavacas1466 3 years ago +12

    Maybe it's just my humble impression, but I think examples like Bandits or Blackjack are not very intuitive for someone who is just getting into RL, but they are always used as canonical because they appear in Sutton & Barto 🤔

    • @marcin.sobocinski
      @marcin.sobocinski 2 years ago +2

      Unfortunately, almost all tutorials, lectures, etc. are based on the Sutton & Barto book... which is... well... not very creative, to put it nicely. The book itself is not as good as it should be as an RL bible (for me there is too much historical background and proxy discussion with other RL "fathers"). Still waiting for another "bible" on this topic that would be much more practical and less "academic".

  • @jonas14812
    @jonas14812 2 years ago +1

    Thank you so much for the amazing lecture!

  • @perrysdemos6062
    @perrysdemos6062 3 years ago +2

    This was a great lecture, thank you :)

  • @vslaykovsky
    @vslaykovsky 2 years ago +1

    15:27 Why are function approximators optimized with the mean squared error (L2) loss by default? Banach's fixed-point theorem uses the L-infinity norm, which is closer to the L1 error function.

  • @nasirasadov634
    @nasirasadov634 3 years ago +2

    1:32:41 "Inception"

  • @mysunnyjune
    @mysunnyjune 2 years ago

    I really appreciate the lecture and the effort, but all of the formula derivations need to be done much more rigorously.

  • @ayoghes2277
    @ayoghes2277 3 years ago

    Is there any proof that TD converges to the maximum likelihood estimate of the Markov model, for the given data? If so could anyone direct me to it, please?

    • @juanmoreno9633
      @juanmoreno9633 3 years ago

      Hi!
      Have you found it?
      Thanks in advance.

    • @ayoghes2277
      @ayoghes2277 3 years ago

      @juanmoreno9633 Hello!
      No, I have not. If you do find it, please let me know. Thank you.

    • @ivanily4
      @ivanily4 2 years ago

      @ayoghes2277 How about this? link.springer.com/content/pdf/10.1023/A:1022632907294.pdf

  • @theSpicyHam
    @theSpicyHam 3 years ago +1

    It's good that this doesn't stray toward anything directly military-related at all.