CS 285: Lecture 15, Part 1: Offline Reinforcement Learning

  • Published: Dec 12, 2024

Comments • 6

  • @baselomari3657
    @baselomari3657 1 year ago +1

    At 33:16, shouldn't it be "x*

    • @ResidualSkill
      @ResidualSkill 11 months ago

      yeah seems like a typo

    • @dwpark3761
      @dwpark3761 11 months ago +1

      I don't think so. It is a kind of analogy. Just imagine that f(x) is the Q(s,a) on the next page.

    • @binyuwang6563
      @binyuwang6563 2 months ago

      It's not a typo. Here x*

  • @AmitSingh-jo8ob
    @AmitSingh-jo8ob 1 year ago

    Is there a page by the lab where I can see all the references used in the slides in one place?
    Also, is there a survey paper that covers all these things?

  • @SphereofTime
    @SphereofTime 8 months ago

    1:00