Deep RL Bootcamp Lecture 3: Deep Q-Networks

Поделиться
HTML-код
  • Опубликовано: 17 янв 2025

Комментарии • 28

  • @mingsumsze6026
    @mingsumsze6026 Год назад +3

    I think I should mention that the lecturer is one of the researchers who proposed dqn, and his name was the first among all the researchers. I like how modest he is hahaha. This is actually one of my favourite lecture. So much insight. Thank you!

  • @tetamusha
    @tetamusha 6 лет назад +6

    Thanks for sharing this lecture and the Deep RL Bootcamp 2017 playlist overall.

  • @anastasiaholovenko2103
    @anastasiaholovenko2103 3 года назад

    Isn't 0:25:29 a pseudo code for DDQN? We have Q and Q^ weights mentioned. On the other hand, the formula for y target is not the one of DDQN as far as I understand...

  • @thomasmao7225
    @thomasmao7225 7 лет назад +16

    Why is the default video speed 0.5?

  • @adilahsan4448
    @adilahsan4448 6 лет назад +1

    Thanks for that awesome lecture... You were very informative and insightful... :)

  • @xXxBladeStormxXx
    @xXxBladeStormxXx 7 лет назад +9

    I did not expect that guy to sound like he does.

  • @ethanjyx
    @ethanjyx 5 лет назад +6

    Some slides cover two or three points. One suggestion I'd give is to split one specific slide into multiple ones or add some animations.

    • @avimohan6594
      @avimohan6594 4 года назад

      Agreed. Sadly, this is true of so many presentations I've sat thru.

  • @ethanjyx
    @ethanjyx 5 лет назад

    Very good explanations!

  • @_mvr_
    @_mvr_ 7 лет назад +66

    watch in 1.5x and thank me later

    • @terrarox
      @terrarox 7 лет назад +5

      1.25 is perfect!

    • @onurtrtr2397
      @onurtrtr2397 7 лет назад +1

      1.25 is better, everything seems natural xd

    • @helenj8238
      @helenj8238 7 лет назад

      THANKS!!

    • @iansullivan8
      @iansullivan8 6 лет назад +8

      I did 1.25 for this guy, and .75 for karpathy

    • @technokicksyourass
      @technokicksyourass 6 лет назад

      LOL, yeah 1.25 or 1.5 speed, I can actually pay attention. This dude is.. sloooooooo...

  • @ashish9670
    @ashish9670 4 года назад

    Thanks for this lecture

  • @ProfessionalTycoons
    @ProfessionalTycoons 6 лет назад

    great talk.

  • @terrarox
    @terrarox 7 лет назад

    What's the question at 12:30?

    • @mdimbesathassanrizvi9654
      @mdimbesathassanrizvi9654 5 лет назад +2

      I believe it was about how frequently the weights of the Q net being learned is copied to the target network. Shouldn't be too frequently to avoid non-stationarity in target computation and again not too less frequently to avoid target network weights being too stale. Needs to be picked up through experimentation.

  • @shiweixiao2574
    @shiweixiao2574 6 лет назад

    nice !!

  • @sca2777
    @sca2777 7 лет назад

    nice

  • @motiurrahman
    @motiurrahman 6 лет назад +2

    He is opposite of Karpathy .

  • @LunnarisLP
    @LunnarisLP 6 лет назад +4

    He doesn't seem like the greatest presenter, and while I guess it's hard to find people who excell at both machine learning AND presenting and I can certainly see his expertice on the topic, he might wanna work on the presentation part a litte :D He made it a bit hard for me to keep paying attention :/

  • @dexlee7277
    @dexlee7277 6 лет назад

    He forgot to fill his belly before doing this

  • @hassamsheikh
    @hassamsheikh 7 лет назад +3

    HE is AWKWARD AF

  • @danny_racho
    @danny_racho 3 года назад

    The guy is demotivating and uninterested in teaching. Please bring David Silver back, that guy makes the information more appealing in my eyes