Paing Thetko
Paing Thetko
  • Видео 17
  • Просмотров 3 161

Видео

Week6Ex1 ( Lunar Landing With Actor-Critic )
Просмотров 42 месяца назад
Week6Ex1 ( Lunar Landing With Actor-Critic )
Week6Ex2 ( Lunar landing with Deep Learning Q-value )
Просмотров 132 месяца назад
Week6Ex2 ( Lunar landing with Deep Learning Q-value )
Mountain car (discrete) environment
Просмотров 222 месяца назад
Mountain car (discrete) environment
Mountain Car continuous env
Просмотров 302 месяца назад
Mountain Car continuous env
Training with Pendulum( continuous action )
Просмотров 872 месяца назад
Training with Pendulum( continuous action )
Training with cartpole (discrete action) using Policy-gradient method
Просмотров 292 месяца назад
Training with cartpole (discrete action) using Policy-gradient method
Assignment 4.2
Просмотров 13 месяца назад
Assignment 4.2
Assignment 4
Просмотров 23 месяца назад
Assignment 4
Nodemcu and Netpie
Просмотров 16Год назад
Nodemcu and Netpie
robotHandGestureControl
Просмотров 16Год назад
robotHandGestureControl

Комментарии

  • @yor.senpai._7233
    @yor.senpai._7233 2 месяца назад

    Great application for reinforcement learning how did you visualise this working via the algorithm!