
- Видео 17
- Просмотров 3 161
Paing Thetko
Добавлен 26 фев 2023
Week6Ex3 ( MountainCar continuous with Actor Critic)
Week6Ex3 ( MountainCar continuous with Actor Critic)
Просмотров: 8
Видео
Week6Ex1 ( Lunar Landing With Actor-Critic )
Просмотров 42 месяца назад
Week6Ex1 ( Lunar Landing With Actor-Critic )
Week6Ex2 ( Lunar landing with Deep Learning Q-value )
Просмотров 132 месяца назад
Week6Ex2 ( Lunar landing with Deep Learning Q-value )
Training with Pendulum( continuous action )
Просмотров 872 месяца назад
Training with Pendulum( continuous action )
Training with cartpole (discrete action) using Policy-gradient method
Просмотров 292 месяца назад
Training with cartpole (discrete action) using Policy-gradient method
Great application for reinforcement learning how did you visualise this working via the algorithm!