Видео

Playing Atari Breakout with Deep Reinforcement Learning

36:03

Playing Atari Breakout with Deep Reinforcement Learning

Просмотров 8 тыс.2 года назад

A tutorial on how to make an AI / reinforcement learning agent beating human-level performance in Atari Breakout with Keras and Google Colab (Pro) Original Paper by Mnih et al. - arxiv.org/abs/1312.5602 The Code from the video - github.com/drVlasov/Breakout

@LeoVlasov10 3 месяца назад
LEGEND ❤
@LeoVlasov10 3 месяца назад
😮
@xanderito-low2409 Год назад
Thanks for the video. Am wondering, can this be applied to the game skiing?
@drvlasov5543 Год назад
I actually am not familiar with this game. It depends upon what kind of reward signal and actions are applied. paperswithcode.com/sota/atari-games-on-atari-2600-skiing you can look here - it looks like Q-learning is not in the list of solutions
@thalesferraz Год назад
Thanks for sharing!
@mikejadwin1304 Год назад
I was struggling on how to install the correct dependencies from openai gym documentation. How exactly would one know which commands to use for a game other than breakout for example?
@user-zn7rg3wt7i Год назад
Thank u for the great tutorial! May I ask which Python version did u use in this one?
@drvlasov5543 Год назад
3.7.15 (the default one with colab)
@YouWatchGames Год назад
Great Video! Two questions: Can you somehow send me the trained model? And how did you get the video, because the only thing I get, is a json containing an empty array under the "video" property.
@drvlasov5543 Год назад
sure, PM me your mail - I'll send the model if you do exactly what i show - you gonna be able to get the mp4 file
@YouWatchGames Год назад
Or do you have twitter? I cant find a way to contact you on youtube. Maybe you forgot to add it to the "about" section.
@drvlasov5543 Год назад
@@YouWatchGames I've uploaded the learned model to github!
@ZephyrineFreiberg 2 года назад
Are there any methods applying policy gradients to solve this problem?
@HarshGhodkar 2 года назад
Can you give the weights of the model at the end of the training? Facing troubles training it, google collab is crashing
@drvlasov5543 2 года назад
Send me an e-mail and I'll send you the trained model General advice is to reduce the buffer size (150-190 k) and to upgrade to Colab PRO:)
@Gordonias 2 года назад
Would love to see an implementation outside of colab for those of us with local gpu resources :) Great video though!
@amr9059 2 года назад
It’s a great example RL. What do you think about Knapsack problem? Can RL solve this problem?
@drvlasov5543 2 года назад
Thank you! According to Wikipedia, you can do it with dynamic programing :) so it's very close to RL

V Labs

Видео

Комментарии