V Labs
V Labs
  • Видео 4
  • Просмотров 9 662
Al Ghadeer Seamless Pipe Mill Construction Process
er Seamless Pipe Mill Construction Process
Просмотров: 77

Видео

Playing Atari Breakout with Deep Reinforcement Learning
Просмотров 8 тыс.2 года назад
A tutorial on how to make an AI / reinforcement learning agent beating human-level performance in Atari Breakout with Keras and Google Colab (Pro) Original Paper by Mnih et al. - arxiv.org/abs/1312.5602 The Code from the video - github.com/drVlasov/Breakout

Комментарии

  • @LeoVlasov10
    @LeoVlasov10 3 месяца назад

    LEGEND ❤

  • @LeoVlasov10
    @LeoVlasov10 3 месяца назад

    😮

  • @xanderito-low2409
    @xanderito-low2409 Год назад

    Thanks for the video. Am wondering, can this be applied to the game skiing?

    • @drvlasov5543
      @drvlasov5543 Год назад

      I actually am not familiar with this game. It depends upon what kind of reward signal and actions are applied. paperswithcode.com/sota/atari-games-on-atari-2600-skiing you can look here - it looks like Q-learning is not in the list of solutions

  • @thalesferraz
    @thalesferraz Год назад

    Thanks for sharing!

  • @mikejadwin1304
    @mikejadwin1304 Год назад

    I was struggling on how to install the correct dependencies from openai gym documentation. How exactly would one know which commands to use for a game other than breakout for example?

  • @user-zn7rg3wt7i
    @user-zn7rg3wt7i Год назад

    Thank u for the great tutorial! May I ask which Python version did u use in this one?

    • @drvlasov5543
      @drvlasov5543 Год назад

      3.7.15 (the default one with colab)

  • @YouWatchGames
    @YouWatchGames Год назад

    Great Video! Two questions: Can you somehow send me the trained model? And how did you get the video, because the only thing I get, is a json containing an empty array under the "video" property.

    • @drvlasov5543
      @drvlasov5543 Год назад

      sure, PM me your mail - I'll send the model if you do exactly what i show - you gonna be able to get the mp4 file

    • @YouWatchGames
      @YouWatchGames Год назад

      Or do you have twitter? I cant find a way to contact you on youtube. Maybe you forgot to add it to the "about" section.

    • @drvlasov5543
      @drvlasov5543 Год назад

      @@YouWatchGames I've uploaded the learned model to github!

  • @ZephyrineFreiberg
    @ZephyrineFreiberg 2 года назад

    Are there any methods applying policy gradients to solve this problem?

  • @HarshGhodkar
    @HarshGhodkar 2 года назад

    Can you give the weights of the model at the end of the training? Facing troubles training it, google collab is crashing

    • @drvlasov5543
      @drvlasov5543 2 года назад

      Send me an e-mail and I'll send you the trained model General advice is to reduce the buffer size (150-190 k) and to upgrade to Colab PRO:)

  • @Gordonias
    @Gordonias 2 года назад

    Would love to see an implementation outside of colab for those of us with local gpu resources :) Great video though!

  • @amr9059
    @amr9059 2 года назад

    It’s a great example RL. What do you think about Knapsack problem? Can RL solve this problem?

    • @drvlasov5543
      @drvlasov5543 2 года назад

      Thank you! According to Wikipedia, you can do it with dynamic programing :) so it's very close to RL