Q-Learning Tutorial 1: Train Gymnasium FrozenLake-v1 with Python Reinforcement Learning

Поделиться
HTML-код
  • Опубликовано: 6 сен 2024

Комментарии • 37

  • @johnnycode
    @johnnycode  11 месяцев назад +2

    Please help me out with a subscribe if this video helped you :) Ready for Deep Q-Learning? ruclips.net/video/EUrWGTCGzlA/видео.html
    My code + list of RL videos in logical order: github.com/johnnycode8/gym_solutions

  • @dariustrabalza6629
    @dariustrabalza6629 6 месяцев назад +1

    Thanks for taking the time to make these videos. It's been really hard to find up to date information on how to use this stuff. there's not many videos out there. You break things down very simple and i appreciate it greatly. thanks for the hard work!

  • @randywilson595
    @randywilson595 3 месяца назад +1

    Thanks For putting out the best reinforcement learning tutorial Video I've ever seen. Line by line Brilliant!!!!!!

  • @Lukas-rj9nr
    @Lukas-rj9nr 4 месяца назад +1

    what a great video. You packed so much in just 12min. Hope you continue to make more videos about RL

  • @helims9384
    @helims9384 10 месяцев назад +1

    Thanks for the great content. I found something interesting about the algorithm. The performance of the algorithm is highly dependent on the results of the epsilon-greedy exploration. If we don't update the informative Q(s,a) table within some episodes (ex.5,000), the results are terrible. It's interesting that the results are inconsistent.

  • @panagiotisseventekidis7153
    @panagiotisseventekidis7153 13 дней назад

    great work

  • @carsonlind2361
    @carsonlind2361 10 месяцев назад

    Subscribed and liked. This has been really helpful in getting started. Thank you!

  • @user-rh9bn1zc5q
    @user-rh9bn1zc5q 7 месяцев назад +1

    Thanks for this tutorial !

  • @user-mt2xt9tk5o
    @user-mt2xt9tk5o 8 месяцев назад +1

    THANKS FOR VIDEO
    in visual studio code i don't have button for stop and pause how can I activate them or install any extension

    • @johnnycode
      @johnnycode  8 месяцев назад

      I think VSCode automatically installs the Python extensions when you open a Python file. If that didn’t happen, check out this reference code.visualstudio.com/docs/editor/debugging

    • @user-mt2xt9tk5o
      @user-mt2xt9tk5o 8 месяцев назад

      @@johnnycode Thank you so much 👍👍👍👍✔✔✔✔

  • @user-ks2kc9qz3d
    @user-ks2kc9qz3d 3 месяца назад

    Bonjour, je félicite pour ces excellent vidéo, je suis entraîné de programmer le même algorithme mais avec plusieurs agents , c.a.d on a plusieurs agents et plusieurs obstacles et plusieurs gouls au même temps, et j'ai trouve pas une méthode de modifier ce programme et intégrer plusieurs agents au même, merci d'avoir clarifier le programme qui fait cette opération, , et merci bien pour votre aide

    • @johnnycode
      @johnnycode  3 месяца назад

      Thank you, good luck on your work.

  • @ElisaFerrari-q5i
    @ElisaFerrari-q5i 2 месяца назад

    Based on what do we assign these values to hyperparameters?

    • @johnnycode
      @johnnycode  2 месяца назад

      Based on trial and error, or a process called hyperparameter tuning.

  • @muhammadtayyabbaig1071
    @muhammadtayyabbaig1071 Месяц назад

    At the start is the q table accurate?How is the q table made accurate and when does it start to follow it?

    • @johnnycode
      @johnnycode  Месяц назад

      The q-table is not accurate at the beginning. It becomes more accurate by updating with the q-learning formula. In the video, I did not talk about the theory and mathematics behind the q-learning formula.

  • @anissahafid9465
    @anissahafid9465 3 месяца назад

    merci d'avoir envoyé le code source de cet vidéo , et merci bien pour ces excellent explication

  • @Lukas-rj9nr
    @Lukas-rj9nr 4 месяца назад

    For a project in uni, I want to train an agend that can behave well on different state spaces. Imagine one agend should be able to solve the FrozenLake-Problem in 5x5, but also in 6x6, 7x7 etc. and also 5x6, 5x7, 6x5, etc.
    How to do that? Do you have an idea or keywords to search for?

    • @johnnycode
      @johnnycode  4 месяца назад

      My video on how to “Build a Custom Gymnasium Reinforcement Learning Environment” ruclips.net/video/AoGRjPt-vms/видео.html does very similar to what you described. However, you don’t have to create a custom environment, you just have to train the agent on all the different FrozenLake map sizes.

  • @kimiochang
    @kimiochang 4 месяца назад +1

    Thanks!

    • @johnnycode
      @johnnycode  4 месяца назад

      Thank you very much!!!

  • @caigao1571
    @caigao1571 9 месяцев назад

    thanks for your explanation

  • @user-ks2kc9qz3d
    @user-ks2kc9qz3d 3 месяца назад

    Je vous remercier infiniment , j'ai déjà voir ce vidéo mais j'ai trouvé pas une méthode pour crier plusieurs robots en même , sachat que leur travail est semblable comme le premier agent, tout on évitons les obstacles et fair recherche de la but ( goal), s'il y a une méthode simple merci d'avoir m'informer et le code source surtout , merci et merci pour vous effort de repondre

    • @johnnycode
      @johnnycode  3 месяца назад +1

      I will try to do some multiagent videos.

    • @anissahafid9465
      @anissahafid9465 3 месяца назад

      @@johnnycode merci et merci, puisque m'intéresse au multi agent ( ou bien multi robots ) et plusieurs goal ( buts ) , si l'un des ces agents trouve un goal il le marque comme fait , et lorsque un autre agent trouve le meme goal il le ignore et complet leur travail de recherche , je vous attend, bon implémentation , bon chanse.

    • @anissahafid9465
      @anissahafid9465 3 месяца назад

      Bonjour , s'il ya des nouveau pour la programmation des multi agent au meme temps , merci infiniment

  • @rayog2707
    @rayog2707 2 месяца назад

    what to do to see the Q-table?

    • @johnnycode
      @johnnycode  2 месяца назад

      The Q-table is a regular Python array, so you can just use a loop to print the value. In my other video, you can visually see the values on the map: ruclips.net/video/1W_LOB-0IEY/видео.html

  • @vickyli-hk6ir
    @vickyli-hk6ir 9 месяцев назад

    very clearly!!!

  • @smgwow
    @smgwow 8 месяцев назад

    Awesome video

  • @thefall0190
    @thefall0190 9 месяцев назад +1

    Will you do this with deep Q-learning version ?

    • @johnnycode
      @johnnycode  9 месяцев назад +2

      Yes, I’m working on it. Will share in a few days.

    • @thefall0190
      @thefall0190 8 месяцев назад

      Thank you !@@johnnycode

    • @johnnycode
      @johnnycode  8 месяцев назад +4

      Hi, my Deep Q-Learning video is out ruclips.net/video/EUrWGTCGzlA/видео.html
      Please check it out.

  • @ApexArtistX
    @ApexArtistX 10 месяцев назад

    awesome tutorial more please...

    • @johnnycode
      @johnnycode  8 месяцев назад +1

      Hi, in case you're looking for a Deep Q-Learning video, I've recently released a detailed one: ruclips.net/video/EUrWGTCGzlA/видео.html