Q-Learning: A Complete Example in Python

  • Published: 27 Nov 2024

Comments • 74

  • @meraldlag4336
    @meraldlag4336 2 years ago +3

    This is easily the best resource on q-learning I’ve ever seen. It’s one simple library available to everyone on any OS (looking at you, *gym* ) and very well explained

  • @gamuchiraindawana2827
    @gamuchiraindawana2827 6 months ago +1

    I don't believe anyone teaches it better than you. Amazing.

  • @shreyashgupta13
    @shreyashgupta13 1 year ago +1

    I have been learning RL for the last year, and this is one of the best videos. You have taught everything from start to end. Thank you!

  • @kinez_vilicar8
    @kinez_vilicar8 4 years ago +14

    Best Q-Learning lesson ever, better than my uni class. Thank you very much, please keep it up.

  • @patrickng8805
    @patrickng8805 1 year ago +1

    The code is much cleaner and easier to understand than the example codes I found in many other Q-Learning books.

  • @kachappillyjean
    @kachappillyjean 2 years ago +4

    That is some incredible teaching skill! Broke down a complex concept to just the right amount of detail. Really appreciate this lesson.

  • @darker2.0
    @darker2.0 1 year ago

    The best teacher I have ever seen. Respect.

  • @takeiteasydragon
    @takeiteasydragon 6 months ago

    Extremely clear explanation for this topic. You are my life saver when I am preparing my finals. Thanks a lot.

  • @2NormalHuman
    @2NormalHuman 2 years ago +1

    Wow, this is the best Q-learning video with a code example that I have found on the internet. Thanks for the detailed explanation of every step in the code.

  • @srinivasanbalan2469
    @srinivasanbalan2469 4 years ago +2

    You are an excellent teacher, Dr. Soper. Appreciate your support and presentation.

  • @Sunny-qe5el
    @Sunny-qe5el 2 years ago

    WOW!!!
    Thanks a ton for this video mate.
    I have taken the reinforcement learning course at university, and this is by far the best way to get acquainted with the Q-learning algorithm.

  • @wennie2939
    @wennie2939 3 years ago +1

    Best RL video ever on YouTube! Thank you so much, Dr. Soper!

  • @Виктор-н8р7д
    @Виктор-н8р7д 1 year ago

    Thank you so much Dr.Super! You're a very gifted teacher. Please don't stop.

  • @IsaacAsante17
    @IsaacAsante17 3 years ago +9

    I'm really impressed by how easily you explained this. I'm going to watch the other videos in your Q-learning series, and subscribe as well. Thanks for this amazing RL tutorial, man.

    • @giancarlo1108
      @giancarlo1108 2 years ago +1

      I never expected to see an FL Studio tutorial channel here lol

  • @farzero
    @farzero 2 years ago

    Dude thanks a bunch. You gave just the right amount of detail and broke it down simply. Thanks for not bogging us down with a ton of details.

  • @JustSayin24
    @JustSayin24 4 years ago +2

    Writing my dissertation and this is a God-send. Thank you :)

  • @cK1ab
    @cK1ab 2 years ago +1

    Thank you so much for this clear and practical example. Most videos I've seen just kind of breeze over the environment and states. Keep up the good work!

  • @wherearewegoing400
    @wherearewegoing400 4 years ago +3

    Thank you very much Dr. Soper! I took great value from this video, as I am trying to implement some Q-Learning techniques into a project of my own. Looking forward to your future videos!

  • @christopherminhpham9603
    @christopherminhpham9603 2 years ago

    Very easy to follow and understand Q-learning, and amazing to see the computer 💻 working so well. It encourages me to learn more about AI. I am not good at maths or logic problems, but I understand them easily through your videos. Thank you so much 😊

  • @samarpratapsingh9788
    @samarpratapsingh9788 4 years ago

    Such a complex topic is explained without any hassle!

  • @ronaldleung9975
    @ronaldleung9975 4 years ago +2

    Perfect tutorial with clear sample code, but only 646 views? This video deserves better!

    • @ronaldleung9975
      @ronaldleung9975 4 years ago +2

      Most other videos are just using GYM or Unity library for their video, which are just show-offs, instead of really teaching something.

  • @tjcarter9286
    @tjcarter9286 3 months ago

    Thank you for this video, you've explained this excellently and I actually understand the concept now!

  • @saurabhjain9542
    @saurabhjain9542 2 years ago

    Very helpful and engaging lesson. I was having a hard time understanding the implementation, but this visual explanation with the code made it very clear. Great job!!

  • @gemini_537
    @gemini_537 6 months ago

    Gemini: This video is about a complete walkthrough of a Q-learning based AI system in Python.
    The video starts with an introduction to the business problem. The problem is about designing a warehouse robot that can travel around the warehouse to pick up items and bring them to a packaging area. The robot needs to learn the shortest path between all the locations in the warehouse.
    Then the video explains the concept of Q-learning, which is a reinforcement learning technique. Q-learning works by letting an agent learn from trial and error. The agent receives rewards for taking good actions and penalties for taking bad actions. Over time, the agent learns to take the actions that will lead to the greatest reward.
    Next, the video dives into the code. The code defines the environment, which includes the states, actions, and rewards. The states are all the possible locations of the robot in the warehouse. The actions are the four directions that the robot can move (up, down, left, and right). The rewards are positive for reaching the packaging area and negative for all other locations.
    The code also defines a Q-learning agent. The agent starts at a random location in the warehouse and then takes a series of actions. The agent learns from the rewards that it receives for its actions. Over time, the agent learns to take the shortest path to the packaging area.
    Once the agent is trained, the video shows how to use the agent to find the shortest path between any two locations in the warehouse. The video also shows how to reverse the path so that the robot can travel from the packaging area to any other location in the warehouse.
    Overall, this video is a great introduction to Q-learning and how it can be used to solve real-world problems.
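The summary above describes an environment (states, actions, rewards) and an agent that learns by trial and error. A minimal sketch of that idea follows. It is not the code from the video: the 3x4 grid, the -100 shelf penalty, the +100 goal reward, and the `gamma` and `alpha` values are all assumptions chosen for illustration. Only the -1 aisle reward and the 0.9 epsilon are mentioned elsewhere in this thread.

```python
import random

# Hypothetical 3x4 warehouse (an assumption, not the video's layout):
# -100 = shelf (terminal), 100 = packaging area (terminal), -1 = aisle.
REWARDS = [
    [-100, 100, -100, -100],
    [-1, -1, -1, -1],
    [-100, -100, -1, -100],
]
ACTIONS = ["up", "right", "down", "left"]
MOVES = {"up": (-1, 0), "right": (0, 1), "down": (1, 0), "left": (0, -1)}
ROWS, COLS = len(REWARDS), len(REWARDS[0])


def is_terminal(row, col):
    # Any cell that is not an aisle ends the episode.
    return REWARDS[row][col] != -1


def step(row, col, action):
    # Move one cell; moves that would leave the grid keep the agent in place.
    d_row, d_col = MOVES[action]
    new_row = min(max(row + d_row, 0), ROWS - 1)
    new_col = min(max(col + d_col, 0), COLS - 1)
    return new_row, new_col


def train(episodes=2000, epsilon=0.9, gamma=0.9, alpha=0.9, seed=0):
    # Here epsilon is the probability of taking the best-known action.
    rng = random.Random(seed)
    q = [[[0.0] * len(ACTIONS) for _ in range(COLS)] for _ in range(ROWS)]
    aisles = [(r, c) for r in range(ROWS) for c in range(COLS)
              if not is_terminal(r, c)]
    for _ in range(episodes):
        row, col = rng.choice(aisles)  # random starting aisle cell
        while not is_terminal(row, col):
            if rng.random() < epsilon:
                a = max(range(len(ACTIONS)), key=lambda i: q[row][col][i])
            else:
                a = rng.randrange(len(ACTIONS))
            new_row, new_col = step(row, col, ACTIONS[a])
            reward = REWARDS[new_row][new_col]
            # Temporal-difference update of the Q-value for (state, action).
            td = reward + gamma * max(q[new_row][new_col]) - q[row][col][a]
            q[row][col][a] += alpha * td
            row, col = new_row, new_col
    return q


def best_action(q, row, col):
    # Name of the action with the highest learned Q-value in this cell.
    return ACTIONS[max(range(len(ACTIONS)), key=lambda i: q[row][col][i])]
```

Calling `q = train()` and then `best_action(q, 1, 0)` returns the learned move for the aisle cell in row 1, column 0 of this toy grid.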

  • @alinelermer6664
    @alinelermer6664 3 years ago

    Excellent explanation with such a pleasant voice! Thank you so much.

  • @KSATica
    @KSATica 1 year ago

    You sir THANK YOU.. you broke it down very easy to comprehend and learn. Thank you and thank you

  • @johnmathew3580
    @johnmathew3580 3 years ago

    So peaceful yet very informative. Love this style.

  • @deepaktripathi4094
    @deepaktripathi4094 1 year ago

    I am really enjoying going through your videos.

  • @alialhilo5696
    @alialhilo5696 3 years ago

    very beneficial for my master's thesis. THANKS!!

  • @EttoreMastrogiacomo
    @EttoreMastrogiacomo 3 years ago

    Best Q-learning lesson, congratulations and thank you

  • @alisherabdulkhaev1881
    @alisherabdulkhaev1881 9 months ago

    Awesome! Thank you very much, so intuitive and easy to understand video!!

  • @danalex2991
    @danalex2991 2 years ago

    Amazing video. Better than my uni explanation!

  • @xlz9235
    @xlz9235 2 years ago

    Excellent lesson! Thank you so much

  • @bonadio60
    @bonadio60 2 years ago

    Fantastic lesson!! Thanks so much

  • @balalakshmiar9456
    @balalakshmiar9456 2 years ago

    Amazing explanation. Thanks a lot.

  • @abhaychandra2624
    @abhaychandra2624 5 months ago

    WHAT AN AWESOME VIDEO

  • @j_owatson
    @j_owatson 4 years ago

    Thanks man this video helped out by quite a lot. keep up the good work

  • @ramakantpathak7148
    @ramakantpathak7148 3 years ago

    Thanks for easy presentation

  • @kushankbansal
    @kushankbansal 2 years ago

    Excellent video sir

  • @alexjoseph8433
    @alexjoseph8433 3 years ago

    Very well explained. Thank you

  • @znypr
    @znypr 3 years ago +1

    Really good video and helped a lot!
    However the piano is a bit too loud :D maybe -20%? :D

  • @escapefelicity2913
    @escapefelicity2913 3 years ago +2

    Get rid of the background noise

  • @reihanehmirjalili7467
    @reihanehmirjalili7467 2 years ago

    AMAZING!!!! Thanks a lot

  • @moienr4104
    @moienr4104 4 years ago

    You are amazing sir, keep up the good work, thank you

  • @richardngweisiong2895
    @richardngweisiong2895 4 years ago

    Appreciate your knowledge sharing.

  • @vegeta3993
    @vegeta3993 4 years ago

    Great video, very well made.

  • @maltrho
    @maltrho 1 year ago

    The background music for this is just perfect! (Does anybody know if it is easily accessible?)

  • @premithmendis8004
    @premithmendis8004 3 years ago

    It is very good that this can automatically show us the shortest path, but what if we need to see the Q-table, i.e. the updated table the program uses to pick the best action from up, down, right, or left? I mean that every action is taken from the updated table. If we could obtain that, it would be a huge success as well.

    • @premithmendis8004
      @premithmendis8004 3 years ago

      When the agent reaches a certain point, it has to decide where to go. The updated table shows the value of each direction, and the highest value is selected.
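The lookup described in this thread can be sketched in a few lines. This assumes the Q-table is stored as a nested list indexed by row, column, and action, with the action order up, right, down, left. That layout, the function names, and the numbers in the toy table are all hypothetical, not taken from the video's notebook.

```python
# Assumed action order; the video's code may use a different one.
ACTIONS = ["up", "right", "down", "left"]


def action_values(q_table, row, col):
    # Pair each action name with its learned Q-value for this cell.
    return dict(zip(ACTIONS, q_table[row][col]))


def greedy_action(q_table, row, col):
    # The action with the highest Q-value is the one the agent would take.
    values = action_values(q_table, row, col)
    return max(values, key=values.get)


# Toy 1x2 Q-table with made-up values, just to show the lookup.
q_table = [[[-90.0, 89.0, -90.0, 79.1], [100.0, 70.2, -90.0, 79.1]]]

for col in range(2):
    print((0, col), action_values(q_table, 0, col), greedy_action(q_table, 0, col))
```

Printing `action_values` for every cell is one simple way to inspect the whole table after training.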

  • @brookealviar6415
    @brookealviar6415 4 years ago

    So helpful! Thank you!

  • @paperstars9078
    @paperstars9078 3 years ago +1

    This is awesome. I wonder how long it took to do all this.

  • @sivakumar-uj4fu
    @sivakumar-uj4fu 3 years ago

    My humble thanks, Dr. Daniel, for such a clear description of Q-learning in Python.
    I am not able to access the notebook for the code.
    Could you kindly help me with this? I want to practice creating an environment and running Q-learning on it.

  • @sezaiunver1454
    @sezaiunver1454 2 years ago

    Can you do a complete deep Q-learning example please? I really need that one :)

  • @aminabensalem35
    @aminabensalem35 3 years ago

    Great content

  • @kimjong-un4521
    @kimjong-un4521 1 year ago

    What a legend

  • @yanfeixiang2548
    @yanfeixiang2548 4 years ago

    Thank you, Dr. Daniel! This is an excellent Q-learning instructional video that includes a comprehensive theory part and a practical implementation. I want to ask whether it is still possible to find the link to this notebook. I can't find the video description section.

  • @JanMan37
    @JanMan37 3 years ago

    Thank you for the video. It was really clear and helpful. I have one question.
    In obtaining the shortest path after training, could you explain why epsilon is 0.9? Shouldn't epsilon be 1.0 to maximize the rewards?

    • @GeromesCat
      @GeromesCat 3 years ago

      He explains it briefly in the video at 13:00. The way I understand it is that you do sometimes want the AI agent to take random actions, instead of always taking the best action, so that it has a chance to explore the environment. Otherwise the AI, as soon as it finds a route to the 'item packaging area', might think it has already found the best route, and stop looking for a better one.
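The trade-off discussed in this thread can be sketched as an epsilon-greedy choice. The sketch assumes the convention implied by the question, where epsilon is the probability of taking the best-known action (many textbooks define epsilon the other way round, as the probability of a random action). The function name `choose_action` and the sample Q-values are hypothetical.

```python
import random


def choose_action(q_values, epsilon, rng=random):
    # With probability epsilon, exploit: take the action with the highest Q-value.
    if rng.random() < epsilon:
        return max(range(len(q_values)), key=q_values.__getitem__)
    # Otherwise explore: pick any action uniformly at random.
    return rng.randrange(len(q_values))


rng = random.Random(0)
q_values = [0.0, 5.0, 1.0, -2.0]  # made-up values; index 1 is the best action

# epsilon = 1.0 never explores, so the best-known action is always returned.
always_greedy = [choose_action(q_values, 1.0, rng) for _ in range(5)]

# epsilon = 0.9 still takes a random action about one time in ten.
mostly_greedy = [choose_action(q_values, 0.9, rng) for _ in range(20)]
```

Under this convention, 0.9 suits training, where occasional random moves let the agent discover better routes, while 1.0 suits reading off a path from an already trained table.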

  • @maniys
    @maniys 3 years ago

    this is great!!!!!!!!

  • @tuongnguyen9391
    @tuongnguyen9391 1 year ago

    This is quite strange to me. It seems that the agent does not use any of the Q-values to create the shortest path.

  • @Anonim70498
    @Anonim70498 4 years ago

    Thank you

  • @THE_ONLY_GOD
    @THE_ONLY_GOD 2 years ago

    Thanks for the video! However, I don't think giving every navigable tile a reward of -1 is optimal. If the robot has a bitmap of the environment and can look up its current location, the navigable tiles could use a reward system that increases toward the goal and GUIDES the robot MUCH better than a binary goal does.
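One known way to implement a graded reward like the one proposed here is potential-based reward shaping, which adds a distance-based bonus on top of the base reward without changing which policy is optimal. The sketch below is a named swap-in for the commenter's idea, not something shown in the video. The Manhattan-distance potential, the function names, and the `gamma` value are assumptions.

```python
def manhattan(a, b):
    # Grid distance between two (row, col) cells, ignoring obstacles.
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def shaped_reward(base_reward, state, next_state, goal, gamma=0.9):
    # Potential phi(s) = -distance to the goal, so cells nearer the goal score higher.
    def phi(s):
        return -manhattan(s, goal)

    # Potential-based shaping term: gamma * phi(s') - phi(s).
    return base_reward + gamma * phi(next_state) - phi(state)


goal = (0, 5)                                          # hypothetical packaging-area cell
toward = shaped_reward(-1, (3, 5), (2, 5), goal)       # one step closer to the goal
away = shaped_reward(-1, (3, 5), (4, 5), goal)         # one step farther from the goal
```

With these inputs a step toward the goal is rewarded more than a step away from it, which gives the agent guidance before it has ever reached the goal. Because Manhattan distance ignores shelves, the hint can be misleading in mazes with dead ends.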

  • @asifferdous2418
    @asifferdous2418 4 years ago

    What type of Q-learning is used in this?

  • @johnneilcotacte2660
    @johnneilcotacte2660 3 years ago

    Error in the get_next_location function: actions is not defined.

  • @ImtithalSaeed
    @ImtithalSaeed 2 years ago

    Why does the agent get -1 in its walking area?
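A small worked example may help with this question. With a negative reward on every aisle step, a longer route accumulates a lower total, so maximizing reward becomes the same as minimizing path length. The sketch assumes a +100 reward for reaching the packaging area, which is not stated in this thread, and the function name `path_return` is hypothetical.

```python
def path_return(steps, step_reward=-1, goal_reward=100, gamma=1.0):
    # Sum the rewards along a path of `steps` moves whose last move reaches the goal.
    total = 0.0
    for t in range(steps):
        reward = goal_reward if t == steps - 1 else step_reward
        total += (gamma ** t) * reward
    return total


short_path = path_return(3)   # 2 aisle steps, then the goal
long_path = path_return(6)    # 5 aisle steps, then the goal

# With no step penalty and no discounting, both paths look equally good.
short_free = path_return(3, step_reward=0)
long_free = path_return(6, step_reward=0)
```

Here `short_path` exceeds `long_path`, while `short_free` and `long_free` are equal. A discount factor below 1 would also favor the shorter path, so the -1 penalty is one of two mechanisms pushing the agent toward short routes.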

  • @MegaDixen
    @MegaDixen 2 years ago

    Really nice lesson, my university prof is so bad compared to this.

  • @nikolakrivokapic4962
    @nikolakrivokapic4962 3 years ago +1

    4:50 poor robots, they are never going to be good enough :(

  • @namegoeshere3398
    @namegoeshere3398 4 years ago

    It is not working for me. (The code on the website has errors.)

    • @j_owatson
      @j_owatson 4 years ago

      Have you installed all the required libraries, and are you running the correct version of Python? And if you're running the code on the website, run the code in order.
      Edit: try running it natively on your computer

  • @pammasinghkainth
    @pammasinghkainth 7 months ago

    The background music is very annoying! But the lesson was good