The BEST Q-Learning example! | The Mountain Car Problem

The Most Important Algorithm in Machine Learning

Reinforcement Learning from scratch

First To The Top Wins A Lamborghini

Chiefs quarterback Patrick Mahomes talks after Super Bowl loss to Eagles

Training W/ a Real Life Giant (Worlds Tallest Bodybuilder)

Q Learning simply explained | SARSA and Q-Learning Explanation

Marcus Koseck

Просмотров 22 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 10 фев 2025
This problem is from a book called Reinforcement Learning: In Introduction by Richard S. Sutton and Andrew G. Barto. I found this problem to be a good way to introduce SARSA and Q-Learning. I am not an expert in reinforcement learning, but I find these kind of ideas interesting. I thought it would be cool to explore reinforcement learning and make a video explaining a concept to the best of my ability. I will be making more videos about reinforcement learning in the future and hopefully my explanations get better as time goes on.
Credits:
I used Manim for the animations.
All of the information on reinforcement learning came from the RL book by Sutton and Barto. I didn't explain the concepts well enough in the video to do the book justice. The book is very well written.
The environment is from AIGym.
GitHub:
github.com/mar...

Комментарии • 23

@SagarHingalAI 15 дней назад ⁺¹
Thanks for the intro video! I’m kinda trying to build an agent from scratch (without using any existing libraries), so first learning the fundamentals
@DC-rk6xf Год назад ⁺³
Thanks for this introductory video. It helped me a lot.
@AlicjaKrzemińska-Ściga 20 дней назад
Really nice approach to intuitively compare SARSA and Q-Learining, thanks!
@michaelomglol 5 месяцев назад ⁺¹
Thanks for this video, helping me a lot with my uni work
@viralshorts9596 Год назад
this really boosted my understanding
@JamonAllan Месяц назад
I have written a code that will absolutely trick and safe guard the AI to never go bad
@MinhNguyen-deadwish 2 месяца назад
Wow, great explained
@ghulamhussainkhansherwani6032 3 месяца назад
great work brother
@Royal--00 Год назад
Very interesting!
@rogerperez6576 Год назад
Nice explanation
@muslumyildiz5694 11 месяцев назад
Thanks
@बिहारीभायजी 6 месяцев назад
very well explained
@malteiwa 3 месяца назад
thank you
@almonteros 10 месяцев назад
Nice.
@aleksantoniak5448 Год назад
Hello, where could i find code for that?
@marcuskoseck98 Год назад
Hello. My github has the code under the "SARSA-and-Q_Learning" tab. Link to the github page is in the description.
@frommarkham424 2 месяца назад
Whatt so q learning tries to predict the future rewards
@jonaskarlsson5901 7 месяцев назад
does this mean it's not even using a neural network?
@manuelabarcacrespo8298 5 месяцев назад
Q-Learning dont use neuraln neutworks, its a table that the agents learns to complete and then uses to solve a problem
@jonaskarlsson5901 5 месяцев назад
@@manuelabarcacrespo8298 is Q learning also used to generate the training data for an NN?
@iceshadow487 Месяц назад
This is a different kind of ML process called Markov Decision Process.
@420_gunna 28 дней назад
This specific one doesn't use a neural network. We use NNs as learned models to predict (e.g.) the Q values of (s,a) pairs in situations where the state space is so large that we can't get good estimates of Q(s, a) using the manner described in this video (because it would just take too long), or at other times using them as our policies themselves. Look up stuff like Deep Q-Learning.

Следующие

Автовоспроизведение

The BEST Q-Learning example! | The Mountain Car Problem

The BEST Q-Learning example! | The Mountain Car Problem

The Most Important Algorithm in Machine Learning

The Most Important Algorithm in Machine Learning

Reinforcement Learning from scratch

Reinforcement Learning from scratch

First To The Top Wins A Lamborghini

First To The Top Wins A Lamborghini

Chiefs quarterback Patrick Mahomes talks after Super Bowl loss to Eagles

Chiefs quarterback Patrick Mahomes talks after Super Bowl loss to Eagles

Training W/ a Real Life Giant (Worlds Tallest Bodybuilder)

Training W/ a Real Life Giant (Worlds Tallest Bodybuilder)

Jeep® | Big Game | Harrison Ford x Jeep | Owner’s Manual

Jeep® | Big Game | Harrison Ford x Jeep | Owner’s Manual

AI Learns to Walk (deep reinforcement learning)

AI Learns to Walk (deep reinforcement learning)

AI exploits a gamebreaking bug in Trackmania

AI exploits a gamebreaking bug in Trackmania

Temporal Difference Learning (including Q-Learning) | Reinforcement Learning Part 4

Temporal Difference Learning (including Q-Learning) | Reinforcement Learning Part 4

The moment we stopped understanding AI [AlexNet]

The moment we stopped understanding AI [AlexNet]

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Ethical Hacker: "I'll Show You Why Google Has Just Shut Down Their Quantum Chip"

Ethical Hacker: "I'll Show You Why Google Has Just Shut Down Their Quantum Chip"

Q-Learning Explained - A Reinforcement Learning Technique

Q-Learning Explained - A Reinforcement Learning Technique

AI can't cross this line and we don't know why.

AI can't cross this line and we don't know why.

Каждый русский уже знает итальянский язык 😳 @nastyawhere

Каждый русский уже знает итальянский язык 😳 @nastyawhere

НУЖЕН ТОП1 В БИТВЕ БЛОГЕРОВ. День 3

НУЖЕН ТОП1 В БИТВЕ БЛОГЕРОВ. День 3

ЧТО БЫ твоя мама СЪЕЛА за 10.000$

ЧТО БЫ твоя мама СЪЕЛА за 10.000$

ПОВТОРИ ВАК МОМЕНТ - ПОЛУЧИ ГОЛДУ ft. Apollon🗿(STANDOFF 2)

ПОВТОРИ ВАК МОМЕНТ - ПОЛУЧИ ГОЛДУ ft. Apollon🗿(STANDOFF 2)

Фильм В тылу врага : боевик, триллер, драма (2022)

Фильм В тылу врага : боевик, триллер, драма (2022)

Неправильный M&M's мёд 😋

Неправильный M&M's мёд 😋

Редакция. News: 155-я неделя

Редакция. News: 155-я неделя

МОЯ МАШИНА ЛУЧШЕ (Гоча vs Царь)

МОЯ МАШИНА ЛУЧШЕ (Гоча vs Царь)