The BEST Q-Learning example! | The Mountain Car Problem

  • Published: 27 Nov 2024

Comments • 5

  • @marmosetman • 2 months ago

    What do you mean it doesn't have enough energy to go to the flag? No matter how much back-and-forth motion it does, the energy requirement to move up to the flag is still the same.

  • @AsmageddonPrince • 5 months ago

    I don't feel like I understand the principle from your video: what is the purpose of partitioning the state into tiles? How and when are they assigned a Q-value, and when is it modified? Are the Q-values just zero during the first epoch? Does this work for larger state spaces? Does the agent really learn anything substantial from a replay of a 40k-step epoch?

    • @marcuskoseck98 • 5 months ago

      I partition the state space into tiles to build a function that relates states to Q-values. Think of it this way: I need a relationship between states and future returns. There is no obvious function I can think of to do the job, so instead I break the state space into squares (partitions) and assign each square a random Q-value. That is the initialization. As the algorithm learns, each stored Q-value becomes more representative of the actual Q-value. This method doesn't work for larger state spaces; at that point, you would want to use a neural network. For this specific reinforcement learning problem, 40k steps can be helpful in the beginning for exploration. If your algorithm is still taking 40k steps after a few thousand epochs, that's a sign your parameterization may be incorrect. Hope this helped!
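
      Below is a minimal sketch of the tiling approach this reply describes, written against Gymnasium's MountainCar-v0 environment. The tile count, hyperparameters, and helper names are illustrative assumptions, not the video's actual code:

      ```python
      import numpy as np
      import gymnasium as gym  # assumes the Gymnasium package is installed

      # Discretize the 2-D continuous state (position, velocity) into a grid of tiles.
      N_TILES = 20
      env = gym.make("MountainCar-v0")  # note: Gymnasium's default 200-step time limit
                                        # truncates episodes; the video's 40k-step epochs
                                        # imply that limit was raised or removed
      low, high = env.observation_space.low, env.observation_space.high
      tile_width = (high - low) / N_TILES

      def discretize(obs):
          """Map a continuous observation to the (row, col) index of its tile."""
          idx = ((obs - low) / tile_width).astype(int)
          return tuple(np.clip(idx, 0, N_TILES - 1))

      # Initialization: every (tile, action) cell gets a random Q-value.
      rng = np.random.default_rng(0)
      Q = rng.uniform(-1.0, 0.0, size=(N_TILES, N_TILES, env.action_space.n))

      alpha, gamma, epsilon = 0.1, 0.99, 0.1  # learning rate, discount, exploration rate

      for episode in range(5000):
          obs, _ = env.reset()
          state = discretize(obs)
          done = False
          while not done:
              # Epsilon-greedy: mostly exploit the current tile's Q-values.
              if rng.random() < epsilon:
                  action = env.action_space.sample()
              else:
                  action = int(np.argmax(Q[state]))
              obs, reward, terminated, truncated, _ = env.step(action)
              next_state = discretize(obs)
              # Q-learning update: nudge the tile's value toward the bootstrapped
              # target, so it gradually becomes more representative of the true value.
              target = reward + gamma * np.max(Q[next_state])
              Q[state + (action,)] += alpha * (target - Q[state + (action,)])
              state, done = next_state, terminated or truncated
      env.close()
      ```

      With 20 tiles per dimension and 3 actions, the Q-table holds 20 × 20 × 3 = 1,200 entries, which is why this lookup-table scheme stops being practical for larger state spaces, as the reply notes.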

  • @iony_mikler • 6 months ago

    This is very cool progress. Do you have a code repo for your learning?

    • @marcuskoseck98 • 5 months ago • +1

      Honestly, I have a bunch of code stored on my computer for various projects. I need to organize it and upload it. Eventually, I will upload the code.