Ready to get started with Stable Baselines3? ruclips.net/video/OqvXHi_QtT0/видео.html
Thanks for the great video. It makes it easier for me to build my own training environments.
Thank You!🎉
We need more videos (a course) like this: creating custom envs using OpenAI Gym.
Thanks, dear 🎉. Please upload more videos on multi-agent RL for robotics and multi-robot path planning with custom environments in Gymnasium.
Your explanation is very good. I have a humble request: can you make an RL training tutorial using MuJoCo Ant, but registering it as a custom environment? The gait parameter generation is quite tricky. If possible, please make a tutorial on it.
I also want this tutorial, so please Johnny
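For anyone who wants to experiment before a dedicated tutorial exists, here is a rough sketch of one way to register a tweaked MuJoCo Ant as a custom Gymnasium environment. The class name, the gait_period parameter, and the "CustomAnt-v0" id are hypothetical placeholders, not from the video.

```python
# Sketch: registering a customized MuJoCo Ant as its own Gymnasium environment.
# The class name, gait_period parameter, and env id are all hypothetical examples.
import gymnasium as gym
from gymnasium.envs.mujoco.ant_v4 import AntEnv

class CustomAnt(AntEnv):
    """Ant variant where gait-related parameters or reward shaping could be injected."""
    def __init__(self, gait_period=0.5, **kwargs):
        self.gait_period = gait_period  # placeholder gait parameter
        super().__init__(**kwargs)

# Register under a custom id so it can be created with gym.make(), including by SB3.
gym.register(
    id="CustomAnt-v0",
    entry_point=CustomAnt,      # Gymnasium also accepts a "module:ClassName" string here
    max_episode_steps=1000,
)

env = gym.make("CustomAnt-v0")
```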
Very helpful and nice explanation.
Thanks!
Hey Johnny, I was wondering if you knew how to make the algorithm learn some already-known states? I have a challenge that involves making a DQN learn and start from already-known states stored in a CSV file, and I am struggling because I have no idea how to do that. Is it possible?
I'm guessing if you know those states, then you would know what action to take or not take in relation to those states. For example, a pawn on a chess board can't go backwards, since you know that state is impossible. If my interpretation of your question is correct, then you might want to look into "action masking", which prevents the agent from taking illegal actions. You can start with this SB3 reference, but the concept is not limited to PPO: sb3-contrib.readthedocs.io/en/master/modules/ppo_mask.html
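A minimal sketch of what that looks like with sb3-contrib's MaskablePPO (the CartPole env and the mask logic below are placeholders for your own environment and your CSV-derived rules):

```python
import numpy as np
import gymnasium as gym
from sb3_contrib import MaskablePPO
from sb3_contrib.common.wrappers import ActionMasker

def mask_fn(env) -> np.ndarray:
    # Return one boolean per discrete action:
    # True = allowed in the current state, False = masked out.
    # Placeholder logic; derive the mask from your known states (e.g. the CSV).
    return np.ones(env.action_space.n, dtype=bool)

env = gym.make("CartPole-v1")      # stand-in for your custom environment
env = ActionMasker(env, mask_fn)   # wrapper that exposes the mask to the algorithm
model = MaskablePPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=10_000)
```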
Thank you
Hi, your videos are great and have helped me a lot, since you were using the latest version of Stable Baselines3... But I am facing an issue: the verbose values are not getting printed in the output. I have set verbose = 1 and even tried verbose = 2, but I am not getting the desired outputs (like rewards, loss, iterations, ep_len_mean, etc.) as they were printed in your videos. Can you please help me? Is this due to the custom environment I am using, or something else?
Also, the tensorboard logs are not working...
You should try creating a new conda environment and then install SB3 again. In my SB3 introduction video, I just ran pip install stable-baselines3[extra] and didn't do anything else special: ruclips.net/video/OqvXHi_QtT0/видео.html
@johnnycode Hi, I will try this again... Thanks a lot for the reply and your time! Might need your help again...
Hi, @johnnycode, I tried reinstalling stable-baselines3[extra], but I am not getting the monitor data, and the tensorboard logs are also not getting displayed... Is there some issue with the new version of stable-baselines3[extra]? Can you please give me the version you installed when making the video?
stable-baselines3 2.0.0
tensorboard 2.13.0
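For reference, a minimal setup that pins those versions and enables both the console output and TensorBoard logging could look like the sketch below; the environment id and log directory are just examples.

```python
# Pin the versions mentioned above (shell):
#   pip install "stable-baselines3[extra]==2.0.0" "tensorboard==2.13.0"
import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("CartPole-v1")        # stand-in for your custom environment
model = PPO(
    "MlpPolicy",
    env,
    verbose=1,                       # prints rollout/ep_len_mean/loss tables to stdout
    tensorboard_log="./tb_logs/",    # example directory; view with: tensorboard --logdir ./tb_logs
)
model.learn(total_timesteps=10_000)
```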
Thanks for your video. Which resources would you advise for learning practical applications of reinforcement learning? I've been trying to implement a bot for a specific game and have to create my own environment and DQN. I'm familiar with neural nets, but good information on all the rest is so hard to find.
Sorry, I'm not an expert. I suggest inquiring at the r/reinforcementlearning subreddit. There are some very knowledgeable people there.
@johnnycode Thank you for the answer, will do!
Thanks, good video. Does Gymnasium support Neo Geo (SNK) roms? How can I make it support them?
It doesn’t support Neo Geo roms. I think it would be extremely hard to bridge that support.
Could I use it for my own game made with the Godot Engine?? Thanks!!
Yes, of course!
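As a rough, hedged sketch (every class name, shape, and bridge detail below is a placeholder): wrapping a Godot game usually means writing a custom gymnasium.Env subclass that sends actions to the running game and reads observations back over whatever bridge you set up (sockets, a plugin, etc.), then training on it with SB3 as usual.

```python
import numpy as np
import gymnasium as gym
from gymnasium import spaces

class GodotGameEnv(gym.Env):
    """Skeleton custom env; observation/action shapes and the game bridge are placeholders."""

    def __init__(self):
        super().__init__()
        self.observation_space = spaces.Box(low=-1.0, high=1.0, shape=(8,), dtype=np.float32)
        self.action_space = spaces.Discrete(4)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        obs = np.zeros(8, dtype=np.float32)   # replace with the state read from the game
        return obs, {}

    def step(self, action):
        # Send `action` to the Godot game and read back the new state/reward here.
        obs = np.zeros(8, dtype=np.float32)
        reward, terminated, truncated = 0.0, False, False
        return obs, reward, terminated, truncated, {}
```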
Thanks for the amazing video!