Reinforcement Learning: AlphaGo

An introduction to Reinforcement Learning

Evolving Genetic Neural Network Optimizes Poly Bridge Problems

REMBLE - NOT LIKE US FREESTYLE (OFFICIAL MUSIC VIDEO)

Chris Hemsworth Gets Nervous While Eating Spicy Wings | Hot Ones

Genshin Impact Version 4.7 Special Program #NewVersion #SpecialProgram #GenshinImpact

Reinforcement Learning from scratch

Graphics in 5 Minutes

Просмотров 33 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 24 май 2024
How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.
0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost

Комментарии • 38

@darthvader4899 Месяц назад ⁺⁶
this is video is super underrated. In fact the whole channel is underrated.
@themax2go 2 месяца назад ⁺¹
agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
@themathguy3149 7 месяцев назад ⁺³
Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!
@metaljacket8102 Месяц назад ⁺²
This is really awsome! It's the best video that explains DRL in such an easy to understand way!
@ashketchum1244 9 месяцев назад ⁺⁴
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
@tushargupta1999 2 месяца назад ⁺²
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
@a.aspden 8 месяцев назад ⁺²
Your videos are great. Looking forward to more!
@marcinstrzesak346 7 месяцев назад ⁺¹
Great video, very helpful, easy to understand.
@gmjammin4367 9 месяцев назад ⁺¹
Amazing video as always :)!
@moldo800 4 месяца назад ⁺¹
Excellent. Congratulations ❤
@cloudysh Месяц назад ⁺¹
This was so surprisingly great :3
@mado.madeleine 9 месяцев назад ⁺¹
Super helpful! Thank you 🙏🏽
@CptDoge-rn3ou 7 месяцев назад ⁺¹
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
@jameslibby5215 8 месяцев назад ⁺⁵
Very very underrated channel
@benc7910 4 месяца назад
Underrated, two Rs
@jameslibby5215 4 месяца назад
@@benc7910 thank ya sir
@luiseduardocraizer7416 4 дня назад
Excellent content!
@mohajeramir Месяц назад ⁺¹
Excellent
@nikbivation 9 месяцев назад ⁺¹
thank you for this!
@ireoluwaTH 9 месяцев назад ⁺¹
Thank you!!!
@BlueBirdgg 8 месяцев назад ⁺¹
Can you playlist each one of your topics plz?
I wanted to post on Twitter(X) your video topics but could only post a single video at a time.
Great content by the way. Ty very much.
Your perspective on some topics helped me a lot to get a more intuitive understanding.
@g5min 8 месяцев назад
Good idea! Here's one on generative AI:
ruclips.net/p/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo
Here's one on reinforcement learning
ruclips.net/p/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL
Here's one on LLMs + text-to-image
ruclips.net/p/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu
@BlueBirdgg 8 месяцев назад
@@g5min Ty!
@kniv0gaffel 7 месяцев назад ⁺¹
Brilliant
@solveigberling1662 2 месяца назад ⁺¹
That was dope
@edvinbeqari7551 4 месяца назад
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
@maxim_ml 13 дней назад
that was good
@bombur9007 Месяц назад
how many layers should such network have
@mineq4967 Месяц назад
but by what number do you change the weights like you never told us
@axe863 6 месяцев назад ⁺²
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
@nischalyou 8 месяцев назад
whats the name of this video game ?
@FRANKONATOR123 9 месяцев назад
Can you share the source code for this project
@g5min 8 месяцев назад
You can follow the link to the Karpathy site at the end of the video, repeated here:
karpathy.github.io/2016/05/31/rl/
@herikaniugu 7 месяцев назад
Imagine using reinforcement learning in quantitative finance 😊
@macratak 9 месяцев назад
ah yes, reinforcement learning. a fundamental computer graphics technology
@g5min 9 месяцев назад ⁺⁵
I think that character/game-AI is pretty central to graphics
@pw7225 9 месяцев назад ⁺¹
Why so negative?
@revimfadli4666 9 месяцев назад
@@g5minespecially AI image generation or processing nowadays

Следующие

Автовоспроизведение

Reinforcement Learning: AlphaGo

Reinforcement Learning: AlphaGo

An introduction to Reinforcement Learning

An introduction to Reinforcement Learning

Evolving Genetic Neural Network Optimizes Poly Bridge Problems

Evolving Genetic Neural Network Optimizes Poly Bridge Problems

REMBLE - NOT LIKE US FREESTYLE (OFFICIAL MUSIC VIDEO)

REMBLE - NOT LIKE US FREESTYLE (OFFICIAL MUSIC VIDEO)

Chris Hemsworth Gets Nervous While Eating Spicy Wings | Hot Ones

Chris Hemsworth Gets Nervous While Eating Spicy Wings | Hot Ones

Genshin Impact Version 4.7 Special Program #NewVersion #SpecialProgram #GenshinImpact

Genshin Impact Version 4.7 Special Program #NewVersion #SpecialProgram #GenshinImpact

These Next Storms Look Even Bigger…

These Next Storms Look Even Bigger…

Actor-Critic Reinforcement for continuous actions!

Actor-Critic Reinforcement for continuous actions!

Why Does Diffusion Work Better than Auto-Regression?

Why Does Diffusion Work Better than Auto-Regression?

Every Kind of Bridge Explained in 15 Minutes

Every Kind of Bridge Explained in 15 Minutes

AI Learns to Walk (deep reinforcement learning)

AI Learns to Walk (deep reinforcement learning)

Watching Neural Networks Learn

Watching Neural Networks Learn

Reinforcement Learning: ChatGPT and RLHF

Reinforcement Learning: ChatGPT and RLHF

Deep Reinforcement Learning: Neural Networks for Learning Control Laws

Deep Reinforcement Learning: Neural Networks for Learning Control Laws

An introduction to Policy Gradient methods - Deep Reinforcement Learning

An introduction to Policy Gradient methods - Deep Reinforcement Learning

😰 Плей Маркет, но Рагата СОШЛА С УМА! Help Monster: Tricky Puzzle - КРИНЖ ПЛЕЙ МАРКЕТА

😰 Плей Маркет, но Рагата СОШЛА С УМА! Help Monster: Tricky Puzzle - КРИНЖ ПЛЕЙ МАРКЕТА

ЭКСТРЕМАЛЬНЫЕ ПРЯТКИ В ЗАКРЫТОМ АКВАПАРКЕ! ГАБАР, СТОЛЯРОВ,СУДАРЬ,СИМКА,ФРОСЯ,МИЛАНА ХАМЕТОВА...

ЭКСТРЕМАЛЬНЫЕ ПРЯТКИ В ЗАКРЫТОМ АКВАПАРКЕ! ГАБАР, СТОЛЯРОВ,СУДАРЬ,СИМКА,ФРОСЯ,МИЛАНА ХАМЕТОВА...

Спустя 10 лет кикбоксер ОТОМСТИЛ за обидное ПОРАЖЕНИЕ #shorts

Спустя 10 лет кикбоксер ОТОМСТИЛ за обидное ПОРАЖЕНИЕ #shorts

У меня новый питомец: огромный черный Скорпион!

У меня новый питомец: огромный черный Скорпион!

Пусть все услышат этот ХИТ! #выпускной #последнийзвонок #школа #музыка #простовнашемклассе

Пусть все услышат этот ХИТ! #выпускной #последнийзвонок #школа #музыка #простовнашемклассе

В МАЙНКРАФТ ВЕРНУЛИ ПЕРВУЮ КАРТУ

В МАЙНКРАФТ ВЕРНУЛИ ПЕРВУЮ КАРТУ

VLOG: ПОДАРИЛА МАШИНУ РОДИТЕЛЯМ

VLOG: ПОДАРИЛА МАШИНУ РОДИТЕЛЯМ

Третий сезон пятой главы «Королевской битвы» Fortnite «Вдребезги» | Видеоролик к выходу главы

Третий сезон пятой главы «Королевской битвы» Fortnite «Вдребезги» | Видеоролик к выходу главы