Overcoming the Practical Challenges when using Reinforcement Learning

  • Published: 8 Sep 2024

Comments • 22

  • @BrianBDouglas
    @BrianBDouglas 5 years ago +19

    Hey everyone, thanks for watching this video! If you have any questions or comments that you'd like me to see, please leave them under this comment so that I get notified and can respond. Cheers!

    • @hanlinniu
      @hanlinniu 5 years ago +1

      Please upload faster; I quite enjoy watching your videos, haha

    • @BrianBDouglas
      @BrianBDouglas 5 years ago +2

      @@hanlinniu I appreciate that! I wish I was faster but I'm afraid this is top speed for me.

    • @hanlinniu
      @hanlinniu 5 years ago +1

      @@BrianBDouglas ok, no worries, thank you very much anyway!

    • @maximilianvontannenbusch6906
      @maximilianvontannenbusch6906 5 years ago +4

      Hi Brian, thank you for the videos. They were a huge help to kick-start my RL project. Do you know of any open-source RL projects that use the last-mentioned approach, in which the gains of a classical controller are learned? That would be pretty interesting to see.

    • @RamilShaymardanov
      @RamilShaymardanov 4 years ago

      @@BrianBDouglas Thank you for your videos! I have a question: what would be a good heuristic for learning, say, PID gains? Would it be the error integral, weighted {response time, oscillation, steady-state error} costs, or something else entirely?
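
      One way to make that heuristic concrete (a minimal sketch; the weights and the 2% settling band are illustrative assumptions, not from the video): score each recorded step response with a scalar cost combining integrated absolute error, overshoot, and settling time, then hand the tuning agent its negative as the reward.

          import numpy as np

          def step_response_cost(t, y, setpoint=1.0,
                                 w_iae=1.0, w_os=2.0, w_ts=0.5):
              # t, y: time stamps and measured output of one closed-loop
              # step test. The weights trade the three terms off.
              err = setpoint - y
              iae = np.trapz(np.abs(err), t)          # integrated abs. error
              overshoot = max(0.0, float(np.max(y)) - setpoint)
              outside = np.abs(err) > 0.02 * abs(setpoint)  # 2% band
              t_settle = float(t[outside][-1]) if outside.any() else 0.0
              return w_iae * iae + w_os * overshoot + w_ts * t_settle

          # The tuning agent's reward would then be:
          # reward = -step_response_cost(t, y)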

  • @alderaminh
    @alderaminh 5 years ago +9

    Combining traditional control with RL is very fresh to me. And thanks, as always, for uploading a good video.

  • @pv4343
    @pv4343 5 years ago +6

    Amazing video! If it's possible, could you show us an example of process control in MATLAB in another video? Thanks!!

  • @JamesCapparell-b5i
    @JamesCapparell-b5i 10 days ago

    I think a follow-up video updating any changes that affect this learning sequence would be welcome. It is five years old: 35 dog years and probably 10 RL years.

  • @ASDFAHED1985
    @ASDFAHED1985 4 years ago +1

    Thanks a lot, Brian, this is a very good and useful series. Please, can you help me with some more details about using RL as an optimization tool for a classical controller, particularly a PI controller? I have studied some of the MathWorks examples, but none of them use RL as a controller-parameter optimization tool. Many thanks.
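
    None of the MathWorks examples is reproduced here, but the idea in the question can be sketched under stated assumptions: a made-up first-order plant stands in for the real loop, and a simple random search stands in for a full RL algorithm. The "action" is the PI gain pair and the "reward" is the negative cost of one simulated step response.

        import numpy as np

        def evaluate_pi(kp, ki, dt=0.01, T=5.0):
            # Toy stand-in for a real closed-loop simulation: first-order
            # plant dy/dt = -y + u under PI control, scored by the
            # integrated absolute error of its unit-step response.
            y, integ, cost = 0.0, 0.0, 0.0
            for _ in range(int(T / dt)):
                err = 1.0 - y                  # unit-step reference
                integ += err * dt
                u = kp * err + ki * integ      # PI control law
                y += dt * (-y + u)             # Euler step of the plant
                cost += abs(err) * dt
            return cost

        # One-step ("bandit") view of tuning: pick gains, run an episode,
        # collect reward = -cost, and bias the search toward what worked.
        rng = np.random.default_rng(0)
        mean, std = np.array([1.0, 0.1]), np.array([0.5, 0.05])
        best_gains, best_cost = None, float("inf")
        for episode in range(200):
            kp, ki = rng.normal(mean, std)
            cost = evaluate_pi(kp, ki)
            if cost < best_cost:
                best_gains, best_cost = (kp, ki), cost
                mean = np.array(best_gains)    # recenter on the best gains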

  • @nanazeethiopia2892
    @nanazeethiopia2892 5 years ago +1

    Nice... very nice. Thanks a lot!

  • @artukikemty
    @artukikemty 3 years ago

    Interesting how control theory and AI are overlapping. In my view they have different potentials but complement each other, so I would say both are necessary. But in the end, is RL leading to AGI, or is it just another engineering tool? Great series of videos!

  • @raihaan1819
    @raihaan1819 4 years ago +2

    That final hybrid solution is awesome! Is it worth implementing on a quad with PID controllers? It would be interesting to have an autotuner that is pretty black-box.

    • @soutrikband
      @soutrikband 3 years ago +2

      You may refer to this paper of mine, which deals with autotuning of PID controllers using reinforcement learning: ieeexplore.ieee.org/abstract/document/8973068

    • @raihaan1819
      @raihaan1819 3 years ago +1

      @@soutrikband Great, thank you! Will have a read!

  • @arashhashemi3134
    @arashhashemi3134 4 years ago

    Thank you for your interesting explanation of RL. In the case of RL tuners, specifically for LQR, what do you think would be the best representation of the states? In contrast to an RL controller, I don't think the dynamic states would be a good choice.

  • @abdulbasithashraf5480
    @abdulbasithashraf5480 3 years ago

    How do you increase robustness? You said to dynamically change some parameters. How do you set that up?
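
    One common way to set that up (a sketch of the general idea, not necessarily the exact recipe from the video) is domain randomization: redraw the uncertain plant parameters at the start of every training episode, so the policy only earns reward by working across the whole range rather than overfitting one model. The parameter names and ranges below are hypothetical.

        import numpy as np

        rng = np.random.default_rng()

        def sample_plant_params():
            # Redraw uncertain physical parameters for one episode.
            return {
                "mass":    rng.uniform(0.8, 1.2),    # +/-20% of nominal
                "damping": rng.uniform(0.05, 0.20),  # poorly known friction
                "delay":   int(rng.integers(0, 3)),  # 0-2 samples of lag
            }

        for episode in range(1000):
            params = sample_plant_params()
            # env = make_env(**params)  # hypothetical env constructor
            # ...train the agent for one episode on this randomized plant...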

  • @ahmadalghooneh2105
    @ahmadalghooneh2105 5 years ago

    Love you Brian, you are the besttt

  • @Twinz2017
    @Twinz2017 4 months ago

    sa