Markov Decision Processes

  • Published: Jul 9, 2024
  • Virginia Tech CS5804
  • Science

Comments • 41

  • @hosamfikry2924
    @hosamfikry2924 5 years ago +18

    That is the best video I watched so far to understand this topic

  • @Pexers.
    @Pexers. 3 years ago +3

    Thank you, I spent hours on this algorithm and finally understood it!

  • @hobby_coding
    @hobby_coding 3 years ago +3

    Very good lecture, maybe the best introduction to this topic I've ever seen on YouTube.

  • @syedrumman3920
    @syedrumman3920 2 years ago +2

    This is such a clear explanation!! Ty for this!! I wish I had taken your class while I was in VT!

  • @consolesblow
    @consolesblow 5 years ago +1

    Thanks a lot! I found this very helpful.

  • @jff711
    @jff711 3 years ago +2

    Thank you very much, very well explained.

  • @quantlfc
    @quantlfc 2 years ago

    Absolutely amazing lecture!!!

  • @JustinMasayda
    @JustinMasayda 2 years ago

    This was fantastic, thank you!

  • @jub8891
    @jub8891 11 months ago

    Thank you so much, you explain the subject very well and have helped me to understand.

  • @coeusmaze9413
    @coeusmaze9413 4 years ago +2

    The video provides an intuitive but deep understanding of MDPs.

  • @ismailasmcalskan2552
    @ismailasmcalskan2552 4 years ago

    Really good video about this topic. Thank you.

  • @xruan6582
    @xruan6582 4 years ago +1

    Can anyone explain (32:00) the switch between the two modes (represented by the green and red arrows)? To me the green one seems like a deterministic rule and the red one like a stochastic rule. Can they exist simultaneously?

  • @sander1426-2
    @sander1426-2 4 years ago

    Thanks for the explanation!

  • @Ahmed.r.a
    @Ahmed.r.a 3 months ago

    Thank you for this brilliant explanation. I wish there were a question with a solution to practice on.

  • @richardm5916
    @richardm5916 4 years ago

    Really great explanation of machine learning.

  • @seanxu6741
    @seanxu6741 1 year ago

    Fantastic video! Thanks a lot!

  • @behmandtirgar
    @behmandtirgar 4 years ago

    I have a question at time 8:30: if we take an action to go to the left, why isn't Pr(c | b, left) 0.00? (We go to the other side.)
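On the question above: in many gridworld MDP formulations the actions themselves are stochastic, so the intended move succeeds only with some probability and the agent can slip in other directions, which is why a transition probability toward the "wrong" side need not be zero. Here is a minimal sketch of that convention; the 0.8 / 0.1 split is a common textbook default I am assuming, not necessarily the exact numbers in the lecture.

```python
def transition_probs(intended):
    """Return a dict of direction -> probability for a noisy action:
    the intended direction succeeds with probability 0.8, and the agent
    slips to each perpendicular direction with probability 0.1."""
    perpendicular = {"left": ("up", "down"), "right": ("up", "down"),
                     "up": ("left", "right"), "down": ("left", "right")}
    probs = {intended: 0.8}
    for d in perpendicular[intended]:
        probs[d] = 0.1
    return probs

print(transition_probs("left"))   # {'left': 0.8, 'up': 0.1, 'down': 0.1}
```

Because the action model is noisy, Pr(s' | s, left) can be nonzero even for successor states that do not lie to the left of s.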

  • @srujayop
    @srujayop 2 years ago +1

    Is the reward R(s) actually R(s')?
    And should that also be multiplied by the transition probability?
    max_a sum_{s', r} P(s', r | s, a) [r + gamma * V(s')]
    I am trying to relate the equation presented in the video to the standard four-argument notation.
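On the notation question above: in the four-argument convention the reward is part of the transition outcome, so it sits inside the expectation and is multiplied by the transition probability, effectively playing the role of R(s'). When the reward depends only on the current state, it can be pulled outside the sum (the probabilities sum to 1), which recovers the R(s) form. A minimal value-iteration sketch under the four-argument convention, using a made-up two-state MDP (all numbers are illustrative, not from the video):

```python
GAMMA = 0.9

# Four-argument convention P(s', r | s, a):
# transitions[s][a] is a list of (prob, next_state, reward) triples.
transitions = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 1, 2.0)], 1: [(1.0, 0, 0.0)]},
}

def bellman_backup(V, s):
    """One Bellman-optimality backup: max over actions of the expected
    reward plus discounted next-state value, with the reward inside the
    expectation (so it is weighted by the transition probability)."""
    return max(
        sum(p * (r + GAMMA * V[s2]) for p, s2, r in outcomes)
        for outcomes in transitions[s].values()
    )

V = {0: 0.0, 1: 0.0}
for _ in range(200):              # value iteration to (near) convergence
    V = {s: bellman_backup(V, s) for s in V}

print(V)
```

The two conventions differ only in when the reward is credited; both yield the same optimal policies.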

  • @ryanflynn386
    @ryanflynn386 5 years ago +3

    This is a great explanation video, thanks so much. Your voice is easy to listen to too haha.

    • @berty38
      @berty38  5 years ago +14

      Ryan Flynn Thanks! I’m glad it’s helpful. My smooth voice is a huge disadvantage when I teach morning classes and my students all fall asleep.

    • @tarik8622
      @tarik8622 3 years ago

      Very interesting topic. And I think you could make a fortune if you used your voice in the advertising field. Best regards.

  • @peterkimemiah9669
    @peterkimemiah9669 3 years ago

    Very good, easy to understand.

  • @linfrancis5204
    @linfrancis5204 5 years ago +1

    Great video. Thank you. Could you please make a similar video while we consider a two-dimensional Markov chain with more states?

  • @jaideep_yes
    @jaideep_yes 5 years ago

    Thank you.

  • @joshuasegal4161
    @joshuasegal4161 5 years ago

    What software are you using to make this?? It looks like you have like an infinite page which gives a really clean look

    • @berty38
      @berty38  5 years ago +3

      Nothing too fancy. This was done with Apple Keynote, and I'm faking that scrolling effect with "Magic Move" animations. I'm always looking for better tools to build useful visuals for lectures.

  • @sanskarshrivastava5193
    @sanskarshrivastava5193 3 years ago

    Best video for MDP on youtube

  • @JebbigerJohn
    @JebbigerJohn 9 months ago

    This is so good!!!

  • @_brenda4975
    @_brenda4975 3 years ago

    Much better than my lecturer.

  • @treegnome2371
    @treegnome2371 3 years ago

    At 17:35, why isn't it gamma in (0, 1) instead of (0, 1]? If gamma = 1, the influence of actions farther down the road stays the same as all other actions rather than shrinking, right?
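On the gamma question above: gamma = 1 is usually allowed only for episodic (finite-horizon) tasks, where the undiscounted sum of rewards is still finite; for infinite horizons one needs gamma < 1 so the return converges. A quick numeric sketch with made-up numbers (a constant reward of 1 per step):

```python
def discounted_return(gamma, horizon):
    """Sum of gamma**t * reward for t = 0 .. horizon-1, with reward = 1."""
    return sum(gamma**t * 1.0 for t in range(horizon))

# With gamma < 1 the series converges to 1 / (1 - gamma):
print(discounted_return(0.9, 1000))   # close to 10
# With gamma = 1 every step counts fully and the sum grows with the horizon:
print(discounted_return(1.0, 1000))   # exactly 1000
```

So the commenter is right that gamma = 1 keeps distant rewards at full influence; that is acceptable when episodes terminate, and problematic otherwise.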

  • @rezadarooei248
    @rezadarooei248 4 years ago

    Thanks for your nice tutorial. Is it possible to upload the slides?

  • @y-3084
    @y-3084 4 years ago

    excellent

  • @zenchiassassin283
    @zenchiassassin283 4 years ago

    What textbook? Thank you very much.

  • @Throwingness
    @Throwingness 3 years ago +1

    Around 34:00, when there are equations on the screen, you should have had a pointer or something to indicate what you are talking about. It's not clear.

  • @dminn
    @dminn 4 years ago

    God bless

  • @EdupugantiAadityaaeb
    @EdupugantiAadityaaeb 10 months ago

    What is the name of the textbook?

  • @suvinaybothra8988
    @suvinaybothra8988 4 years ago

    honesty

  • @abdullahmoiz8151
    @abdullahmoiz8151 4 years ago

    33:27

  • @ahmet9446
    @ahmet9446 5 years ago

    The best I found is [4, 1]. I couldn't achieve [4.2, 1.2]. Did anyone achieve [4.2, 1.2]?
