47:09 Why is V(pi_max, pi_7) = 2 and not 5, assuming the agent tries to maximize his value while the opponent acts stochastically (i.e., the three branches are worth 0, 2, 5 in expectation)?
Hi, I believe the agent tries to maximize his value under the assumption that the opponent is a minimizer. It is as if you do not know your opponent's next move, but you imagine the opponent is a minimizer and compute the value under that assumption. In that scenario, if my policy is pi_max, I always choose the second branch.
The agent assumes the opponent will give him the min, so he chooses the branch with the highest worst-case value, which is 1 in this case. But the opponent is in fact playing stochastically, so the agent's expected payoff on that branch is 2 instead of the guaranteed 1. The 5 belongs to a branch that pi_max never chooses, so it has nothing to do with the result.
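A quick sketch may make this concrete. The leaf values below are my own assumption, chosen so the three branches have worst-case values -50, 1, -5 and expected values 0, 2, 5, matching the numbers in the question:

# Value of the minimax policy pi_max when the opponent actually plays
# uniformly at random (leaf values assumed for illustration only).
tree = [[-50, 50], [1, 3], [-5, 15]]

def value_pi_max_vs_random(tree):
    # pi_max commits to the branch with the best worst case (min = 1 here)...
    best = max(range(len(tree)), key=lambda i: min(tree[i]))
    # ...but the stochastic opponent then picks a leaf uniformly at random.
    return sum(tree[best]) / len(tree[best])

def value_expectimax(tree):
    # An agent that correctly models the random opponent would instead
    # pick the branch with the highest expected value.
    return max(sum(b) / len(b) for b in tree)

print(value_pi_max_vs_random(tree))  # 2.0 -> V(pi_max, pi_7) = 2
print(value_expectimax(tree))        # 5.0 -> requires a different agent policy

So the 5 is only attainable by an agent that optimizes against the random opponent; pi_max, playing it safe, locks in the branch whose expected value is 2.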
Nice lecture.
These algorithms look cool in theory.
Really good lecture series on reinforcement learning; a good balance of math, theory, and actual implementation details!
Is the eval function the same for the two players in chess?
Not sure why this has so few views; the lectures are high quality and detailed.
@parmoksha Reinforcement learning is also quite popular, bro.