Best lecture I found to understand MCTS!
wish it had better audio quality
If I am not mistaken, because the blue node is the action of the opponent, you should not increase the number of wins in the numerator, only the number of times he/she played this move. 29:52
Yeah, it's a bit weird because I've seen both implementations online, but only increasing the win count from the point of view of the player that made the move in a given node yielded better results for me.
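For what it's worth, here is a minimal sketch of a backpropagation step under that convention; this is an illustration in Python, not the lecture's code, and the node fields (player, wins, visits, parent) are assumptions:

    def backpropagate(node, winner):
        # Walk from the expanded node back up to the root.
        while node is not None:
            node.visits += 1
            # Credit a win from the point of view of the player
            # whose move led to this node (node.player, assumed).
            if winner == node.player:
                node.wins += 1
            elif winner is None:
                node.wins += 0.5  # a common choice for draws
            node = node.parent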
At 28:49, I am not sure that's really true in general. In Gelly and Silver (2007) they show that using better simulation policies in UCT does not necessarily translate to better final MCTS performance.
Yeah, the caveat he gave ("As long as it's not that much more expensive") is crucially important, because a heuristic-based selection or rollout policy can be very detrimental, as shown in a number of papers (including recent work): heuristics take time that could have been used to run more rollouts, and heuristics may fail in complex scenarios where random exploration may succeed. Enhancements are certainly possible, but any approach should be well tested.
Question: given enough time, can MCTS (Monte Carlo Tree Search) find the best solution?
The problem with MCTS is that it chooses the child node with the highest probability of having a solution.
As long as those probabilities don't change, MCTS will choose the same node, no matter how many iterations you perform. That means some leaves (terminal nodes) are unreachable.
If the best solution happens to be in an unreachable leaf, MCTS will never find it.
The problem is how you calculate the probabilities of the child node.
There are formulas like UCT, but I don't know if such formulas are really correct.
The probabilities need to change in such a way as to allow MCTS to traverse every single node and every single leaf of the tree.
I don't think UCT does that.
That means some nodes and leaves are unreachable.
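For reference, the UCB1 rule that UCT uses to rank children is (standard notation, not taken from the video):

    \mathrm{UCT}(i) = \frac{w_i}{n_i} + c \sqrt{\frac{\ln N}{n_i}}

where w_i and n_i are the win and visit counts of child i, N is the parent's visit count, and c > 0 is an exploration constant. Note that these scores are not static probabilities: while a child goes unvisited, ln N keeps growing but n_i does not, so that child's exploration term grows without bound and it is eventually selected again. With c > 0, UCT therefore visits every child infinitely often in the limit, and Kocsis and Szepesvári (2006) prove that it converges to the optimal move given enough time.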
The branching factor of Tic-Tac-Toe is nine at maximum??? Is that right or not?
Bookmark: 47:00
A nice presentation! Thank you guys :D
ruclips.net/video/xmImNoDc9Z4/видео.html As I understand it, we don't store a node from the simulation phase because that node could later appear as a child of another node, and when you compute UCB the result would then be affected by the previous simulation => we'd lose the randomness.
So my question is: am I understanding this correctly? If yes, is this only true with a random strategy?
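As a minimal sketch of how I read that point (the state methods here, copy / is_terminal / legal_moves / apply / winner, are assumptions, not the video's API): the simulation phase plays out on a scratch copy of the state and never allocates tree nodes, so one playout's outcome cannot bias the UCB statistics of nodes that get expanded later.

    import random

    def rollout(state):
        # Simulation phase: no tree nodes are created here.
        state = state.copy()
        while not state.is_terminal():
            move = random.choice(state.legal_moves())  # uniformly random policy
            state = state.apply(move)
        return state.winner()  # winning player, or None for a draw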
32:30
interesting
I doubt the person giving the lecture has ever written a chess or Go program. His explanation of how to terminate the search was not good. Tic-Tac-Toe is a poor example problem; it is too simple. Nim would be a better example.
I am surprised the algorithm uses ln and sqrt functions, as these are time consuming.
Is this all you need to know to use MCTS?
The evaluation routine that is used to evaluate nodes is key. There is no point in searching deep if you don't know what you are searching for, or are searching for the wrong thing.
The ln and sqrt functions come from good math theory regarding the probability of regret: what's the chance that the best option by chance had a few too many losses early in the sampling, while a suboptimal option (a trap, perhaps) had a few wins by chance? The theoretical answer to that is UCB, which includes ln and sqrt. You can try other, cheaper calculations, but I think you will find that although you can run more simulations, the simulations will not be used as effectively.
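A hedged sketch of that score in Python (the function name and c = sqrt(2), the textbook exploration constant, are my choices, not from the video):

    import math

    def ucb1(wins, visits, parent_visits, c=math.sqrt(2)):
        # Unvisited children get priority, so every move is sampled once.
        if visits == 0:
            return math.inf
        # Mean value plus an exploration bonus; the sqrt(ln N / n) term
        # shrinks slowly, so a move that looked bad by chance early on
        # is retried before being written off.
        return wins / visits + c * math.sqrt(math.log(parent_visits) / visits)

Selection then just picks the child maximizing this score, e.g. max(children, key=lambda ch: ucb1(ch.wins, ch.visits, parent.visits)).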
Test out your theories here www.codingame.com/multiplayer/bot-programming/tic-tac-toe and get a really good insight into tradeoffs. Pure MCTS will do pretty well and you'll have a very hard time beating it. Those at the top of the leaderboard mostly use vanilla MCTS.
@@AntonPanchishin Thanks for the link. I wrote an Othello program back in 1980 and entered it in the first man-machine Othello tournament in Evanston, IL. I met a lot of the chess pioneers there. Later I wrote my own chess program and entered it into USCF chess tournaments, but I never got it to play better than I could. I only had a 386 computer. Also, real life got in the way.
live audiences are so gross. so much coughing