Yo as a black guy with dreadlocks who likes coding, it’s really cool to listen to another black guy with dreadlocks who likes coding
"A trade-off between exploration and exploitation" - That's life
Well, no,
(Conquistador) Life can be both.
Usually is.
Yay, this was actually better than most of the explanatory videos I have seen. Thanks for always providing us with informative content, Crash Course. Looking forward to more of these videos.
8:38 John Green bot from the past
Oh man, I didn't notice that
*waves robo hand* Mr. Green Bot! Mr. Green Bot! What if I took TEN actions?
Can we do Crash Course Law?
Like if you agree
Crash Course doesn't make sense; it makes Jabril look bad
How about Crash Course Current Events? It's so hard to keep up with all the developments in US politics and such
PBS could try inviting Meghan Markle and her team to do all the law educational content; she is very good at it. :)
@@nealkelly9757 Crash Course Current Events would have an indefinite number of episodes.
@@masternobody1896 ??? what do you mean?
Loving this series. Thank you so much! There's so much info in every episode; just fantastic.
I WOULD LOVEEEE IT IF CRASH COURSE HAD AN ACCOUNTING COURSE!!❤️️.
Jany JJ, try Khan Academy. I’m pretty sure they have an accounting section.
@@Logan_Explores Alright, thank you so much😻.
US or Canada or other?
Jabril really has it out for bagels.
This is the best explanation I have seen so far - definitely sharing - thank you!
This reminds me of Pavlov from my psychology class.
now,after watching multiple episodes in a row, I really want donuts :P also really enjoying this series :)
I'm really liking this series of videos
Keep up the good work :)
5:12 open all no risk high reward
This video is fire. Insta-subbed!
Can we get crash course geography
Jabril got a new hat, did you notice?
Such a great video!
We most definitely do whatever it takes to get more cookies 🍪 😉
0:17 That cookie looked completely ... edible hahaha! What brand is that? :D
Screw Reinforcement Learning .... I'm now officially hungry!
[back from the kitchen]
Reinforcement Learning is actually extremely interesting! :D
Nicely explained, thanks
Are you going to use OpenAI for RL, and Keras when we come to deep reinforcement learning?
When will this playlist be finished?
🏃 Thank You!
Thank you!
Thanks for your awesome course. I got interested in machine learning and I am planning to study it for my M.A.
Can we get crash course music theory?
That's the main thing I've been wanting for years, and many people have asked for it. I remember that on another comment asking for it, Crash Course replied that they've been thinking about it as well. Not sure what the chances are now, but that gives me hope.
Can we get a computation theory lesson, CC?
Cool video!
I don't agree with the bagel/donut choice example. Why choose the option of two bagels or donuts vs. the greater risk of more donuts (6) or a guaranteed single donut?
In the John Green Bot example, is the objective to find the shortest path or to get the most points? What would getting more points even do? I feel like in that case exploration is best so that you can find the shortest path, exploiting only when racing another bot.
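One common way to mix the two is an epsilon-greedy rule: explore with a small probability, otherwise exploit the best-known option. Here is a rough sketch of that idea. The function name `epsilon_greedy`, the action names, and the example values are made up for illustration and are not from the video.

```python
import random

def epsilon_greedy(values, epsilon, rng=random):
    """Pick an action from a dict of {action: estimated value}.

    With probability epsilon, explore (pick any action at random);
    otherwise exploit (pick the action with the highest estimate).
    """
    if rng.random() < epsilon:
        return rng.choice(list(values))
    return max(values, key=values.get)

# Hypothetical value estimates for two moves.
estimates = {"left": 0.2, "right": 0.7}

always_exploit = epsilon_greedy(estimates, epsilon=0.0)  # never explores
always_explore = epsilon_greedy(estimates, epsilon=1.0)  # always random
```

With `epsilon=0.0` the bot only ever exploits, and with `epsilon=1.0` it only ever explores. Values in between trade one off against the other, and a schedule that shrinks epsilon over time would match the "explore first, exploit when racing" idea.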
5:11 I'll just take all three items
This fellow and his donut obsession. I don't know... 😊
Black & White and Black & White: Creature Island used reinforcement learning. With enough training, the creature you commanded could learn incredibly complex routines, such as planting a sapling, watering the tree with the water miracle, then picking it up, throwing it into the resource center, and repeating.
I really hope that we'll see more games exploring that kind of relationship with a computer character. Imagine a game where you're personally teaching a group of monsters how to hunt and then guiding their instincts by reinforcing or punishing a particular set of circumstances, until they conquer their world.
It's Jabril!!!
Not sure the kitchen metaphor works for me. Why is the bag more likely to contain donuts than the box? It sure looked like the kind of box that donuts come in to me.
Why would JohnGreenBot in that battery example only go in straight lines? Would it not be better to go in a diagonal path?
Because that is how he was taught to see the room and navigate it. If you want diagonals, maybe we'd have to arrange the room in hexagons instead.
Love you love you love you love ❤️
9:40 is my hometown. Ok AI, good reinforcement for me.
Yeah, I recognized Boston St too :)
Nice ,
Is there a better reason for only storing a single value per square than reducing the total amount of stored data? Why not store 4 values per square, one per direction you could go from the current spot? That way you could find and exploit the near-black-hole shortcut that the current algorithm is too scared to find.
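Storing one value per direction is roughly what a tabular Q-learning table does. Below is a minimal sketch of that idea, not the method used in the video. The grid coordinates, the rewards, and the names `make_q_table`, `q_update`, and `best_action` are all assumptions for illustration.

```python
from collections import defaultdict

ACTIONS = ["up", "down", "left", "right"]

def make_q_table():
    # One entry per square, each holding four values (one per direction).
    return defaultdict(lambda: {a: 0.0 for a in ACTIONS})

def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    # Standard Q-learning update: nudge the value of (state, action) toward
    # the reward plus the discounted best value available from next_state.
    best_next = max(q[next_state].values())
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])

def best_action(q, state):
    # Exploit: choose the direction with the highest stored value.
    return max(ACTIONS, key=lambda a: q[state][a])

q = make_q_table()
# Hypothetical experience from square (1, 1), right next to a black hole:
# moving right falls in (-10), moving up heads toward the battery (+1).
q_update(q, (1, 1), "right", -10.0, (1, 2))
q_update(q, (1, 1), "up", 1.0, (0, 1))
```

Because the penalty is attached to the "right" move only, the square itself is not marked as bad, so a path that passes close to the black hole can still look attractive.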
I feel there are people in place of power/rich who need to watch this video... >.>
Outline what they should take away from the video and why.
🤘🤘🤘
Is this related to Dijkstra's or A*?
When jabrils is talking his mouth is moving. That is illegal.
Markov decision process and Q learning, fcking tedious
12th comment is mine...
Agent? like..... Agent Smith???!!!
Who drives a car looking at side ways?
Open Ai and Alpha Go
That was a robot playing Don't Wake Daddy.
Like if AI beats slavery
Knowledgeable, but man, I don't understand. Can you make it easier?
This episode is hard because it's oversimplified
First second!
Jabril? Jabril? Laughing too much to type. Hey, I'm Jabril. Unreal, these people.
Rare to see a black guy talking about this subject but glad I did.
You're good, but you are going too fast