Your 12-minute video is worth more than all the playlists about Q-learning on YouTube 👏
I've watched so many videos on RL, but this one's the best when it comes to explaining and breaking down the formulas 😭❤ Thank you
Really enjoying the series. Keep it up
Thanks so much! Super glad you are enjoying this
Thanks for your efficient, good-quality videos! They not only save time but also give a complete understanding of the topic 😍
This was brilliantly explained. Thank you!
Thank you from the bottom of my heart!
You deserve tons of likes!!!
Wow, you are really good at explaining things. Thank you!
This is so underrated
great explanation
Well explained, sir!!
What classical tasks are solved by off-policy algorithms? Do we use them to write bots that solve simple computer games?
Excellent Explanation, hats off.
amazing.
your video is really useful!!! thanks a lot
Very well explained, sir. It helped a lot.
Very well explained, thanks a lot!
Wonderful video! Thank you!
Question about the last point you mention: we repeat the procedure many times until the values in the Q-table don't change much anymore. Is that considered to be some form of Monte Carlo (within Q-learning)? Enjoying your videos btw, great work!
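A minimal sketch of the loop this question describes, for readers who want to see it concretely. Everything here is an assumption made for illustration, not taken from the video: a hypothetical five-state corridor instead of the grid, the names `step` and `train`, and the values of `ALPHA`, `GAMMA`, `EPSILON` and `TOL`. Note that each update bootstraps from the current estimate of the next state (a temporal-difference target) rather than averaging complete episode returns as Monte Carlo methods do; the repetition only supplies more samples.

```python
import random

# Hypothetical toy corridor: states 0..4, state 4 is the terminal goal.
N_STATES, GOAL = 5, 4
ACTIONS = (-1, +1)                 # index 0 = move left, index 1 = move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2
TOL = 1e-4                         # assumed threshold for "doesn't change much"


def step(state, action):
    """Toy environment: reward 1 on reaching the goal, otherwise 0."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL


def train(max_episodes=5000, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for episode in range(1, max_episodes + 1):
        state, done, biggest_change = 0, False, 0.0
        while not done:
            # Epsilon-greedy behaviour policy with random tie-breaking.
            if rng.random() < EPSILON:
                a = rng.randrange(2)
            else:
                best = max(q[state])
                a = rng.choice([i for i in (0, 1) if q[state][i] == best])
            nxt, reward, done = step(state, ACTIONS[a])
            # Q-learning target: reward plus discounted max over next actions.
            target = reward + (0.0 if done else GAMMA * max(q[nxt]))
            delta = ALPHA * (target - q[state][a])
            q[state][a] += delta
            biggest_change = max(biggest_change, abs(delta))
            state = nxt
        # Stop once a whole episode barely changed any visited entry.
        if biggest_change < TOL:
            break
    return q, episode


if __name__ == "__main__":
    q_table, episodes_used = train()
    print("episodes:", episodes_used)
    for s, row in enumerate(q_table):
        print(s, [round(v, 3) for v in row])
```

The stopping test only looks at entries visited in the latest episode, so rarely explored actions may still be slightly off when the loop ends; a stricter check would compare the whole table between sweeps.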
This is epic
thank you so much that was so helpful
Thank you so much!!!!!!!!!!!!
thank u so much
May God be pleased with you
thanks man
I may be wrong, I'm not an expert, but isn't the Bellman equation supposed to add the reward of S1, not S2?
God bless you, and may God have mercy on your parents
You just saved me from a sure defeat =))) I thought I was going to fail the course, lucky I found you 😀😀😀
Bro, how are you speaking like an American? Suggest me some tips as well.
Instead of saying grid, you could almost say DFA
Q*