From BC supervised RL, to online/offline.. RL has still a supervised reward structure (sensor feedback), where genetic are the least supervised random case. Anyways, the RL feedback loop is always true for all living creatures on earth. I would regard RL as the super term on a continuous supervision spectrum dependent on the skill level of sensor feedback.
You are so good at making this stuff understandable
Fascinating and well presented. Thank you!
an excellent talk! but it would be nice if it was a bit louder
Excellent talk, thank you sir.
From BC supervised RL, to online/offline.. RL has still a supervised reward structure (sensor feedback), where genetic are the least supervised random case. Anyways, the RL feedback loop is always true for all living creatures on earth. I would regard RL as the super term on a continuous supervision spectrum dependent on the skill level of sensor feedback.
Nice lecture, but the sound is very low. Tyr to use a better microphone next time.