Discovering Reward Functions

Marcus explains how in simple tasks, the reward function can be easily defined, but in more complex problems, it becomes challenging. He shares examples of elevator control and building general agents. The human should give the reboot on the fly in the case of general agents.