Optimistic Agents are Asymptotically Optimal

Peter Sunehag, Marcus Hutter

We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.

Knowledge Graph

arrow_drop_up

Comments

Sign up or login to leave a comment