Chapter and Conference Paper
Exploration Bonuses Based on Upper Confidence Bounds for Sparse Reward Games
Recent deep reinforcement learning (RL) algorithms have achieved super-human-level performance in many Atari games. However, a closer look at their performance reveals that the algorithms fall short of humans ...