Search Results - Springer

Page %P

Close Plain text

Sort By Newest First Oldest First

Chapter and Conference Paper

Exploration Bonuses Based on Upper Confidence Bounds for Sparse Reward Games

Recent deep reinforcement learning (RL) algorithms have achieved super-human-level performance in many Atari games. However, a closer look at their performance reveals that the algorithms fall short of humans ...

Naoki Mizukami, Jun Suzuki, Hirotaka Kameko… in Advances in Computer Games (2017)