Patrick McGuire: Deep Exploration via Bootstrapped DQN. (arXiv:1602.04621v2 [cs.LG] UPDATED)

Sunday, July 3, 2016

Efficient exploration in complex environments remains a major challenge for reinforcement learning. We propose bootstrapped DQN, a simple algorithm that explores in a computationally and statistically efficient manner through use of randomized value functions. Unlike dithering strategies such as epsilon-greedy exploration, bootstrapped DQN carries out temporally-extended (or deep) exploration; this can lead to exponentially faster learning. We demonstrate these benefits in complex stochastic MDPs and in the large-scale Arcade Learning Environment. Bootstrapped DQN substantially improves learning times and performance across most Atari games.
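To make the contrast with epsilon-greedy dithering concrete, here is a minimal sketch of the idea in tabular form. It is a simplified stand-in, not the paper's implementation: it replaces the deep network with small Q-tables, and every name in it (`make_heads`, `bootstrap_mask`, `chain_step`, the toy chain environment, the learning rate and mask probability) is an assumption chosen for illustration. What it tries to show is the core mechanism: keep several value-function estimates ("heads"), pick one at random at the start of each episode, follow it greedily for the whole episode, and train each head on a random subset of the transitions.

```python
import random

def make_heads(num_heads, num_states, num_actions, rng):
    """Build num_heads independent Q-tables with small random initial values."""
    # Random initialization gives the heads different initial opinions.
    return [[[rng.random() * 0.01 for _ in range(num_actions)]
             for _ in range(num_states)] for _ in range(num_heads)]

def greedy_action(q_head, state):
    """Greedy action under one head's Q-table (ties go to the lowest index)."""
    values = q_head[state]
    return max(range(len(values)), key=lambda a: values[a])

def bootstrap_mask(num_heads, p, rng):
    """Bernoulli(p) mask deciding which heads train on the current transition."""
    return [1 if rng.random() < p else 0 for _ in range(num_heads)]

def update_heads(q_heads, mask, s, a, r, s_next, done, lr=0.5, gamma=0.95):
    """One Q-learning step, applied only to the heads selected by the mask."""
    for k, q in enumerate(q_heads):
        if mask[k]:
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += lr * (target - q[s][a])

def chain_step(state, action, num_states):
    """Hypothetical toy chain: action 1 moves right, action 0 moves left.

    Reaching the right end gives reward 1 and ends the episode.
    """
    s_next = min(state + 1, num_states - 1) if action == 1 else max(state - 1, 0)
    done = s_next == num_states - 1
    return s_next, (1.0 if done else 0.0), done

def run_episode(q_heads, num_states, horizon, p, rng):
    """Run one episode, following a single randomly chosen head throughout."""
    k = rng.randrange(len(q_heads))  # sampled once per episode, not per step
    s, total = 0, 0.0
    for _ in range(horizon):
        a = greedy_action(q_heads[k], s)
        s_next, r, done = chain_step(s, a, num_states)
        mask = bootstrap_mask(len(q_heads), p, rng)
        update_heads(q_heads, mask, s, a, r, s_next, done)
        s, total = s_next, total + r
        if done:
            break
    return total

if __name__ == "__main__":
    rng = random.Random(0)
    heads = make_heads(num_heads=10, num_states=6, num_actions=2, rng=rng)
    returns = [run_episode(heads, 6, horizon=20, p=0.5, rng=rng)
               for _ in range(200)]
    print("episodes reaching the goal:", int(sum(returns)))
```

The design point is that the head index is drawn once per episode rather than once per step. The agent therefore commits to one plausible value function for many steps in a row, which is what the abstract calls temporally-extended (deep) exploration, whereas epsilon-greedy perturbs individual actions independently.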

from cs.AI updates on arXiv.org http://ift.tt/1QilDlK
