Latest YouTube Video

Wednesday, January 6, 2016

Angrier Birds: Bayesian reinforcement learning. (arXiv:1601.01297v1 [cs.AI])

We train a reinforcement learner to play a simplified version of the game Angry Birds. The learner is provided with a game state in a manner similar to the output that could be produced by computer vision algorithms. We improve on the efficiency of regular {\epsilon}-greedy Q-Learning with linear function approximation through more systematic exploration in Randomized Least Squares Value Iteration (RLSVI), an algorithm that samples its policy from a posterior distribution on optimal policies. With larger state-action spaces, efficient exploration becomes increasingly important, as evidenced by the faster learning in RLSVI.

Donate to arXiv



from cs.AI updates on arXiv.org http://ift.tt/1JYqbfo
via IFTTT

No comments: