Patrick McGuire: Options Discovery with Budgeted Reinforcement Learning. (arXiv:1611.06824v3 [cs.LG] UPDATED)

Wednesday, February 22, 2017

Options Discovery with Budgeted Reinforcement Learning. (arXiv:1611.06824v3 [cs.LG] UPDATED)

We consider the problem of learning hierarchical policies for Reinforcement Learning able to discover options, an option corresponding to a sub-policy over a set of primitive actions. Different models have been proposed during the last decade that usually rely on a predefined set of options. We specifically address the problem of automatically discovering options in decision processes. We describe a new learning model called Budgeted Option Neural Network (BONN) able to discover options based on a budgeted learning objective. The BONN model is evaluated on different classical RL problems, demonstrating both quantitative and qualitative interesting results.

from cs.AI updates on arXiv.org http://ift.tt/2fWM0V2
via IFTTT

Patrick McGuire

Latest YouTube Video

Wednesday, February 22, 2017

Options Discovery with Budgeted Reinforcement Learning. (arXiv:1611.06824v3 [cs.LG] UPDATED)

No comments:

Click to Show Support

Click to Show Support