Patrick McGuire: Value Iteration Networks. (arXiv:1602.02867v2 [cs.AI] UPDATED)

Monday, May 30, 2016

Value Iteration Networks. (arXiv:1602.02867v2 [cs.AI] UPDATED)

We introduce the value iteration network (VIN): a fully differentiable neural network with a `planning module' embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network, and trained end-to-end using standard backpropagation. We evaluate VIN based policies on discrete and continuous path-planning domains, and on a natural-language based search task. We show that by learning an explicit planning computation, VIN policies generalize better to new, unseen domains.

from cs.AI updates on arXiv.org http://ift.tt/1XhcmQD
via IFTTT

Patrick McGuire

Latest YouTube Video

Monday, May 30, 2016

Value Iteration Networks. (arXiv:1602.02867v2 [cs.AI] UPDATED)

No comments:

Click to Show Support

Click to Show Support