Using Dimensionality Reduction to Exploit Constraints in Reinforcement Learning
Item Status
Embargo End Date
Date
Abstract
Reinforcement learning in the high-dimensional,
continuous spaces typical in robotics, remains a challenging
problem. To overcome this challenge, a popular approach has
been to use demonstrations to find an appropriate initialisation
of the policy in an attempt to reduce the number of iterations
needed to find a solution. Here, we present an alternative
way to incorporate prior knowledge from demonstrations of
individual postures into learning, by extracting the inherent
problem structure to find an efficient state representation.
In particular, we use probabilistic, nonlinear dimensionality
reduction to capture latent constraints present in the data. By
learning policies in the learnt latent space, we are able to solve
the planning problem in a reduced space that automatically
satisfies task constraints. As shown in our experiments, this
reduces the exploration needed and greatly accelerates the
learning. We demonstrate our approach for learning a bimanual
reaching task on the 19-DOF KHR-1HV humanoid.
This item appears in the following Collection(s)

