Deictic Image Maps: An Abstraction For Learning Pose Invariant Manipulation Policies

06/26/2018
by   Robert Platt, et al.
0

In applications of deep reinforcement learning to robotics, it is often the case that we want to learn pose invariant policies: policies that are invariant to changes in the position and orientation of objects in the world. For example, consider a peg-in-hole insertion task. If the agent learns to insert a peg into one hole, we would like that policy to generalize to holes presented in different poses. Unfortunately, this is a challenge using conventional methods. This paper proposes a novel state and action abstraction that is invariant to pose shifts called deictic image maps that can be used with deep reinforcement learning. We provide broad conditions under which optimal abstract policies are optimal for the underlying system. Finally, we show that the method can help solve challenging robotic manipulation problems.

READ FULL TEXT
research
06/03/2019

Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

This paper proposes a novel scheme for the watermarking of Deep Reinforc...
research
12/10/2021

Reward-Based Environment States for Robot Manipulation Policy Learning

Training robot manipulation policies is a challenging and open problem i...
research
03/19/2018

Composable Deep Reinforcement Learning for Robotic Manipulation

Model-free deep reinforcement learning has been shown to exhibit good pe...
research
06/01/2020

Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement Learning

A fundamental challenge in reinforcement learning is to learn policies t...
research
01/10/2022

Verified Probabilistic Policies for Deep Reinforcement Learning

Deep reinforcement learning is an increasingly popular technique for syn...
research
06/07/2023

Generalization Across Observation Shifts in Reinforcement Learning

Learning policies which are robust to changes in the environment are cri...
research
10/30/2019

Learning Algorithmic Solutions to Symbolic Planning Tasks with a Neural Computer

A key feature of intelligent behavior is the ability to learn abstract s...

Please sign up or login with your details

Forgot password? Click here to reset