SO(2)-Equivariant Reinforcement Learning

03/08/2022
by   Dian Wang, et al.
0

Equivariant neural networks enforce symmetry within the structure of their convolutional layers, resulting in a substantial improvement in sample efficiency when learning an equivariant or invariant function. Such models are applicable to robotic manipulation learning which can often be formulated as a rotationally symmetric problem. This paper studies equivariant model architectures in the context of Q-learning and actor-critic reinforcement learning. We identify equivariant and invariant characteristics of the optimal Q-function and the optimal policy and propose equivariant DQN and SAC algorithms that leverage this structure. We present experiments that demonstrate that our equivariant versions of DQN and SAC can be significantly more sample efficient than competing algorithms on an important class of robotic manipulation problems.

READ FULL TEXT

page 7

page 17

research
09/04/2020

Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization

Continuous control is a widely applicable area of reinforcement learning...
research
02/23/2021

Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry

Actors and critics in actor-critic reinforcement learning algorithms are...
research
11/06/2020

Adversarial Skill Learning for Robust Manipulation

Deep reinforcement learning has made significant progress in robotic man...
research
12/16/2021

Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic

Model-based reinforcement learning algorithms, which aim to learn a mode...
research
08/28/2023

Symmetric Models for Visual Force Policy Learning

While it is generally acknowledged that force feedback is beneficial to ...
research
08/16/2021

Optimal Actor-Critic Policy with Optimized Training Datasets

Actor-critic (AC) algorithms are known for their efficacy and high perfo...
research
04/19/2023

CASOG: Conservative Actor-critic with SmOoth Gradient for Skill Learning in Robot-Assisted Intervention

Robot-assisted intervention has shown reduced radiation exposure to phys...

Please sign up or login with your details

Forgot password? Click here to reset