NEARL: Non-Explicit Action Reinforcement Learning for Robotic Control

11/02/2020
by   Nan Lin, et al.
10

Traditionally, reinforcement learning methods predict the next action based on the current state. However, in many situations, directly applying actions to control systems or robots is dangerous and may lead to unexpected behaviors because action is rather low-level. In this paper, we propose a novel hierarchical reinforcement learning framework without explicit action. Our meta policy tries to manipulate the next optimal state and actual action is produced by the inverse dynamics model. To stabilize the training process, we integrate adversarial learning and information bottleneck into our framework. Under our framework, widely available state-only demonstrations can be exploited effectively for imitation learning. Also, prior knowledge and constraints can be applied to meta policy. We test our algorithm in simulation tasks and its combination with imitation learning. The experimental results show the reliability and robustness of our algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
08/08/2020

Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

Many modern methods for imitation learning and inverse reinforcement lea...
research
03/23/2021

Meta-Adversarial Inverse Reinforcement Learning for Decision-making Tasks

Learning from demonstrations has made great progress over the past few y...
research
09/20/2022

A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret

In various control task domains, existing controllers provide a baseline...
research
12/14/2020

Active Hierarchical Imitation and Reinforcement Learning

Humans can leverage hierarchical structures to split a task into sub-tas...
research
12/16/2021

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning

Effective exploration continues to be a significant challenge that preve...
research
06/11/2017

Meta learning Framework for Automated Driving

The success of automated driving deployment is highly depending on the a...
research
04/07/2022

3D Perception based Imitation Learning under Limited Demonstration for Laparoscope Control in Robotic Surgery

Automatic laparoscope motion control is fundamentally important for surg...

Please sign up or login with your details

Forgot password? Click here to reset