Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning

03/04/2022
by   Hongpeng Cao, et al.
0

Deep reinforcement learning (DRL) is a promising approach to solve complex control tasks by learning policies through interactions with the environment. However, the training of DRL policies requires large amounts of training experiences, making it impractical to learn the policy directly on physical systems. Sim-to-real approaches leverage simulations to pretrain DRL policies and then deploy them in the real world. Unfortunately, the direct real-world deployment of pretrained policies usually suffers from performance deterioration due to the different dynamics, known as the reality gap. Recent sim-to-real methods, such as domain randomization and domain adaptation, focus on improving the robustness of the pretrained agents. Nevertheless, the simulation-trained policies often need to be tuned with real-world data to reach optimal performance, which is challenging due to the high cost of real-world samples. This work proposes a distributed cloud-edge architecture to train DRL agents in the real world in real-time. In the architecture, the inference and training are assigned to the edge and cloud, separating the real-time control loop from the computationally expensive training loop. To overcome the reality gap, our architecture exploits sim-to-real transfer strategies to continue the training of simulation-pretrained agents on a physical system. We demonstrate its applicability on a physical inverted-pendulum control system, analyzing critical parameters. The real-world experiments show that our architecture can adapt the pretrained DRL agents to unseen dynamics consistently and efficiently.

READ FULL TEXT

page 1

page 3

research
09/22/2022

Accelerating Online Reinforcement Learning via Supervisory Safety Systems

Deep reinforcement learning (DRL) is a promising method to learn control...
research
02/01/2018

VR Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control

This paper deals with the reality gap from a novel perspective, targetin...
research
11/03/2021

What Robot do I Need? Fast Co-Adaptation of Morphology and Control using Graph Neural Networks

The co-adaptation of robot morphology and behaviour becomes increasingly...
research
04/25/2023

Roll-Drop: accounting for observation noise with a single parameter

This paper proposes a simple strategy for sim-to-real in Deep-Reinforcem...
research
02/18/2018

Sim-To-Real Optimization Of Complex Real World Mobile Network with Imperfect Information via Deep Reinforcement Learning from Self-play

Mobile network that millions of people use every day is one of the most ...
research
05/30/2021

Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric Architecture

Deep reinforcement learning (DRL) has been demonstrated to provide promi...
research
11/01/2020

Learning When to Switch: Composing Controllers to Traverse a Sequence of Terrain Artifacts

Legged robots often use separate control policies that are highly engine...

Please sign up or login with your details

Forgot password? Click here to reset