C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning

09/14/2022
by   Ci Chen, et al.
0

With the rise of computing power, using data-driven approaches for co-designing robots' morphology and controller has become a feasible way. Nevertheless, evaluating the fitness of the controller under each morphology is time-consuming. As a pioneering data-driven method, Co-adaptation utilizes a double-network mechanism with the aim of learning a Q function conditioned on morphology parameters to replace the traditional evaluation of a diverse set of candidates, thereby speeding up optimization. In this paper, we find that Co-adaptation ignores the existence of exploration error during training and state-action distribution shift during parameter transmitting, which hurt the performance. We propose the framework of the concurrent network that couples online and offline RL methods. By leveraging the behavior cloning term flexibly, we mitigate the impact of the above issues on the results. Simulation and physical experiments are performed to demonstrate that our proposed method outperforms baseline algorithms, which illustrates that the proposed method is an effective way of discovering the optimal combination of morphology and controller.

READ FULL TEXT
research
11/15/2019

Data-efficient Co-Adaptation of Morphology and Behaviour with Deep Reinforcement Learning

Humans and animals are capable of quickly learning new behaviours to sol...
research
09/19/2023

Memory-based Controllers for Efficient Data-driven Control of Soft Robots

Controller design for soft robots is challenging due to nonlinear deform...
research
06/13/2023

A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning

Offline reinforcement learning (RL) provides a promising solution to lea...
research
02/22/2023

Universal Morphology Control via Contextual Modulation

Learning a universal policy across different robot morphologies can sign...
research
10/20/2021

Data-Driven Offline Optimization For Architecting Hardware Accelerators

Industry has gradually moved towards application-specific hardware accel...
research
11/24/2022

Control and Morphology Optimization of Passive Asymmetric Structures for Robotic Swimming

Aquatic creatures exhibit remarkable adaptations of their body to effici...
research
05/03/2019

Data-efficient Learning of Morphology and Controller for a Microrobot

Robot design is often a slow and difficult process requiring the iterati...

Please sign up or login with your details

Forgot password? Click here to reset