One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control

07/09/2020
by   Wenlong Huang, et al.
9

Reinforcement learning is typically concerned with learning control policies tailored to a particular agent. We investigate whether there exists a single global policy that can generalize to control a wide variety of agent morphologies – ones in which even dimensionality of state and action spaces changes. We propose to express this global policy as a collection of identical modular neural networks, dubbed as Shared Modular Policies (SMP), that correspond to each of the agent's actuators. Every module is only responsible for controlling its corresponding actuator and receives information from only its local sensors. In addition, messages are passed between modules, propagating information between distant modules. We show that a single modular policy can successfully generate locomotion behaviors for several planar agents with different skeletal structures such as monopod hoppers, quadrupeds, bipeds, and generalize to variants not seen during training – a process that would normally require training and manual hyperparameter tuning for each morphology. We observe that a wide variety of drastically diverse locomotion styles across morphologies as well as centralized coordination emerges via message passing between decentralized modules purely from the reinforcement learning objective. Videos and code at https://huangwl18.github.io/modular-rl/

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2023

Feudal Graph Reinforcement Learning

We focus on learning composable policies to control a variety of physica...
research
10/31/2022

Learning Modular Robot Visual-motor Locomotion Policies

Control policy learning for modular robot locomotion has previously been...
research
02/14/2019

Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity

Contemporary sensorimotor learning approaches typically start with an ex...
research
05/20/2021

Learning Modular Robot Control Policies

To make a modular robotic system both capable and scalable, the controll...
research
05/30/2023

Subequivariant Graph Reinforcement Learning in 3D Environments

Learning a shared policy that guides the locomotion of different agents ...
research
11/08/2018

Modular Architecture for StarCraft II with Deep Reinforcement Learning

We present a novel modular architecture for StarCraft II AI. The archite...
research
10/28/2022

Learning Modular Simulations for Homogeneous Systems

Complex systems are often decomposed into modular subsystems for enginee...

Please sign up or login with your details

Forgot password? Click here to reset