Temporally Layered Architecture for Efficient Continuous Control

05/30/2023
by   Devdhar Patel, et al.
0

We present a temporally layered architecture (TLA) for temporally adaptive control with minimal energy expenditure. The TLA layers a fast and a slow policy together to achieve temporal abstraction that allows each layer to focus on a different time scale. Our design draws on the energy-saving mechanism of the human brain, which executes actions at different timescales depending on the environment's demands. We demonstrate that beyond energy saving, TLA provides many additional advantages, including persistent exploration, fewer required decisions, reduced jerk, and increased action repetition. We evaluate our method on a suite of continuous control tasks and demonstrate the significant advantages of TLA over existing methods when measured over multiple important metrics. We also introduce a multi-objective score to qualitatively assess continuous control policies and demonstrate a significantly better score for TLA. Our training algorithm uses minimal communication between the slow and fast layers to train both policies simultaneously, making it viable for future applications in distributed control.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/25/2022

Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

We present temporally layered architecture (TLA), a biologically inspire...
research
04/13/2021

TASAC: Temporally Abstract Soft Actor-Critic for Continuous Control

We propose temporally abstract soft actor-critic (TASAC), an off-policy ...
research
09/25/2022

Temporally Extended Successor Representations

We present a temporally extended variation of the successor representati...
research
05/13/2019

Learning and Exploiting Multiple Subgoals for Fast Exploration in Hierarchical Reinforcement Learning

Hierarchical Reinforcement Learning (HRL) exploits temporally extended a...
research
03/27/2019

Autoregressive Policies for Continuous Control Deep Reinforcement Learning

Reinforcement learning algorithms rely on exploration to discover new be...
research
05/31/2023

Latent Exploration for Reinforcement Learning

In Reinforcement Learning, agents learn policies by exploring and intera...
research
09/02/2021

Habitual and Reflective Control in Hierarchical Predictive Coding

In cognitive science, behaviour is often separated into two types. Refle...

Please sign up or login with your details

Forgot password? Click here to reset