Reinforcement Learning with Chromatic Networks

07/10/2019
by   Xingyou Song, et al.
6

We present a new algorithm for finding compact neural networks encoding reinforcement learning (RL) policies. To do it, we leverage in the novel RL setting the theory of pointer networks and ENAS-type algorithms for combinatorial optimization of RL policies as well as recent evolution strategies (ES) optimization methods, and propose to define the combinatorial search space to be the the set of different edge-partitionings (colorings) into same-weight classes. For several RL tasks, we manage to learn colorings translating to effective policies parameterized by as few as 17 weight parameters, providing 6x compression over state-of-the-art compact policies based on Toeplitz matrices. We believe that our work is one of the first attempts to propose a rigorous approach to training structured neural network architectures for RL problems that are of interest especially in mobile robotics with limited storage and computational resources.

READ FULL TEXT

page 5

page 8

page 12

page 13

page 14

page 15

page 16

page 17

research
01/19/2021

ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

We introduce ES-ENAS, a simple neural architecture search (NAS) algorith...
research
04/06/2018

Structured Evolution with Compact Architectures for Scalable Policy Optimization

We present a new method of blackbox optimization via gradient approximat...
research
08/27/2020

Neural Learning of One-of-Many Solutions for Combinatorial Problems in Structured Output Spaces

Recent research has proposed neural architectures for solving combinator...
research
09/08/2020

Evolutionary Reinforcement Learning via Cooperative Coevolutionary Negatively Correlated Search

Evolutionary algorithms (EAs) have been successfully applied to optimize...
research
03/06/2019

Training in Task Space to Speed Up and Guide Reinforcement Learning

Recent breakthroughs in the reinforcement learning (RL) community have m...
research
06/19/2020

An Ode to an ODE

We present a new paradigm for Neural ODE algorithms, calledODEtoODE, whe...
research
01/07/2021

Active Screening for Recurrent Diseases: A Reinforcement Learning Approach

Active screening is a common approach in controlling the spread of recur...

Please sign up or login with your details

Forgot password? Click here to reset