Genetic Policy Optimization

11/03/2017
by   Tanmay Gangwani, et al.
0

Genetic algorithms have been widely used in many practical optimization problems. Inspired by natural selection, operators, including mutation, crossover and selection, provide effective heuristics for search and black-box optimization. However, they have not been shown useful for deep reinforcement learning, possibly due to the catastrophic consequence of parameter crossovers of neural networks. Here, we present Genetic Policy Optimization (GPO), a new genetic algorithm for sample-efficient deep policy optimization. GPO uses imitation learning for policy crossover in the state space and applies policy gradient methods for mutation. Our experiments on Mujoco tasks show that GPO as a genetic algorithm is able to provide superior performance over the state-of-the-art policy gradient methods and achieves comparable or higher sample efficiency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2016

On the performance of different mutation operators of a subpopulation-based genetic algorithm for multi-robot task allocation problems

The performance of different mutation operators is usually evaluated in ...
research
01/02/2022

Applications of Gaussian Mutation for Self Adaptation in Evolutionary Genetic Algorithms

In recent years, optimization problems have become increasingly more pre...
research
05/07/2019

REGAL: Transfer Learning For Fast Optimization of Computation Graphs

We present a deep reinforcement learning approach to optimizing the exec...
research
02/19/2019

Deep Reinforcement Learning using Genetic Algorithm for Parameter Optimization

Reinforcement learning (RL) enables agents to take decision based on a r...
research
01/08/2021

Learning Low-Correlation GPS Spreading Codes with a Policy Gradient Algorithm

With the birth of the next-generation GPS III constellation and the upco...
research
07/11/2019

Imitation-Projected Policy Gradient for Programmatic Reinforcement Learning

We present Imitation-Projected Policy Gradient (IPPG), an algorithmic fr...
research
01/13/2022

Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks

Neuroevolution has recently been shown to be quite competitive in reinfo...

Please sign up or login with your details

Forgot password? Click here to reset