Policy Manifold Search: Exploring the Manifold Hypothesis for Diversity-based Neuroevolution

04/27/2021
by Nemanja Rakicevic, et al.

Neuroevolution is an alternative to gradient-based optimisation that has the potential to avoid local minima and allows parallelisation. Its main limiting factor is that it usually does not scale well with the dimensionality of the parameter space. Inspired by recent work examining neural network intrinsic dimension and loss landscapes, we hypothesise that there exists a low-dimensional manifold, embedded in the policy network parameter space, around which a high density of diverse and useful policies is located. This paper proposes a novel method for diversity-based policy search via Neuroevolution that learns a representation of the policy network parameters and performs policy search in this learned representation space. Our method relies on the Quality-Diversity (QD) framework, which provides a principled approach to policy search and maintains a collection of diverse policies that serves as a dataset for learning policy representations. Further, we use the Jacobian of the inverse-mapping function to guide the search in the representation space. This ensures that the generated samples remain in the high-density regions after mapping back to the original space. Finally, we evaluate our contributions on four continuous-control tasks in simulated environments and compare them against diversity-based baselines.
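To make the idea of a Jacobian-guided search step more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy two-layer decoder as the inverse mapping from the representation space to policy parameters, and it assumes one plausible way of using the Jacobian: shaping the latent-space mutation covariance as sigma^2 (J^T J)^{-1}, so that a perturbation has roughly uniform scale once mapped back to parameter space. The names `decoder`, `decoder_jacobian`, `jacobian_scaled_mutation`, `W1`, `W2`, `sigma` and `eps` are all hypothetical.

```python
import numpy as np


def decoder(z, W1, W2):
    """Hypothetical inverse mapping: latent code z -> policy parameters."""
    return W2 @ np.tanh(W1 @ z)


def decoder_jacobian(z, W1, W2):
    """Analytic Jacobian of the toy decoder, shape (param_dim, latent_dim)."""
    h = np.tanh(W1 @ z)
    # d tanh(u)/du = 1 - tanh(u)^2, applied row-wise to W1.
    return W2 @ ((1.0 - h**2)[:, None] * W1)


def jacobian_scaled_mutation(z, W1, W2, sigma=0.1, rng=None, eps=1e-6):
    """Sample a mutated latent code with covariance sigma^2 (J^T J + eps I)^-1.

    Assumption: directions in which the decoder is very sensitive get
    smaller latent steps, and insensitive directions get larger ones.
    """
    rng = np.random.default_rng() if rng is None else rng
    J = decoder_jacobian(z, W1, W2)
    metric = J.T @ J + eps * np.eye(z.size)  # eps keeps the inverse stable
    cov = sigma**2 * np.linalg.inv(metric)
    cov = 0.5 * (cov + cov.T)  # remove numerical asymmetry
    return rng.multivariate_normal(z, cov)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    latent_dim, hidden_dim, param_dim = 2, 8, 20
    W1 = rng.normal(size=(hidden_dim, latent_dim))
    W2 = rng.normal(size=(param_dim, hidden_dim))

    z_parent = rng.normal(size=latent_dim)
    z_child = jacobian_scaled_mutation(z_parent, W1, W2, sigma=0.1, rng=rng)
    theta_child = decoder(z_child, W1, W2)  # candidate policy parameters
    print(z_child.shape, theta_child.shape)
```

In a Quality-Diversity loop, `z_parent` would be the encoded parameters of a policy drawn from the collection, and `theta_child` would be evaluated and considered for insertion. In practice the decoder would be a trained model and its Jacobian would typically come from automatic differentiation rather than a hand-written formula.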

