Mimicking Evolution with Reinforcement Learning

03/31/2020
by   João P. Abrantes, et al.
0

Evolution gave rise to human and animal intelligence here on Earth. We argue that the path to developing artificial human-like-intelligence will pass through mimicking the evolutionary process in a nature-like simulation. In Nature, there are two processes driving the development of the brain: evolution and learning. Evolution acts slowly, across generations, and amongst other things, it defines what agents learn by changing their internal reward function. Learning acts fast, across one's lifetime, and it quickly updates agents' policy to maximise pleasure and minimise pain. The reward function is slowly aligned with the fitness function by evolution, however, as agents evolve the environment and its fitness function also change, increasing the misalignment between reward and fitness. It is extremely computationally expensive to replicate these two processes in simulation. This work proposes Evolution via Evolutionary Reward (EvER) that allows learning to single-handedly drive the search for policies with increasingly evolutionary fitness by ensuring the alignment of the reward function with the fitness function. In this search, EvER makes use of the whole state-action trajectories that agents go through their lifetime. In contrast, current evolutionary algorithms discard this information and consequently limit their potential efficiency at tackling sequential decision problems. We test our algorithm in two simple bio-inspired environments and show its superiority at generating more capable agents at surviving and reproducing their genes when compared with a state-of-the-art evolutionary algorithm.

READ FULL TEXT
research
05/08/2019

Learning to Evolve

Evolution and learning are two of the fundamental mechanisms by which li...
research
03/29/2022

Efficiently Evolving Swarm Behaviors Using Grammatical Evolution With PPA-style Behavior Trees

Evolving swarm behaviors with artificial agents is computationally expen...
research
12/02/2020

Policy Supervectors: General Characterization of Agents by their Behaviour

By studying the underlying policies of decision-making agents, we can le...
research
09/02/2019

Evolutionary reinforcement learning of dynamical large deviations

We show how to calculate dynamical large deviations using evolutionary r...
research
07/07/2022

Evolutionary Stability of Other-Regarding Preferences Under Complexity Costs

The evolution of preferences that account for other agents' fitness, or ...
research
01/09/2010

Incorporating characteristics of human creativity into an evolutionary art algorithm

A perceived limitation of evolutionary art and design algorithms is that...
research
09/07/2016

Non-Evolutionary Superintelligences Do Nothing, Eventually

There is overwhelming evidence that human intelligence is a product of D...

Please sign up or login with your details

Forgot password? Click here to reset