Benchmarking projective simulation in navigation problems

04/23/2018
by Alexey A. Melnikov, et al.

Projective simulation (PS) is a model for intelligent agents with a deliberation capacity that is based on episodic memory. The model has been shown to provide a flexible framework for constructing reinforcement-learning agents, and it allows for a quantum-mechanical generalization, which leads to a speed-up in deliberation time. PS agents have been applied successfully to complex skill learning in robotics and to the design of state-of-the-art quantum experiments. In this paper, we study the performance of projective simulation in two benchmark navigation problems: the grid world and the mountain car problem. The performance of PS is compared to two standard tabular reinforcement-learning approaches, Q-learning and SARSA. Our comparison demonstrates that the performance of PS is qualitatively and quantitatively similar to that of the standard learning approaches, while optimal model parameters are much easier to choose in the case of projective simulation, reducing the computational effort by one to two orders of magnitude. Our results show that the projective simulation model stands out for its simplicity in terms of the number of model parameters, which makes it simple to set up the learning agent in unknown task environments.
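For readers unfamiliar with the two tabular baselines named in the abstract, the following is a minimal sketch of the textbook Q-learning and SARSA update rules applied to one grid-world transition. It is not taken from the paper: the function names, the grid coordinates, the action labels, and the values of the learning rate `alpha` and discount `gamma` are all assumptions chosen for illustration. Note that each rule already carries two tunable parameters before any exploration policy is added.

```python
from collections import defaultdict

# Hypothetical action labels for a grid world (an assumption, not from the paper).
ACTIONS = ["up", "down", "left", "right"]


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action value in s_next."""
    best_next = max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually taken in s_next."""
    q[(s, a)] += alpha * (r + gamma * q[(s_next, a_next)] - q[(s, a)])


def demo():
    """Apply both rules to the same transition and return the updated values."""
    s, a, s_next = (0, 0), "right", (0, 1)

    # Two identical tables in which only one next-state action has value.
    q_ql = defaultdict(float)
    q_sarsa = defaultdict(float)
    q_ql[(s_next, "right")] = 1.0
    q_sarsa[(s_next, "right")] = 1.0

    # Q-learning looks at the greedy next action ("right").
    q_learning_update(q_ql, s, a, 0.0, s_next)
    # SARSA uses the action the agent actually chose next (here "up").
    sarsa_update(q_sarsa, s, a, 0.0, s_next, "up")

    return q_ql[(s, a)], q_sarsa[(s, a)]


if __name__ == "__main__":
    print(demo())
```

In this sketch the two rules differ only in which next-state value they bootstrap from, which is why the same transition can move the two tables by different amounts.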


research
05/21/2014

Projective simulation applied to the grid-world and the mountain-car problem

We study the model of projective simulation (PS) which is a novel approa...
research
04/09/2015

Projective simulation with generalization

The ability to generalize is an important feature of any intelligent age...
research
09/16/2021

ROS-X-Habitat: Bridging the ROS Ecosystem with Embodied AI

We introduce ROS-X-Habitat, a software interface that bridges the AI Hab...
research
12/02/2022

Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning

Legged systems have many advantages when compared to their wheeled count...
research
02/21/2022

Autonomous Warehouse Robot using Deep Q-Learning

In warehouses, specialized agents need to navigate, avoid obstacles and ...
research
07/10/2020

NaviGAN: A Generative Approach for Socially Compliant Navigation

Robots navigating in human crowds need to optimize their paths not only ...
research
09/08/2022

Double Q-Learning for Citizen Relocation During Natural Hazards

Natural disasters can cause substantial negative socio-economic impacts ...
