Planning From Pixels in Atari With Learned Symbolic Representations

12/16/2020
by   Andrea Dittadi, et al.
0

Width-based planning methods have been shown to yield state-of-the-art performance in the Atari 2600 domain using pixel input. One successful approach, RolloutIW, represents states with the B-PROST boolean feature set. An augmented version of RolloutIW, π-IW, shows that learned features can be competitive with handcrafted ones for width-based search. In this paper, we leverage variational autoencoders (VAEs) to learn features directly from pixels in a principled manner, and without supervision. The inference model of the trained VAEs extracts boolean features from pixels, and RolloutIW plans with these features. The resulting combination outperforms the original RolloutIW and human professional play on Atari 2600 and drastically reduces the size of the feature set.

READ FULL TEXT

page 6

page 15

page 16

research
01/15/2021

Hierarchical Width-Based Planning and Learning

Width-based search methods have demonstrated state-of-the-art performanc...
research
01/10/2018

Planning with Pixels in (Almost) Real Time

Recently, width-based planning methods have been shown to yield state-of...
research
04/12/2019

Deep Policies for Width-Based Planning in Pixel Domains

Width-based planning has demonstrated great success in recent years due ...
research
09/30/2021

Width-Based Planning and Active Learning for Atari

Width-based planning has shown promising results on Atari 2600 games usi...
research
06/09/2021

Planning for Novelty: Width-Based Algorithms for Common Problems in Control, Planning and Reinforcement Learning

Width-based algorithms search for solutions through a general definition...
research
12/21/2021

On the Size and Width of the Decoder of a Boolean Threshold Autoencoder

In this paper, we study the size and width of autoencoders consisting of...

Please sign up or login with your details

Forgot password? Click here to reset