Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

02/23/2020
by   Xiao Ma, et al.

Deep reinforcement learning is successful in decision making for sophisticated games, such as Atari and Go. However, real-world decision making often requires reasoning with partial information extracted from complex visual observations. This paper presents Discriminative Particle Filter Reinforcement Learning (DPFRL), a new reinforcement learning framework for complex partial observations. DPFRL encodes a differentiable particle filter in the neural network policy for explicit reasoning with partial observations over time. The particle filter maintains a belief using a learned discriminative update, which is trained end-to-end for decision making. We show that using the discriminative update instead of standard generative models results in significantly improved performance, especially for tasks with complex visual observations, because it circumvents the difficulty of modeling complex observations that are irrelevant to decision making. In addition, to extract features from the particle belief, we propose a new type of belief feature based on the moment generating function. DPFRL outperforms state-of-the-art POMDP RL models in Flickering Atari Games, an existing POMDP RL benchmark, and in Natural Flickering Atari Games, a new, more challenging POMDP RL benchmark introduced in this paper. Further, DPFRL performs well for visual navigation with real-world data in the Habitat environment.
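To make the two ideas in the abstract concrete, here is a minimal sketch of a discriminative particle update followed by moment-generating-function belief features. It is an illustration under stated assumptions, not the authors' implementation: the function names, the toy transition and compatibility functions, and the choice of directions `v` are all hypothetical stand-ins for components that would be learned networks, and actions and resampling are omitted for brevity. The point it shows is that each particle is reweighted by a score for how well it matches the observation, without ever modeling the observation itself.

```python
import math
import random

def discriminative_update(particles, log_weights, obs, transition, compatibility):
    """One belief update: move each particle, then reweight it by a score."""
    new_particles = [transition(h, obs) for h in particles]
    # compatibility returns an unnormalised log-score for (particle, observation);
    # no observation likelihood p(o | h) is modelled or sampled here.
    scores = [lw + compatibility(h, obs)
              for h, lw in zip(new_particles, log_weights)]
    # Normalise in log space (log-sum-exp) so the weights sum to one.
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return new_particles, [s - log_z for s in scores]

def mgf_features(particles, log_weights, directions):
    """Belief features: weighted mean plus M(v) = sum_i w_i * exp(v . h_i)."""
    weights = [math.exp(lw) for lw in log_weights]
    dim = len(particles[0])
    mean = [sum(w * h[d] for w, h in zip(weights, particles)) for d in range(dim)]
    mgf = [sum(w * math.exp(sum(vd * hd for vd, hd in zip(v, h)))
               for w, h in zip(weights, particles))
           for v in directions]
    return mean + mgf

# Toy stand-ins for learned networks (hypothetical, for illustration only).
def toy_transition(h, obs):
    return [0.9 * x + 0.1 * obs for x in h]

def toy_compatibility(h, obs):
    return -sum((x - obs) ** 2 for x in h)

random.seed(0)
particles = [[random.gauss(0.0, 1.0) for _ in range(2)] for _ in range(8)]
log_weights = [math.log(1.0 / 8)] * 8
particles, log_weights = discriminative_update(
    particles, log_weights, obs=0.5,
    transition=toy_transition, compatibility=toy_compatibility)
features = mgf_features(particles, log_weights,
                        directions=[[0.1, 0.0], [0.0, 0.1]])
```

In this sketch `features` would be the input to a policy network: two entries for the belief mean and one entry per direction `v`. Evaluating the moment generating function at several directions summarizes higher-order moments of the particle set that the mean alone cannot capture.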

research
08/06/2020

Contrastive Variational Model-Based Reinforcement Learning for Complex Observations

Deep model-based reinforcement learning (MBRL) has achieved great sample...
research
04/22/2021

Reinforcement Learning using Guided Observability

Due to recent breakthroughs, reinforcement learning (RL) has demonstrate...
research
11/16/2020

Blind Decision Making: Reinforcement Learning with Delayed Observations

Reinforcement learning typically assumes that the state update from the ...
research
09/01/2023

End-to-end Lidar-Driven Reinforcement Learning for Autonomous Racing

Reinforcement Learning (RL) has emerged as a transformative approach in ...
research
05/23/2018

Particle Filter Networks: End-to-End Probabilistic Localization From Visual Observations

Particle filters sequentially approximate posterior distributions by sam...
research
10/04/2018

Image-based Guidance of Autonomous Aircraft for Wildfire Surveillance and Prediction

Small unmanned aircraft can help firefighters combat wildfires by provid...
research
03/28/2017

Inverse Reinforcement Learning from Incomplete Observation Data

Inverse reinforcement learning (IRL) aims to explain observed strategic ...
