Robust Reinforcement Learning under model misspecification

03/29/2021
by   Lebin Yu, et al.
12

Reinforcement learning has achieved remarkable performance in a wide range of tasks these days. Nevertheless, some unsolved problems limit its applications in real-world control. One of them is model misspecification, a situation where an agent is trained and deployed in environments with different transition dynamics. We propose an novel framework that utilize history trajectory and Partial Observable Markov Decision Process Modeling to deal with this dilemma. Additionally, we put forward an efficient adversarial attack method to assist robust training. Our experiments in four gym domains validate the effectiveness of our framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
09/23/2019

PAC Reinforcement Learning without Real-World Feedback

This work studies reinforcement learning in the Sim-to-Real setting, in ...
research
05/11/2018

Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes

In recent years, reinforcement learning has achieved many remarkable suc...
research
11/14/2022

Parallel Automatic History Matching Algorithm Using Reinforcement Learning

Reformulating the history matching problem from a least-square mathemati...
research
09/11/2019

Correlation Priors for Reinforcement Learning

Many decision-making problems naturally exhibit pronounced structures in...
research
05/17/2018

Fast reinforcement learning for decentralized MAC optimization

In this paper, we propose a novel decentralized framework for optimizing...
research
02/15/2019

Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations

In real-world scenarios, the observation data for reinforcement learning...
research
11/29/2022

Discrete Control in Real-World Driving Environments using Deep Reinforcement Learning

Training self-driving cars is often challenging since they require a vas...

Please sign up or login with your details

Forgot password? Click here to reset