Neural Episodic Control with State Abstraction

01/27/2023
by   Zhuo Li, et al.
2

Existing Deep Reinforcement Learning (DRL) algorithms suffer from sample inefficiency. Generally, episodic control-based approaches are solutions that leverage highly-rewarded past experiences to improve sample efficiency of DRL algorithms. However, previous episodic control-based approaches fail to utilize the latent information from the historical behaviors (e.g., state transitions, topological similarities, etc.) and lack scalability during DRL training. This work introduces Neural Episodic Control with State Abstraction (NECSA), a simple but effective state abstraction-based episodic control containing a more comprehensive episodic memory, a novel state evaluation, and a multi-step state analysis. We evaluate our approach to the MuJoCo and Atari tasks in OpenAI gym domains. The experimental results indicate that NECSA achieves higher sample efficiency than the state-of-the-art episodic control-based approaches. Our data and code are available at the project website[<https://sites.google.com/view/drl-necsa>].

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2022

BBReach: Tight and Scalable Black-Box Reachability Analysis of Deep Reinforcement Learning Systems

Reachability analysis is a promising technique to automatically prove or...
research
03/21/2021

Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study

Over the past several years there has been a considerable research inves...
research
02/09/2021

Measuring Progress in Deep Reinforcement Learning Sample Efficiency

Sampled environment transitions are a critical input to deep reinforceme...
research
06/13/2021

Learning on Abstract Domains: A New Approach for Verifiable Guarantee in Reinforcement Learning

Formally verifying Deep Reinforcement Learning (DRL) systems is a challe...
research
08/11/2023

Learning to Team-Based Navigation: A Review of Deep Reinforcement Learning Techniques for Multi-Agent Pathfinding

Multi-agent pathfinding (MAPF) is a critical field in many large-scale r...
research
06/15/2023

Evolutionary Curriculum Training for DRL-Based Navigation Systems

In recent years, Deep Reinforcement Learning (DRL) has emerged as a prom...
research
03/02/2018

Model-Free Control for Distributed Stream Data Processing using Deep Reinforcement Learning

In this paper, we focus on general-purpose Distributed Stream Data Proce...

Please sign up or login with your details

Forgot password? Click here to reset