Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation

10/09/2021
by   Junhong Shen, et al.
0

Recently, deep reinforcement learning (RL) has achieved remarkable empirical success by integrating deep neural networks into RL frameworks. However, these algorithms often require a large number of training samples and admit little theoretical understanding. To mitigate these issues, we propose a theoretically principled nearest neighbor (NN) function approximator that can improve the value networks in deep RL methods. Inspired by human similarity judgments, the NN approximator estimates the action values using rollouts on past observations and can provably obtain a small regret bound that depends only on the intrinsic complexity of the environment. We present (1) Nearest Neighbor Actor-Critic (NNAC), an online policy gradient algorithm that demonstrates the practicality of combining function approximation with deep RL, and (2) a plug-and-play NN update module that aids the training of existing deep RL methods. Experiments on classical control and MuJoCo locomotion tasks show that the NN-accelerated agents achieve higher sample efficiency and stability than the baseline agents. Based on its theoretical benefits, we believe that the NN approximator can be further applied to other complex domains to speed-up learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2023

Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning

The synergies between Quality-Diversity (QD) and Deep Reinforcement Lear...
research
01/01/2019

Complementary reinforcement learning toward explainable agents

Reinforcement learning (RL) algorithms allow agents to learn skills and ...
research
07/16/2021

Nearest neighbor Methods and their Applications in Design of 5G Beyond Wireless Networks

In this paper, we present an overview of Nearest neighbor (NN) methods, ...
research
09/25/2019

Benefit of Interpolation in Nearest Neighbor Algorithms

The over-parameterized models attract much attention in the era of data ...
research
08/24/2020

Improved Memories Learning

We propose Improved Memories Learning (IMeL), a novel algorithm that tur...
research
10/28/2021

Cooperative Deep Q-learning Framework for Environments Providing Image Feedback

In this paper, we address two key challenges in deep reinforcement learn...
research
06/14/2022

Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization

Policy-gradient methods in Reinforcement Learning(RL) are very universal...

Please sign up or login with your details

Forgot password? Click here to reset