Cooperative Deep Q-learning Framework for Environments Providing Image Feedback

10/28/2021
by   Krishnan Raghavan, et al.
0

In this paper, we address two key challenges in deep reinforcement learning setting, sample inefficiency and slow learning, with a dual NN-driven learning approach. In the proposed approach, we use two deep NNs with independent initialization to robustly approximate the action-value function in the presence of image inputs. In particular, we develop a temporal difference (TD) error-driven learning approach, where we introduce a set of linear transformations of the TD error to directly update the parameters of each layer in the deep NN. We demonstrate theoretically that the cost minimized by the error-driven learning (EDL) regime is an approximation of the empirical cost and the approximation error reduces as learning progresses, irrespective of the size of the network. Using simulation analysis, we show that the proposed methods enables faster learning and convergence and requires reduced buffer size (thereby increasing the sample efficiency).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2021

Learning to Control using Image Feedback

Learning to control complex systems using non-traditional feedback, e.g....
research
05/29/2022

Representation Gap in Deep Reinforcement Learning

Deep reinforcement learning gives the promise that an agent learns good ...
research
06/23/2020

The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning

Multi-step (also called n-step) methods in reinforcement learning (RL) h...
research
06/29/2023

Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning

We propose a novel value approximation method, namely Eigensubspace Regu...
research
10/09/2021

Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation

Recently, deep reinforcement learning (RL) has achieved remarkable empir...
research
09/17/2018

Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning

Motor control is a set of time-varying muscle excitations which generate...
research
01/01/2019

A Theoretical Analysis of Deep Q-Learning

Despite the great empirical success of deep reinforcement learning, its ...

Please sign up or login with your details

Forgot password? Click here to reset