DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning

01/31/2022
by   Hassam Sheikh, et al.
0

Application of ensemble of neural networks is becoming an imminent tool for advancing the state-of-the-art in deep reinforcement learning algorithms. However, training these large numbers of neural networks in the ensemble has an exceedingly high computation cost which may become a hindrance in training large-scale systems. In this paper, we propose DNS: a Determinantal Point Process based Neural Network Sampler that specifically uses k-dpp to sample a subset of neural networks for backpropagation at every training step thus significantly reducing the training time and computation cost. We integrated DNS in REDQ for continuous control tasks and evaluated on MuJoCo environments. Our experiments show that DNS augmented REDQ outperforms baseline REDQ in terms of average cumulative reward and achieves this using less than 50 when measured in FLOPS.

READ FULL TEXT
research
01/19/2020

FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback

Reinforcement learning has been successful in training autonomous agents...
research
06/08/2021

Dynamic Sparse Training for Deep Reinforcement Learning

Deep reinforcement learning has achieved significant success in many dec...
research
06/14/2016

Model-Free Episodic Control

State of the art deep reinforcement learning algorithms take many millio...
research
11/28/2022

AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning

Deep reinforcement learning has achieved great success in various fields...
research
02/10/2023

Gauge-equivariant neural networks as preconditioners in lattice QCD

We demonstrate that a state-of-the art multi-grid preconditioner can be ...
research
09/07/2017

A Deep Reinforcement Learning Chatbot

We present MILABOT: a deep reinforcement learning chatbot developed by t...
research
09/15/2018

Towards Better Interpretability in Deep Q-Networks

Deep reinforcement learning techniques have demonstrated superior perfor...

Please sign up or login with your details

Forgot password? Click here to reset