Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

06/03/2019
by   Vahid Behzadan, et al.
0

This paper investigates the effectiveness of adversarial training in enhancing the robustness of Deep Q-Network (DQN) policies to state-space perturbations. We first present a formal analysis of adversarial training in DQN agents and its performance with respect to the proportion of adversarial perturbations to nominal observations used for training. Next, we consider the sample-inefficiency of current adversarial training techniques, and propose a novel Adversarially-Guided Exploration (AGE) mechanism based on a modified hybrid of the ϵ-greedy algorithm and Boltzmann exploration. We verify the feasibility of this exploration mechanism through experimental evaluation of its performance in comparison with the traditional decaying ϵ-greedy and parameter-space noise exploration algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2017

Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger

Recent developments have established the vulnerability of deep Reinforce...
research
06/07/2018

On Adversarial Risk and Training

In this work we formally define the notions of adversarial perturbations...
research
10/09/2019

Adversarial Training: embedding adversarial perturbations into the parameter space of a neural network to build a robust system

Adversarial training, in which a network is trained on both adversarial ...
research
01/12/2022

Towards Adversarially Robust Deep Image Denoising

This work systematically investigates the adversarial robustness of deep...
research
08/30/2021

Investigating Vulnerabilities of Deep Neural Policies

Reinforcement learning policies based on deep neural networks are vulner...
research
09/14/2021

Improving Gradient-based Adversarial Training for Text Classification by Contrastive Learning and Auto-Encoder

Recent work has proposed several efficient approaches for generating gra...
research
02/18/2019

Optimized data exploration applied to the simulation of a chemical process

In complex simulation environments, certain parameter space regions may ...

Please sign up or login with your details

Forgot password? Click here to reset