Adversary A3C for Robust Reinforcement Learning

12/01/2019
by Zhaoyuan Gu, et al.

Asynchronous Advantage Actor Critic (A3C) is an effective Reinforcement Learning (RL) algorithm for a wide range of tasks, such as Atari games and robot control. The agent learns policies and a value function through trial-and-error interactions with the environment until it converges to an optimal policy. Robustness and stability are critical in RL; however, neural networks can be vulnerable to noise from unexpected sources and are unlikely to withstand even very slight disturbances. We note that agents trained with A3C in mild environments are not able to handle challenging environments. Drawing on the idea of learning from adversarial examples, we propose an algorithm called Adversary Robust A3C (AR-A3C) to improve the agent's performance in noisy environments. In this algorithm, an adversarial agent is introduced into the learning process to make the agent more robust against adversarial disturbances, thereby making it more adaptive to noisy environments. Both simulations and real-world experiments are carried out to illustrate the stability of the proposed algorithm. The AR-A3C algorithm outperforms A3C in both clean and noisy environments.
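The interaction structure described above, in which a protagonist agent and an adversarial agent act on the same environment, can be illustrated with a minimal sketch. This is not the AR-A3C implementation: the toy environment `NoisyPointEnv`, the `rollout` helper, the hand-written policies, and the zero-sum reward are all assumptions made for illustration, and no actor-critic training is performed here.

```python
class NoisyPointEnv:
    """Hypothetical 1-D task: keep a point close to the origin."""

    def __init__(self, adversary_scale=0.5):
        # Assumption: the adversary's force is weaker than the agent's.
        self.adversary_scale = adversary_scale
        self.x = 0.0

    def reset(self):
        self.x = 1.0
        return self.x

    def step(self, agent_action, adversary_action):
        # Clip both actions to [-1, 1] before applying them.
        a = max(-1.0, min(1.0, agent_action))
        d = max(-1.0, min(1.0, adversary_action))
        self.x += 0.1 * (a + self.adversary_scale * d)
        agent_reward = -abs(self.x)
        # Assumption: zero-sum game, the adversary gains what the agent loses.
        adversary_reward = -agent_reward
        return self.x, agent_reward, adversary_reward


def rollout(env, agent_policy, adversary_policy, steps=50):
    """Run one episode and return (agent return, adversary return)."""
    x = env.reset()
    total_agent, total_adversary = 0.0, 0.0
    for _ in range(steps):
        x, r_agent, r_adv = env.step(agent_policy(x), adversary_policy(x))
        total_agent += r_agent
        total_adversary += r_adv
    return total_agent, total_adversary


def agent_policy(x):
    # Fixed stand-in for a learned policy: push toward the origin.
    return -1.0 if x > 0 else 1.0


def no_adversary(x):
    # Clean environment: no disturbance.
    return 0.0


def pushing_adversary(x):
    # Fixed stand-in for a learned adversary: push away from the origin.
    return 1.0 if x >= 0 else -1.0


clean_return, _ = rollout(NoisyPointEnv(), agent_policy, no_adversary)
adv_return, adv_gain = rollout(NoisyPointEnv(), agent_policy, pushing_adversary)
print(clean_return, adv_return)
```

In a training setup along these lines, both fixed policies would be replaced by learned networks, with the adversary updated to maximize its own return while the protagonist learns to perform well despite the disturbance.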


research
06/09/2021

Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL

Evaluating the worst-case performance of a reinforcement learning (RL) a...
research
04/20/2022

SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics

Although Reinforcement Learning (RL) is effective for sequential decisio...
research
06/19/2020

NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration

Deep reinforcement learning has been applied more and more widely nowada...
research
01/12/2022

Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees

In this work we present a preliminary investigation of a novel algorithm...
research
02/14/2023

Regret-Based Optimization for Robust Reinforcement Learning

Deep Reinforcement Learning (DRL) policies have been shown to be vulnera...
research
03/08/2017

Robust Adversarial Reinforcement Learning

Deep neural networks coupled with fast simulation and improved computati...
research
03/06/2023

Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment

Reinforcement learning (RL) has recently proven itself as a powerful ins...
