Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning

05/30/2022
by   Yinglun Xu, et al.
0

We study data poisoning attacks on online deep reinforcement learning (DRL) where the attacker is oblivious to the learning algorithm used by the agent and does not necessarily have full knowledge of the environment. We demonstrate the intrinsic vulnerability of state-of-the-art DRL algorithms by designing a general reward poisoning framework called adversarial MDP attacks. We instantiate our framework to construct several new attacks which only corrupt the rewards for a small fraction of the total training timesteps and make the agent learn a low-performing policy. Our key insight is that the state-of-the-art DRL algorithms strategically explore the environment to find a high-performing policy. Our attacks leverage this insight to construct a corrupted environment for misleading the agent towards learning low-performing policies with a limited attack budget. We provide a theoretical analysis of the efficiency of our attack and perform an extensive evaluation. Our results show that our attacks efficiently poison agents learning with a variety of state-of-the-art DRL algorithms, such as DQN, PPO, SAC, etc. under several popular classical control and MuJoCo environments.

READ FULL TEXT

page 8

page 9

page 14

page 15

research
05/18/2023

Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning

We propose the first black-box targeted attack against online deep reinf...
research
05/12/2021

Adversarial Reinforcement Learning in Dynamic Channel Access and Power Control

Deep reinforcement learning (DRL) has recently been used to perform effi...
research
10/04/2021

Automating Privilege Escalation with Deep Reinforcement Learning

AI-based defensive solutions are necessary to defend networks and inform...
research
06/16/2021

Real-time Attacks Against Deep Reinforcement Learning Policies

Recent work has discovered that deep reinforcement learning (DRL) polici...
research
03/01/2019

TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents

Recent work has identified that classification models implemented as neu...
research
07/16/2018

Online Robust Policy Learning in the Presence of Unknown Adversaries

The growing prospect of deep reinforcement learning (DRL) being used in ...
research
07/19/2022

Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning

Although Deep Reinforcement Learning (DRL) has been popular in many disc...

Please sign up or login with your details

Forgot password? Click here to reset