Adversarial Cheap Talk

11/20/2022
by   Chris Lu, et al.
0

Adversarial attacks in reinforcement learning (RL) often assume highly-privileged access to the victim's parameters, environment, or data. Instead, this paper proposes a novel adversarial setting called a Cheap Talk MDP in which an Adversary can merely append deterministic messages to the Victim's observation, resulting in a minimal range of influence. The Adversary cannot occlude ground truth, influence underlying environment dynamics or reward signals, introduce non-stationarity, add stochasticity, see the Victim's actions, or access their parameters. Additionally, we present a simple meta-learning algorithm called Adversarial Cheap Talk (ACT) to train Adversaries in this setting. We demonstrate that an Adversary trained with ACT can still significantly influence the Victim's training and testing performance, despite the highly constrained setting. Affecting train-time performance reveals a new attack vector and provides insight into the success and failure modes of existing RL algorithms. More specifically, we show that an ACT Adversary is capable of harming performance by interfering with the learner's function approximation, or instead helping the Victim's performance by outputting useful features. Finally, we show that an ACT Adversary can manipulate messages during train-time to directly and arbitrarily control the Victim at test-time.

READ FULL TEXT

page 9

page 23

research
01/21/2021

Robust Reinforcement Learning on State Observations with Learned Optimal Adversary

We study the robustness of reinforcement learning (RL) with adversariall...
research
02/16/2021

Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments

We study black-box reward poisoning attacks against reinforcement learni...
research
05/28/2019

Snooping Attacks on Deep Reinforcement Learning

Adversarial attacks have exposed a significant security vulnerability in...
research
01/14/2021

How to Attack and Defend 5G Radio Access Network Slicing with Reinforcement Learning

Reinforcement learning (RL) for network slicing is considered in the 5G ...
research
10/13/2022

Observed Adversaries in Deep Reinforcement Learning

In this work, we point out the problem of observed adversaries for deep ...
research
02/01/2019

Optimal Adversarial Attack on Autoregressive Models

We investigate optimal adversarial attacks against time series forecast ...
research
06/06/2018

Adversarial Regression with Multiple Learners

Despite the considerable success enjoyed by machine learning techniques ...

Please sign up or login with your details

Forgot password? Click here to reset