Stealing Deep Reinforcement Learning Models for Fun and Profit

06/09/2020
by   Kangjie Chen, et al.
10

In this paper, we present the first attack methodology to extract black-box Deep Reinforcement Learning (DRL) models only from their actions with the environment. Model extraction attacks against supervised Deep Learning models have been widely studied. However, those techniques cannot be applied to the reinforcement learning scenario due to DRL models' high complexity, stochasticity and limited observable information. Our methodology overcomes those challenges by proposing two techniques. The first technique is an RNN classifier which can reveal the training algorithms of the target black-box DRL model only based on its predicted actions. The second technique is the adoption of imitation learning to replicate the model from the extracted training algorithm. Experimental results indicate that the integration of these two techniques can effectively recover the DRL models with high fidelity. We also demonstrate a use case to show that our model extraction attack can significantly improve the success rate of adversarial attacks, making the DRL models more vulnerable.

READ FULL TEXT

page 2

page 4

page 5

page 7

research
06/03/2019

Adversarial Exploitation of Policy Imitation

This paper investigates a class of attacks targeting the confidentiality...
research
05/14/2020

Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning

Adversarial attacks against conventional Deep Learning (DL) systems and ...
research
07/21/2019

Characterizing Attacks on Deep Reinforcement Learning

Deep reinforcement learning (DRL) has achieved great success in various ...
research
05/18/2023

Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning

We propose the first black-box targeted attack against online deep reinf...
research
10/22/2020

Adversarial Attacks on Deep Algorithmic Trading Policies

Deep Reinforcement Learning (DRL) has become an appealing solution to al...
research
08/02/2021

Adversarial Attacks Against Deep Reinforcement Learning Framework in Internet of Vehicles

Machine learning (ML) has made incredible impacts and transformations in...
research
06/14/2021

Learning-Aided Heuristics Design for Storage System

Computer systems such as storage systems normally require transparent wh...

Please sign up or login with your details

Forgot password? Click here to reset