Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

06/03/2019
by   Vahid Behzadan, et al.
0

This paper proposes a novel scheme for the watermarking of Deep Reinforcement Learning (DRL) policies. This scheme provides a mechanism for the integration of a unique identifier within the policy in the form of its response to a designated sequence of state transitions, while incurring minimal impact on the nominal performance of the policy. The applications of this watermarking scheme include detection of unauthorized replications of proprietary policies, as well as enabling the graceful interruption or termination of DRL activities by authorized entities. We demonstrate the feasibility of our proposal via experimental evaluation of watermarking a DQN policy trained in the Cartpole environment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2019

RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies

This paper investigates the resilience and robustness of Deep Reinforcem...
research
10/26/2021

Learning Collaborative Policies to Solve NP-hard Routing Problems

Recently, deep reinforcement learning (DRL) frameworks have shown potent...
research
03/22/2023

P^3O: Transferring Visual Representations for Reinforcement Learning via Prompting

It is important for deep reinforcement learning (DRL) algorithms to tran...
research
06/26/2018

Deictic Image Maps: An Abstraction For Learning Pose Invariant Manipulation Policies

In applications of deep reinforcement learning to robotics, it is often ...
research
10/13/2022

Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks

Payment channel networks (PCNs) are a layer-2 blockchain scalability sol...
research
07/27/2023

FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks

We propose FLARE, the first fingerprinting mechanism to verify whether a...
research
08/02/2017

Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings

This paper is the first attempt to learn the policy of an inquiry dialog...

Please sign up or login with your details

Forgot password? Click here to reset