Adversarial Policies in Learning Systems with Malicious Experts

01/02/2020
by   S. Rasoul Etesami, et al.
0

We consider a learning system based on the conventional multiplicative weight (MW) rule that combines experts' advice to predict a sequence of true outcomes. It is assumed that one of the experts is malicious and aims to impose the maximum loss on the system. The loss of the system is naturally defined to be the aggregate absolute difference between the sequence of predicted outcomes and the true outcomes. We consider this problem under both offline and online settings. In the offline setting where the malicious expert must choose its entire sequence of decisions a priori, we show somewhat surprisingly that a simple greedy policy of always reporting false prediction is asymptotically optimal with an approximation ratio of 1+O(√(ln N/N)), where N is the total number of prediction stages. In particular, we describe a policy that closely resembles the structure of the optimal offline policy. For the online setting where the malicious expert can adaptively make its decisions, we show that the optimal online policy can be efficiently computed by solving a dynamic program in O(N^2). Our results provide a new direction for vulnerability assessment of commonly used learning algorithms to adversarial attacks where the threat is an integral part of the system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/18/2020

Malicious Experts versus the multiplicative weights algorithm in online prediction

We consider a prediction problem with two experts and a forecaster. We a...
research
02/07/2020

The Power of Linear Controllers in LQR Control

The Linear Quadratic Regulator (LQR) framework considers the problem of ...
research
04/13/2021

Sequential Ski Rental Problem

The classical 'buy or rent' ski-rental problem was recently considered i...
research
06/18/2012

Learning the Experts for Online Sequence Prediction

Online sequence prediction is the problem of predicting the next element...
research
08/07/2022

Optimal Tracking in Prediction with Expert Advice

We study the prediction with expert advice setting, where the aim is to ...
research
07/31/2020

Online Prediction With History-Dependent Experts: The General Case

We study the problem of prediction of binary sequences with expert advic...
research
09/13/2017

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are reward...

Please sign up or login with your details

Forgot password? Click here to reset