Adversarial Attacks on Online Learning to Rank with Click Feedback

05/26/2023
by   Jinhang Zuo, et al.
0

Online learning to rank (OLTR) is a sequential decision-making problem where a learning agent selects an ordered list of items and receives feedback through user clicks. Although potential attacks against OLTR algorithms may cause serious losses in real-world applications, little is known about adversarial attacks on OLTR. This paper studies attack strategies against multiple variants of OLTR. Our first result provides an attack strategy against the UCB algorithm on classical stochastic bandits with binary feedback, which solves the key issues caused by bounded and discrete feedback that previous works can not handle. Building on this result, we design attack algorithms against UCB-based OLTR algorithms in position-based and cascade models. Finally, we propose a general attack strategy against any algorithm under the general click model. Each attack algorithm manipulates the learning agent into choosing the target attack item T-o(T) times, incurring a cumulative cost of o(T). Experiments on synthetic and real data further validate the effectiveness of our proposed attack algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2023

Adversarial Attacks on Online Learning to Rank with Stochastic Click Models

We propose the first study of adversarial attacks on online learning to ...
research
06/06/2018

TopRank: A practical algorithm for online stochastic ranking

Online learning to rank is a sequential decision-making problem where in...
research
08/27/2018

Data Poisoning Attacks against Online Learning

We consider data poisoning attacks, a class of adversarial attacks on ma...
research
10/18/2021

When Are Linear Stochastic Bandits Attackable?

We study adversarial attacks on linear stochastic bandits, a sequential ...
research
03/07/2017

Online Learning to Rank in Stochastic Click Models

Online learning to rank is a core problem in information retrieval and m...
research
04/03/2023

Is Stochastic Mirror Descent Vulnerable to Adversarial Delay Attacks? A Traffic Assignment Resilience Study

Intelligent Navigation Systems (INS) are exposed to an increasing number...
research
07/20/2023

Adversarial attacks for mixtures of classifiers

Mixtures of classifiers (a.k.a. randomized ensembles) have been proposed...

Please sign up or login with your details

Forgot password? Click here to reset