Lessons from the AdKDD'21 Privacy-Preserving ML Challenge

01/31/2022
by Eustache Diemert, et al.

Designing data sharing mechanisms that provide both performance and strong privacy guarantees is a hot topic for the online advertising industry. In particular, a prominent proposal discussed in the Improving Web Advertising Business Group at W3C only allows sharing advertising signals through aggregated, differentially private reports of past displays. To study this proposal extensively, an open Privacy-Preserving Machine Learning Challenge took place at AdKDD'21, a premier workshop on Advertising Science, with data provided by the advertising company Criteo. In this paper, we describe the challenge tasks and the structure of the available datasets, report the challenge results, and enable their full reproducibility. A key finding is that learning models on large, aggregated data in the presence of a small set of unaggregated data points can be surprisingly efficient and cheap. We also run additional experiments to observe the sensitivity of the winning methods to parameters such as the privacy budget or the quantity of available privileged side information. We conclude that the industry needs either alternate designs for private data sharing or a breakthrough in learning with aggregated data only to keep ad relevance at a reasonable level.
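
To make the notion of an "aggregated, differentially private report of past displays" concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the challenge's actual pipeline: the abstract does not specify the noise mechanism, so Laplace noise is assumed here, and the names `aggregated_report`, `laplace_noise`, the `"click"` field, and the public `domain` of feature values are all hypothetical.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def aggregated_report(displays, feature, domain, epsilon, rng):
    """Noisy (display count, click count) for each value of `feature`.

    Assumptions (not from the source): Laplace mechanism, and `domain`
    is a publicly known list of the values `feature` can take.
    """
    counts, clicks = Counter(), Counter()
    for d in displays:
        counts[d[feature]] += 1
        clicks[d[feature]] += d["click"]  # click is 0 or 1

    # Adding or removing one display changes one display count by 1 and
    # at most one click count by 1, so the L1 sensitivity is 2.
    scale = 2.0 / epsilon
    return {
        v: (counts[v] + laplace_noise(scale, rng),
            clicks[v] + laplace_noise(scale, rng))
        for v in domain
    }


# Toy usage: three displays, aggregated by a single categorical feature.
displays = [
    {"campaign": "a", "click": 1},
    {"campaign": "a", "click": 0},
    {"campaign": "b", "click": 0},
]
report = aggregated_report(displays, "campaign", ["a", "b", "c"],
                           epsilon=1.0, rng=random.Random(0))
```

A learner in this setting would only see `report`, never the individual rows in `displays`. A smaller `epsilon` (a tighter privacy budget) yields a larger noise scale and therefore less informative counts.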

