Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces

06/25/2020
by   Dongge Han, et al.
15

The ever-increasing take-up of machine learning techniques requires ever-more application-specific training data. Manually collecting such training data is a tedious and time-consuming process. Data marketplaces represent a compelling alternative, providing an easy way for acquiring data from potential data providers. A key component of such marketplaces is the compensation mechanism for data providers. Classic payoff-allocation methods such as the Shapley value can be vulnerable to data-replication attacks, and are infeasible to compute in the absence of efficient approximation algorithms. To address these challenges, we present an extensive theoretical study on the vulnerabilities of game theoretic payoff-allocation schemes to replication attacks. Our insights apply to a wide range of payoff-allocation schemes, and enable the design of customised replication-robust payoff-allocations. Furthermore, we present a novel efficient sampling algorithm for approximating payoff-allocation schemes based on marginal contributions. In our experiments, we validate the replication-robustness of classic payoff-allocation schemes and new payoff-allocation schemes derived from our theoretical insights. We also demonstrate the efficiency of our proposed sampling algorithm on a wide range of machine learning tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2019

Collaborative Machine Learning Markets with Data-Replication-Robust Payments

We study the problem of collaborative machine learning markets where mul...
research
06/13/2018

Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining

The use of machine learning techniques has expanded in education researc...
research
06/30/2022

Bio-inspired Machine Learning: programmed death and replication

We analyze algorithmic and computational aspects of biological phenomena...
research
05/14/2018

Early Scheduling in Parallel State Machine Replication

State machine replication is standard approach to fault tolerance. One o...
research
12/04/2019

A Survey of Game Theoretic Approaches for Adversarial Machine Learning in Cybersecurity Tasks

Machine learning techniques are currently used extensively for automatin...
research
06/05/2023

Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm

High-quality machine learning models are dependent on access to high-qua...
research
02/04/2023

Getting to "rate-optimal” in ranking selection

In their 2004 seminal paper, Glynn and Juneja formally and precisely est...

Please sign up or login with your details

Forgot password? Click here to reset