Collaborative Machine Learning Markets with Data-Replication-Robust Payments

11/08/2019
by   Olga Ohrimenko, et al.
0

We study the problem of collaborative machine learning markets where multiple parties can achieve improved performance on their machine learning tasks by combining their training data. We discuss desired properties for these machine learning markets in terms of fair revenue distribution and potential threats, including data replication. We then instantiate a collaborative market for cases where parties share a common machine learning task and where parties' tasks are different. Our marketplace incentivizes parties to submit high quality training and true validation data. To this end, we introduce a novel payment division function that is robust-to-replication and customized output models that perform well only on requested machine learning tasks. In experiments, we validate the assumptions underlying our theoretical analysis and show that these are approximately satisfied for commonly used machine learning models.

READ FULL TEXT
research
08/18/2021

Data Pricing in Machine Learning Pipelines

Machine learning is disruptive. At the same time, machine learning can o...
research
06/25/2020

Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces

The ever-increasing take-up of machine learning techniques requires ever...
research
07/15/2020

Differential Replication in Machine Learning

When deployed in the wild, machine learning models are usually confronte...
research
01/07/2019

A New Perspective on Machine Learning: How to do Perfect Supervised Learning

In this work, we introduce the concept of bandlimiting into the theory o...
research
06/13/2018

Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining

The use of machine learning techniques has expanded in education researc...
research
10/24/2020

Collaborative Machine Learning with Incentive-Aware Model Rewards

Collaborative machine learning (ML) is an appealing paradigm to build hi...
research
06/05/2023

Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm

High-quality machine learning models are dependent on access to high-qua...

Please sign up or login with your details

Forgot password? Click here to reset