Computing a Data Dividend

05/06/2019
by   Eric Bax, et al.
0

Quality data is a fundamental contributor to success in statistics and machine learning. If a statistical assessment or machine learning leads to decisions that create value, data contributors may want a share of that value. This paper presents methods to assess the value of individual data samples, and of sets of samples, to apportion value among different data contributors. We use Shapley values for individual samples and Owen values for combined samples, and show that these values can be computed in polynomial time in spite of their definitions having numbers of terms that are exponential in the number of samples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2023

Computation of Reliability Statistics for Finite Samples of Success-Failure Experiments

Computational method for statistical measures of reliability, confidence...
research
11/12/2018

What is my data worth? From data properties to data value

Data today fuels both the economy and advances in machine learning and A...
research
02/11/2022

Pseudo Polynomial-Time Top-k Algorithms for d-DNNF Circuits

We are interested in computing k most preferred models of a given d-DNNF...
research
11/02/2020

p-value peeking and estimating extrema

A pervasive issue in statistical hypothesis testing is that the reported...
research
06/14/2020

High-precision Wasserstein barycenters in polynomial time

Computing Wasserstein barycenters is a fundamental geometric problem wit...
research
11/22/2020

A decentralized aggregation mechanism for training deep learning models using smart contract system for bank loan prediction

Data privacy and sharing has always been a critical issue when trying to...
research
09/20/2021

Fast TreeSHAP: Accelerating SHAP Value Computation for Trees

SHAP (SHapley Additive exPlanation) values are one of the leading tools ...

Please sign up or login with your details

Forgot password? Click here to reset