Data valuation: The partial ordinal Shapley value for machine learning

05/02/2023
by   Jie Liu, et al.
0

Data valuation using Shapley value has emerged as a prevalent research domain in machine learning applications. However, it is a challenge to address the role of order in data cooperation as most research lacks such discussion. To tackle this problem, this paper studies the definition of the partial ordinal Shapley value by group theory in abstract algebra. Besides, since the calculation of the partial ordinal Shapley value requires exponential time, this paper also gives three algorithms for approximating the results. The Truncated Monte Carlo algorithm is derived from the classic Shapley value approximation algorithm. The Classification Monte Carlo algorithm and the Classification Truncated Monte Carlo algorithm are based on the fact that the data points in the same class provide similar information, then we can accelerate the calculation by leaving out some data points in each class.

READ FULL TEXT
research
08/22/2019

Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms

Given a data set D containing millions of data points and a data consume...
research
05/31/2019

Ordinal Bucketing for Game Trees using Dynamic Quantile Approximation

In this paper, we present a simple and cheap ordinal bucketing algorithm...
research
09/05/2019

On ultrametric 1-median selection

Consider the problem of finding a point in an ultrametric space with the...
research
11/20/2019

Automatic Differentiable Monte Carlo: Theory and Application

Differentiable programming has emerged as a key programming paradigm emp...
research
01/03/2018

Optimal Learning from the Doob-Dynkin lemma

The Doob-Dynkin Lemma gives conditions on two functions X and Y that ens...
research
04/05/2019

Data Shapley: Equitable Valuation of Data for Machine Learning

As data becomes the fuel driving technological and economic growth, a fu...
research
12/22/2022

Scaffolding Generation using a 3D Physarum Polycephalum Simulation

In this demo, we present a novel technique for approximating topological...

Please sign up or login with your details

Forgot password? Click here to reset