Practical Coreset Constructions for Machine Learning

03/19/2017
by   Olivier Bachem, et al.
0

We investigate coresets - succinct, small summaries of large data sets - so that solutions found on the summary are provably competitive with solution found on the full data set. We provide an overview over the state-of-the-art in coreset construction for machine learning. In Section 2, we present both the intuition behind and a theoretically sound framework to construct coresets for general problems and apply it to k-means clustering. In Section 3 we summarize existing coreset construction algorithms for a variety of machine learning problems such as maximum likelihood estimation of mixture models, Bayesian non-parametric models, principal component analysis, regression and general empirical risk minimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2017

Scalable and Distributed Clustering via Lightweight Coresets

Coresets are compact representations of data sets such that models train...
research
04/11/2019

Robust Coreset Construction for Distributed Machine Learning

Motivated by the need of solving machine learning problems over distribu...
research
06/18/2021

On the benefits of maximum likelihood estimation for Regression and Forecasting

We advocate for a practical Maximum Likelihood Estimation (MLE) approach...
research
08/21/2015

Strong Coresets for Hard and Soft Bregman Clustering with Applications to Exponential Family Mixtures

Coresets are efficient representations of data sets such that models tra...
research
10/03/2016

cleverhans v2.0.0: an adversarial machine learning library

cleverhans is a software library that provides standardized reference im...
research
05/02/2016

Tradeoffs for Space, Time, Data and Risk in Unsupervised Learning

Faced with massive data, is it possible to trade off (statistical) risk,...
research
11/27/2017

One-Shot Coresets: The Case of k-Clustering

Scaling clustering algorithms to massive data sets is a challenging task...

Please sign up or login with your details

Forgot password? Click here to reset