Competing with the Empirical Risk Minimizer in a Single Pass

12/20/2014
by   Roy Frostig, et al.
0

In many estimation problems, e.g. linear and logistic regression, we wish to minimize an unknown objective given only unbiased samples of the objective function. Furthermore, we aim to achieve this using as few samples as possible. In the absence of computational constraints, the minimizer of a sample average of observed data -- commonly referred to as either the empirical risk minimizer (ERM) or the M-estimator -- is widely regarded as the estimation strategy of choice due to its desirable statistical convergence properties. Our goal in this work is to perform as well as the ERM, on every problem, while minimizing the use of computational resources such as running time and space usage. We provide a simple streaming algorithm which, under standard regularity assumptions on the underlying problem, enjoys the following properties: * The algorithm can be implemented in linear time with a single pass of the observed data, using space linear in the size of a single sample. * The algorithm achieves the same statistical rate of convergence as the empirical risk minimizer on every problem, even considering constant factors. * The algorithm's performance depends on the initial error at a rate that decreases super-polynomially. * The algorithm is easily parallelizable. Moreover, we quantify the (finite-sample) rate at which the algorithm becomes competitive with the ERM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2019

Nonregular and Minimax Estimation of Individualized Thresholds in High Dimension with Binary Responses

Given a large number of covariates Z, we consider the estimation of a hi...
research
03/03/2023

Rate adaptive estimation of the center of a symmetric distribution

Given univariate random variables Y_1, …, Y_n with the Uniform(θ_0 - 1, ...
research
08/14/2019

Least Squares Approximation for a Distributed System

In this work we develop a distributed least squares approximation (DLSA)...
research
02/21/2017

Stochastic Canonical Correlation Analysis

We tightly analyze the sample complexity of CCA, provide a learning algo...
research
10/09/2015

Conditional Risk Minimization for Stochastic Processes

We study the task of learning from non-i.i.d. data. In particular, we ai...
research
10/23/2016

Online Classification with Complex Metrics

We present a framework and analysis of consistent binary classification ...

Please sign up or login with your details

Forgot password? Click here to reset