STORM: Foundations of End-to-End Empirical Risk Minimization on the Edge

06/25/2020
by   Benjamin Coleman, et al.

Empirical risk minimization is perhaps the most influential idea in statistical learning, with applications to nearly all scientific and technical domains in the form of regression and classification models. To analyze massive streaming datasets in distributed computing environments, practitioners increasingly prefer to deploy regression models on the edge rather than in the cloud. Keeping data on edge devices reduces the energy cost, communication cost, and data security risk associated with the model. Although it is equally advantageous to train models at the edge, a common assumption is that the model was originally trained in the cloud, since training typically requires substantial computation and memory. To enable training at the edge, we propose STORM, an online sketch for empirical risk minimization. STORM compresses a data stream into a tiny array of integer counters. This sketch is sufficient to estimate a variety of surrogate losses over the original dataset. We provide rigorous theoretical analysis and show that STORM can estimate a carefully chosen surrogate loss for the least-squares objective. In an exhaustive experimental comparison for linear regression models on real-world datasets, we find that STORM allows accurate regression models to be trained.
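The abstract does not say how the counters are indexed or how a loss is read back out of them. As a rough illustration only, the sketch below assumes one plausible design: each row of counters is indexed by a locality-sensitive hash built from signed random projections, an insert increments one counter per row, and a query returns the average fraction of stream elements that landed in the same bucket as the query vector. The class name `CounterSketch`, its parameters (`rows`, `bits`, `seed`), and the choice of hash are all hypothetical and are not taken from the paper.

```python
import random


class CounterSketch:
    """Toy array of integer counters indexed by a random-hyperplane hash.

    Illustrative assumption only; not the construction used by STORM.
    """

    def __init__(self, dim, rows=50, bits=4, seed=0):
        rng = random.Random(seed)
        self.rows = rows
        self.bits = bits
        # Each row owns `bits` random hyperplanes of dimension `dim`.
        self.planes = [
            [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(bits)]
            for _ in range(rows)
        ]
        # rows x 2^bits integer counters; size is independent of stream length.
        self.counts = [[0] * (1 << bits) for _ in range(rows)]
        self.n = 0  # number of stream elements seen so far

    def _bucket(self, row, v):
        # Concatenate the sign bits of the projections into a bucket index.
        idx = 0
        for plane in self.planes[row]:
            dot = sum(p * x for p, x in zip(plane, v))
            idx = (idx << 1) | (1 if dot >= 0 else 0)
        return idx

    def insert(self, v):
        # One counter increment per row; the raw vector is then discarded.
        for r in range(self.rows):
            self.counts[r][self._bucket(r, v)] += 1
        self.n += 1

    def query(self, q):
        # Average fraction of inserted elements sharing a bucket with q.
        total = sum(self.counts[r][self._bucket(r, q)] for r in range(self.rows))
        return total / (self.rows * self.n)


# Usage: stream a few vectors through the sketch, then query it.
sketch = CounterSketch(dim=3, rows=20, bits=3, seed=1)
for point in ([1.0, 2.0, 3.0], [1.1, 2.1, 2.9], [0.9, 1.8, 3.2]):
    sketch.insert(point)
print(sketch.query([1.0, 2.0, 3.0]))
```

In this toy version the memory footprint is `rows * 2**bits` integers regardless of how many elements are streamed, which is the property the abstract emphasizes. Turning the collision frequency returned by `query` into a surrogate regression loss would require a further design step that is not specified here.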


