Sever: A Robust Meta-Algorithm for Stochastic Optimization

03/07/2018
by   Ilias Diakonikolas, et al.
0

In high dimensions, most machine learning methods are brittle to even a small fraction of structured outliers. To address this, we introduce a new meta-algorithm that can take in a base learner such as least squares or stochastic gradient descent, and harden the learner to be resistant to outliers. Our method, Sever, possesses strong theoretical guarantees yet is also highly scalable -- beyond running the base learner itself, it only requires computing the top singular vector of a certain n × d matrix. We apply Sever on a drug design dataset and a spam classification dataset, and find that in both cases it has substantially greater robustness than several baselines. On the spam dataset, with 1% corruptions, we achieved 7.4% test error, compared to 13.4%-20.5% for the baselines, and 3% error on the uncorrupted dataset. Similarly, on the drug design dataset, with 10% corruptions, we achieved 1.42 mean-squared error test error, compared to 1.51-2.33 for the baselines, and 1.23 error on the uncorrupted dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2021

Robust Regression via Model Based Methods

The mean squared error loss is widely used in many applications, includi...
research
04/10/2021

SGD Implicitly Regularizes Generalization Error

We derive a simple and model-independent formula for the change in the g...
research
03/15/2021

Evolving parametrized Loss for Image Classification Learning on Small Datasets

This paper proposes a meta-learning approach to evolving a parametrized ...
research
08/25/2020

Solving Stochastic Compositional Optimization is Nearly as Easy as Solving Stochastic Optimization

Stochastic compositional optimization generalizes classic (non-compositi...
research
07/18/2020

MTL2L: A Context Aware Neural Optimiser

Learning to learn (L2L) trains a meta-learner to assist the learning of ...
research
03/20/2021

Properties of point forecast reconciliation approaches

Point forecast reconciliation of collection of time series with linear a...
research
12/06/2021

Diagnostic Assessment Generation via Combinatorial Search

Initial assessment tests are crucial in capturing learner knowledge stat...

Please sign up or login with your details

Forgot password? Click here to reset