Data splitting improves statistical performance in overparametrized regimes

10/21/2021
by Nicole Mücke, et al.

While large training datasets generally improve model performance, training on them becomes computationally expensive and time-consuming. Distributed learning is a common strategy for reducing the overall training time by exploiting multiple computing devices. Recently, it has been observed in the single-machine setting that overparametrization is essential for benign overfitting in ridgeless regression in Hilbert spaces. We show that in this regime, data splitting has a regularizing effect, hence improving statistical performance and computational complexity at the same time. We further provide a unified framework that allows us to analyze both the finite- and infinite-dimensional settings. We numerically demonstrate the effect of different model parameters.
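
To make the setting concrete, the following is a minimal sketch of split-and-average ridgeless regression in finite dimensions. It is an illustration under stated assumptions, not the authors' implementation: the function names `min_norm_fit` and `split_and_average`, the Gaussian synthetic data, the noise level, and the choice of four splits are all hypothetical. Each local estimator is the minimum-norm least-squares solution on its subset, and the final estimator is the plain average of the local coefficient vectors.

```python
import numpy as np

def min_norm_fit(X, y):
    # Ridgeless (minimum-norm) least-squares solution via the pseudoinverse.
    # When there are more features than samples, this interpolates the data.
    return np.linalg.pinv(X) @ y

def split_and_average(X, y, num_splits):
    # Partition the sample indices into disjoint subsets, fit a ridgeless
    # estimator on each subset, and average the coefficient vectors.
    parts = np.array_split(np.arange(X.shape[0]), num_splits)
    local_fits = [min_norm_fit(X[idx], y[idx]) for idx in parts]
    return np.mean(local_fits, axis=0)

# Synthetic data (assumed for illustration): n samples, d > n features.
rng = np.random.default_rng(0)
n, d = 200, 400
w_true = rng.normal(size=d) / np.sqrt(d)
X = rng.normal(size=(n, d))
y = X @ w_true + 0.5 * rng.normal(size=n)

w_single = split_and_average(X, y, 1)  # single-machine ridgeless estimator
w_split = split_and_average(X, y, 4)   # average over 4 disjoint subsets

print("norm, single machine:", np.linalg.norm(w_single))
print("norm, 4 splits:      ", np.linalg.norm(w_split))
```

In this sketch the averaged estimator never has a larger Euclidean norm than the single-machine interpolant, because each local minimum-norm solution is at most as large as the global one and averaging cannot increase the norm. This is one simple sense in which splitting shrinks the estimator; the statistical guarantees themselves are the subject of the paper and are not reproduced here.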


