AdaSelection: Accelerating Deep Learning Training through Data Subsampling

06/19/2023
by   Minghe Zhang, et al.
0

In this paper, we introduce AdaSelection, an adaptive sub-sampling method to identify the most informative sub-samples within each minibatch to speed up the training of large-scale deep learning models without sacrificing model performance. Our method is able to flexibly combines an arbitrary number of baseline sub-sampling methods incorporating the method-level importance and intra-method sample-level importance at each iteration. The standard practice of ad-hoc sampling often leads to continuous training with vast amounts of data from production environments. To improve the selection of data instances during forward and backward passes, we propose recording a constant amount of information per instance from these passes. We demonstrate the effectiveness of our method by testing it across various types of inputs and tasks, including the classification tasks on both image and language datasets, as well as regression tasks. Compared with industry-standard baselines, AdaSelection consistently displays superior performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2021

One Backward from Ten Forward, Subsampling for Large-Scale Deep Learning

Deep learning models in large-scale machine learning systems are often c...
research
06/14/2016

DCNNs on a Diet: Sampling Strategies for Reducing the Training Set Size

Large-scale supervised classification algorithms, especially those based...
research
04/30/2019

Test Selection for Deep Learning Systems

Testing of deep learning models is challenging due to the excessive numb...
research
11/23/2018

On the Importance of Strong Baselines in Bayesian Deep Learning

Like all sub-fields of machine learning, Bayesian Deep Learning is drive...
research
10/11/2022

Improving Sample Efficiency of Deep Learning Models in Electricity Market

The superior performance of deep learning relies heavily on a large coll...
research
08/30/2018

Nested multi-instance classification

There are classification tasks that take as inputs groups of images rath...

Please sign up or login with your details

Forgot password? Click here to reset