A Meta-Learning Approach to Predicting Performance and Data Requirements

03/02/2023
by   Achin Jain, et al.
0

We propose an approach to estimate the number of samples required for a model to reach a target performance. We find that the power law, the de facto principle to estimate model performance, leads to large error when using a small dataset (e.g., 5 samples per class) for extrapolation. This is because the log-performance error against the log-dataset size follows a nonlinear progression in the few-shot regime followed by a linear progression in the high-shot regime. We introduce a novel piecewise power law (PPL) that handles the two data regimes differently. To estimate the parameters of the PPL, we introduce a random forest regressor trained via meta learning that generalizes across classification/detection tasks, ResNet/ViT based architectures, and random/pre-trained initializations. The PPL improves the performance estimation on average by 37 datasets, compared to the power law. We further extend the PPL to provide a confidence bound and use it to limit the prediction horizon that reduces over-estimation of data by 76

READ FULL TEXT

page 14

page 15

research
05/18/2021

Sample Efficient Linear Meta-Learning by Alternating Minimization

Meta-learning synthesizes and leverages the knowledge from a given set o...
research
09/13/2019

Meta-Learning for Few-Shot Time Series Classification

Deep neural networks (DNNs) have achieved state-of-the-art results on ti...
research
03/08/2023

Meta-learning Control Variates: Variance Reduction with Limited Data

Control variates can be a powerful tool to reduce the variance of Monte ...
research
07/08/2020

Meta-Learning One-Class Classification with DeepSets: Application in the Milky Way

We explore in this paper the use of neural networks designed for point-c...
research
06/24/2023

Is Pre-training Truly Better Than Meta-Learning?

In the context of few-shot learning, it is currently believed that a fix...
research
07/04/2022

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

Given a small training data set and a learning algorithm, how much more ...
research
08/22/2022

MetaRF: Differentiable Random Forest for Reaction Yield Prediction with a Few Trails

Artificial intelligence has deeply revolutionized the field of medicinal...

Please sign up or login with your details

Forgot password? Click here to reset