A Survey of Learning Curves with Bad Behavior: or How More Data Need Not Lead to Better Performance

11/25/2022
by   Marco Loog, et al.
0

Plotting a learner's generalization performance against the training set size results in a so-called learning curve. This tool, providing insight in the behavior of the learner, is also practically valuable for model selection, predicting the effect of more training data, and reducing the computational complexity of training. We set out to make the (ideal) learning curve concept precise and briefly discuss the aforementioned usages of such curves. The larger part of this survey's focus, however, is on learning curves that show that more data does not necessarily leads to better generalization performance. A result that seems surprising to many researchers in the field of artificial intelligence. We point out the significance of these findings and conclude our survey with an overview and discussion of open problems in this area that warrant further theoretical and empirical investigation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2021

The Shape of Learning Curves: a Review

Learning curves provide insight into the dependence of a learner's gener...
research
07/11/2019

Minimizers of the Empirical Risk and Risk Monotonicity

Plotting a learner's average performance against the number of training ...
research
04/05/2018

A Survey of Miss-Ratio Curve Construction Techniques

Miss-ratio curve (MRC), or equivalently hit-ratio curve (HRC), construct...
research
01/28/2022

Learning Curves for Decision Making in Supervised Machine Learning – A Survey

Learning curves are a concept from social sciences that has been adopted...
research
05/05/2019

Decision Making with Machine Learning and ROC Curves

The Receiver Operating Characteristic (ROC) curve is a representation of...
research
06/08/2022

Estimation of Predictive Performance in High-Dimensional Data Settings using Learning Curves

In high-dimensional prediction settings, it remains challenging to relia...
research
09/26/2020

Small Data, Big Decisions: Model Selection in the Small-Data Regime

Highly overparametrized neural networks can display curiously strong gen...

Please sign up or login with your details

Forgot password? Click here to reset