A Survey of Learning on Small Data

07/29/2022
by   Xiaofeng Cao, et al.
0

Learning on big data brings success for artificial intelligence (AI), but the annotation and training costs are expensive. In future, learning on small data is one of the ultimate purposes of AI, which requires machines to recognize objectives and scenarios relying on small data as humans. A series of machine learning models is going on this way such as active learning, few-shot learning, deep clustering. However, there are few theoretical guarantees for their generalization performance. Moreover, most of their settings are passive, that is, the label distribution is explicitly controlled by one specified sampling scenario. This survey follows the agnostic active sampling under a PAC (Probably Approximately Correct) framework to analyze the generalization error and label complexity of learning on small data using a supervised and unsupervised fashion. With these theoretical analyses, we categorize the small data learning models from two geometric perspectives: the Euclidean and non-Euclidean (hyperbolic) mean representation, where their optimization solutions are also presented and discussed. Later, some potential learning scenarios that may benefit from small data learning are then summarized, and their potential learning scenarios are also analyzed. Finally, some challenging applications such as computer vision, natural language processing that may benefit from learning on small data are also surveyed.

READ FULL TEXT
research
11/27/2022

Deep Active Learning for Computer Vision: Past and Future

As an important data selection schema, active learning emerges as the es...
research
07/22/2019

Less (Data) Is More: Why Small Data Holds the Key to the Future of Artificial Intelligence

The claims that big data holds the key to enterprise successes and that ...
research
04/02/2021

A Survey on Semi-parametric Machine Learning Technique for Time Series Forecasting

Artificial Intelligence (AI) has recently shown its capabilities for alm...
research
03/06/2023

Artificial Intelligence: 70 Years Down the Road

Artificial intelligence (AI) has a history of nearly a century from its ...
research
06/30/2022

Data-Efficient Learning via Minimizing Hyperspherical Energy

Deep learning on large-scale data is dominant nowadays. The unprecedente...
research
05/26/2022

Deep Active Learning with Noise Stability

Uncertainty estimation for unlabeled data is crucial to active learning....

Please sign up or login with your details

Forgot password? Click here to reset