One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

02/15/2021
by   Ilia Sucholutsky, et al.
6

Increasingly large datasets are rapidly driving up the computational costs of machine learning. Prototype generation methods aim to create a small set of synthetic observations that accurately represent a training dataset but greatly reduce the computational cost of learning from it. Assigning soft labels to prototypes can allow increasingly small sets of prototypes to accurately represent the original training dataset. Although foundational work on `less than one'-shot learning has proven the theoretical plausibility of learning with fewer than one observation per class, developing practical algorithms for generating such prototypes remains an unexplored territory. We propose a novel, modular method for generating soft-label prototypical lines that still maintains representational accuracy even when there are fewer prototypes than the number of classes in the data. In addition, we propose the Hierarchical Soft-Label Prototype k-Nearest Neighbor classification algorithm based on these prototypical lines. We show that our method maintains high classification accuracy while greatly reducing the number of prototypes required to represent a dataset, even when working with severely imbalanced and difficult data. Our code is available at https://github.com/ilia10000/SLkNN.

READ FULL TEXT

page 2

page 5

page 6

research
09/17/2020

'Less Than One'-Shot Learning: Learning N Classes From M<N Samples

Deep neural networks require large training sets but suffer from high co...
research
02/06/2023

CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets

Open vocabulary models (e.g. CLIP) have shown strong performance on zero...
research
07/22/2022

Learning from Multiple Annotator Noisy Labels via Sample-wise Label Fusion

Data lies at the core of modern deep learning. The impressive performanc...
research
11/17/2020

Close Category Generalization

Out-of-distribution generalization is a core challenge in machine learni...
research
12/26/2020

Few Shot Learning With No Labels

Few-shot learners aim to recognize new categories given only a small num...
research
03/03/2020

Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications

Trained on large datasets, deep learning (DL) can accurately classify vi...
research
10/31/2020

Optimal 1-NN Prototypes for Pathological Geometries

Using prototype methods to reduce the size of training datasets can dras...

Please sign up or login with your details

Forgot password? Click here to reset