Persistent Mixture Model Networks for Few-Shot Image Classification

by   Arman Afrasiyabi, et al.

We introduce Persistent Mixture Model (PMM) networks for representation learning in the few-shot image classification context. While previous methods represent classes with a single centroid or rely on post hoc clustering methods, our method learns a mixture model for each base class jointly with the data representation in an end-to-end manner. The PMM training algorithm is organized into two main stages: 1) initial training and 2) progressive following. First, the initial estimate for multi-component mixtures is learned for each class in the base domain using a combination of two loss functions (competitive and collaborative). The resulting network is then progressively refined through a leader-follower learning procedure, which uses the current estimate of the learner as a fixed "target" network. This target network is used to make a consistent assignment of instances to mixture components, in order to increase performance while stabilizing the training. The effectiveness of our joint representation/mixture learning approach is demonstrated with extensive experiments on four standard datasets and four backbones. In particular, we demonstrate that when we combine our robust representation with recent alignment- and margin-based approaches, we achieve new state-of-the-art results in the inductive setting, with an absolute accuracy for 5-shot classification of 82.45 60.70


page 1

page 2

page 3

page 4


Associative Alignment for Few-shot Image Classification

Few-shot image classification aims at training a model by using only a f...

von Mises-Fisher Mixture Model-based Deep learning: Application to Face Verification

A number of pattern recognition tasks, e.g., face verification, can be b...

A LASSO-Penalized BIC for Mixture Model Selection

The efficacy of family-based approaches to mixture model-based clusterin...

Adaptive Prototypical Networks with Label Words and Joint Representation Learning for Few-Shot Relation Classification

Relation classification (RC) task is one of fundamental tasks of informa...

Matching Feature Sets for Few-Shot Image Classification

In image classification, it is common practice to train deep networks to...

Prototypical Clustering Networks for Dermatological Disease Diagnosis

We consider the problem of image classification for the purpose of aidin...

Generalized Zero and Few-Shot Transfer for Facial Forgery Detection

We propose Deep Distribution Transfer(DDT), a new transfer learning appr...

Please sign up or login with your details

Forgot password? Click here to reset