Mutual-Information Based Few-Shot Classification

06/23/2021
by   Malik Boudiaf, et al.
0

We introduce Transductive Infomation Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. We motivate our transductive loss by deriving a formal relation between the classification accuracy and mutual-information maximization. Furthermore, we propose a new alternating-direction solver, which substantially speeds up transductive inference over gradient-based optimization, while yielding competitive accuracy. We also provide a convergence analysis of our solver based on Zangwill's theory and bound-optimization arguments. TIM inference is modular: it can be used on top of any base-training feature extractor. Following standard transductive few-shot settings, our comprehensive experiments demonstrate that TIM outperforms state-of-the-art methods significantly across various datasets and networks, while used on top of a fixed feature extractor trained with simple cross-entropy on the base classes, without resorting to complex meta-learning schemes. It consistently brings between 2 improvement in accuracy over the best performing method, not only on all the well-established few-shot benchmarks but also on more challenging scenarios, with random tasks, domain shift and larger numbers of classes, as in the recently introduced META-DATASET. Our code is publicly available at https://github.com/mboudiaf/TIM. We also publicly release a standalone PyTorch implementation of META-DATASET, along with additional benchmarking results, at https://github.com/mboudiaf/pytorch-meta-dataset.

READ FULL TEXT

page 4

page 18

research
08/25/2020

Transductive Information Maximization For Few-Shot Learning

We introduce Transductive Infomation Maximization (TIM) for few-shot lea...
research
04/24/2022

Realistic Evaluation of Transductive Few-Shot Learning

Transductive inference is widely used in few-shot learning, as it levera...
research
12/01/2022

Mutual Information-based Generalized Category Discovery

We introduce an information-maximization approach for the Generalized Ca...
research
11/25/2022

A Strong Baseline for Generalized Few-Shot Semantic Segmentation

This paper introduces a generalized few-shot segmentation framework with...
research
10/26/2022

Towards Practical Few-Shot Query Sets: Transductive Minimum Description Length Inference

Standard few-shot benchmarks are often built upon simplifying assumption...
research
12/11/2020

Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?

Few-shot segmentation has recently attracted substantial interest, with ...
research
06/15/2020

Dissimilarity Mixture Autoencoder for Deep Clustering

In this paper, we introduce the Dissimilarity Mixture Autoencoder (DMAE)...

Please sign up or login with your details

Forgot password? Click here to reset