Will Multi-modal Data Improves Few-shot Learning?

07/25/2021
by   Zilun Zhang, et al.

Most few-shot learning models use only one modality of data. We investigate, qualitatively and quantitatively, how much a model improves when an extra modality (i.e., a text description of the image) is added, and how the extra modality affects the learning procedure. To this end, we propose four fusion methods that combine the image feature and the text feature. To verify the improvement, we test the fusion methods with two classical few-shot learning models, ProtoNet and MAML, using image feature extractors such as ConvNet and ResNet12. The attention-based fusion method works best, improving classification accuracy by a large margin of around 30 compared with the baseline result.
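
The abstract does not describe the four fusion methods in detail, so the following is only a minimal sketch of what an attention-style fusion feeding a ProtoNet-style classifier could look like. It is not the authors' implementation. The names `attention_fuse`, `attn_vector`, `prototype`, and `classify` are hypothetical, and the sketch assumes that the image and text features have already been projected to the same dimension.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def attention_fuse(image_feat, text_feat, attn_vector):
    """Fuse two same-sized feature vectors with attention weights.

    `attn_vector` is a hypothetical learned vector (an assumption, not from
    the paper). Each modality is scored by its dot product with it, and the
    softmax of the two scores gives the mixing weights.
    """
    weights = softmax([dot(attn_vector, image_feat), dot(attn_vector, text_feat)])
    fused = [weights[0] * i + weights[1] * t for i, t in zip(image_feat, text_feat)]
    return fused, weights


def prototype(features):
    """ProtoNet-style class prototype: the mean of the support features."""
    n = len(features)
    return [sum(col) / n for col in zip(*features)]


def classify(query_feat, prototypes):
    """Return the label of the nearest prototype (squared Euclidean distance)."""
    def sqdist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))
    return min(prototypes, key=lambda label: sqdist(query_feat, prototypes[label]))
```

In this sketch, each support and query example would be fused first, the fused support features of a class would be averaged by `prototype`, and a fused query would be assigned by `classify`. In a real model the attention vector would be trained jointly with the feature extractors rather than fixed by hand.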

