Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning

07/06/2023
by Jishnu Jaykumar P, et al.

We propose a novel framework for few-shot learning that leverages large-scale vision-language models such as CLIP. Motivated by unimodal prototypical networks for few-shot learning, we introduce Proto-CLIP, which uses both image prototypes and text prototypes. Specifically, Proto-CLIP jointly adapts the image encoder and text encoder of CLIP using few-shot examples. The two encoders are used to compute prototypes of image classes for classification. During adaptation, we propose aligning the image and text prototypes of corresponding classes. This alignment benefits few-shot classification because both types of prototypes contribute to the prediction. We demonstrate the effectiveness of our method through experiments on few-shot learning benchmark datasets and on real-world robot perception.
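The idea of classifying with two sets of prototypes can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: random vectors stand in for CLIP image and text embeddings, and the helper names (`image_prototypes`, `classify`, `alignment_loss`), the mixing weight `alpha`, the `temperature`, and the symmetric contrastive form of the alignment loss are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def image_prototypes(support_emb, support_labels, num_classes):
    """Image prototype of a class = mean embedding of its support images."""
    protos = np.stack([support_emb[support_labels == k].mean(axis=0)
                       for k in range(num_classes)])
    return l2_normalize(protos)

def classify(query_emb, img_protos, txt_protos, alpha=0.5, temperature=0.1):
    """Mix class probabilities from image and text prototypes.

    alpha and temperature are hypothetical hyperparameters for this sketch.
    """
    q = l2_normalize(query_emb)
    p_img = softmax(q @ l2_normalize(img_protos).T / temperature)
    p_txt = softmax(q @ l2_normalize(txt_protos).T / temperature)
    return alpha * p_img + (1.0 - alpha) * p_txt

def alignment_loss(img_protos, txt_protos, temperature=0.1):
    """Symmetric contrastive loss pulling matching image/text prototypes together.

    This particular loss form is an assumption, not taken from the abstract.
    """
    logits = l2_normalize(img_protos) @ l2_normalize(txt_protos).T / temperature
    idx = np.arange(logits.shape[0])
    loss_i2t = -np.log(softmax(logits, axis=1)[idx, idx]).mean()
    loss_t2i = -np.log(softmax(logits, axis=0)[idx, idx]).mean()
    return 0.5 * (loss_i2t + loss_t2i)

# Synthetic stand-ins for encoder outputs (no real CLIP model is used here).
rng = np.random.default_rng(0)
num_classes, shots, dim = 5, 4, 32
centers = rng.normal(size=(num_classes, dim))

support_labels = np.repeat(np.arange(num_classes), shots)
support_emb = centers[support_labels] + 0.1 * rng.normal(size=(num_classes * shots, dim))
txt_protos = l2_normalize(centers + 0.1 * rng.normal(size=centers.shape))

query_labels = np.repeat(np.arange(num_classes), 3)
query_emb = centers[query_labels] + 0.1 * rng.normal(size=(query_labels.size, dim))

img_protos = image_prototypes(support_emb, support_labels, num_classes)
probs = classify(query_emb, img_protos, txt_protos)
predictions = probs.argmax(axis=1)

print("accuracy:", (predictions == query_labels).mean())
print("alignment loss:", alignment_loss(img_protos, txt_protos))
```

In an actual adaptation loop, the embeddings would come from the CLIP encoders and the alignment term would be minimized together with the classification loss; the sketch only evaluates both quantities once on fixed vectors.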

