ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion

06/26/2023
by   Yingjun Du, et al.
0

Prototype-based meta-learning has emerged as a powerful technique for addressing few-shot learning challenges. However, estimating a deterministic prototype using a simple average function from a limited number of examples remains a fragile process. To overcome this limitation, we introduce ProtoDiff, a novel framework that leverages a task-guided diffusion model during the meta-training phase to gradually generate prototypes, thereby providing efficient class representations. Specifically, a set of prototypes is optimized to achieve per-task prototype overfitting, enabling accurately obtaining the overfitted prototypes for individual tasks. Furthermore, we introduce a task-guided diffusion process within the prototype space, enabling the meta-learning of a generative process that transitions from a vanilla prototype to an overfitted prototype. ProtoDiff gradually generates task-specific prototypes from random noise during the meta-test stage, conditioned on the limited samples available for the new task. Furthermore, to expedite training and enhance ProtoDiff's performance, we propose the utilization of residual prototype learning, which leverages the sparsity of the residual prototype. We conduct thorough ablation studies to demonstrate its ability to accurately capture the underlying prototype distribution and enhance generalization. The new state-of-the-art performance on within-domain, cross-domain, and few-task few-shot classification further substantiates the benefit of ProtoDiff.

READ FULL TEXT

page 4

page 7

page 17

research
09/04/2023

Adaptive Parametric Prototype Learning for Cross-Domain Few-Shot Classification

Cross-domain few-shot classification induces a much more challenging pro...
research
12/15/2021

Hierarchical Variational Memory for Few-shot Learning Across Domains

Neural memory enables fast adaptation to new tasks with just a few train...
research
05/03/2023

Generative Meta-Learning for Zero-Shot Relation Triplet Extraction

The zero-shot relation triplet extraction (ZeroRTE) task aims to extract...
research
09/11/2019

Learning to Propagate for Graph Meta-Learning

Meta-learning extracts the common knowledge acquired from learning diffe...
research
04/02/2022

AutoProtoNet: Interpretability for Prototypical Networks

In meta-learning approaches, it is difficult for a practitioner to make ...
research
01/29/2019

Hyperspherical Prototype Networks

This paper introduces hyperspherical prototype networks, which unify reg...
research
08/11/2020

Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling

Visual Storytelling (VIST) is a task to tell a narrative story about a c...

Please sign up or login with your details

Forgot password? Click here to reset