Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions

09/15/2023
by   Tianxu Wu, et al.
0

The challenge in fine-grained visual categorization lies in how to explore the subtle differences between different subclasses and achieve accurate discrimination. Previous research has relied on large-scale annotated data and pre-trained deep models to achieve the objective. However, when only a limited amount of samples is available, similar methods may become less effective. Diffusion models have been widely adopted in data augmentation due to their outstanding diversity in data generation. However, the high level of detail required for fine-grained images makes it challenging for existing methods to be directly employed. To address this issue, we propose a novel approach termed the detail reinforcement diffusion model (DRDM), which leverages the rich knowledge of large models for fine-grained data augmentation and comprises two key components including discriminative semantic recombination (DSR) and spatial knowledge reference (SKR). Specifically, DSR is designed to extract implicit similarity relationships from the labels and reconstruct the semantic mapping between labels and instances, which enables better discrimination of subtle differences between different subclasses. Furthermore, we introduce the SKR module, which incorporates the distributions of different datasets as references in the feature space. This allows the SKR to aggregate the high-dimensional distribution of subclass features in few-shot FGVC tasks, thus expanding the decision boundary. Through these two critical components, we effectively utilize the knowledge from large models to address the issue of data scarcity, resulting in improved performance for fine-grained visual recognition tasks. Extensive experiments demonstrate the consistent performance gain offered by our DRDM.

READ FULL TEXT

page 1

page 4

page 7

page 8

page 10

research
10/06/2020

Domain Adaptive Transfer Learning on Visual Attention Aware Data Augmentation for Fine-grained Visual Categorization

Fine-Grained Visual Categorization (FGVC) is a challenging topic in comp...
research
06/08/2023

Coping with Change: Learning Invariant and Minimum Sufficient Representations for Fine-Grained Visual Categorization

Fine-grained visual categorization (FGVC) is a challenging task due to s...
research
04/06/2020

Attribute Mix: Semantic Data Augmentation for Fine Grained Recognition

Collecting fine-grained labels usually requires expert-level domain know...
research
06/14/2020

FenceMask: A Data Augmentation Approach for Pre-extracted Image Features

We propose a novel data augmentation method named 'FenceMask' that exhib...
research
03/03/2023

Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems

Self-supervised learning (SSL) strategies have demonstrated remarkable p...
research
04/21/2022

R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Fine-grained visual categorization (FGVC) aims to discriminate similar s...
research
02/17/2022

On Guiding Visual Attention with Language Specification

While real world challenges typically define visual categories with lang...

Please sign up or login with your details

Forgot password? Click here to reset