MetaFormer: A Unified Meta Framework for Fine-Grained Recognition

03/05/2022
by   Qishuai Diao, et al.
0

Fine-Grained Visual Classification(FGVC) is the task that requires recognizing the objects belonging to multiple subordinate categories of a super-category. Recent state-of-the-art methods usually design sophisticated learning pipelines to tackle this task. However, visual information alone is often not sufficient to accurately differentiate between fine-grained visual categories. Nowadays, the meta-information (e.g., spatio-temporal prior, attribute, and text description) usually appears along with the images. This inspires us to ask the question: Is it possible to use a unified and simple framework to utilize various meta-information to assist in fine-grained identification? To answer this problem, we explore a unified and strong meta-framework(MetaFormer) for fine-grained visual classification. In practice, MetaFormer provides a simple yet effective approach to address the joint learning of vision and various meta-information. Moreover, MetaFormer also provides a strong baseline for FGVC without bells and whistles. Extensive experiments demonstrate that MetaFormer can effectively use various meta-information to improve the performance of fine-grained recognition. In a fair comparison, MetaFormer can outperform the current SotA approaches with only vision information on the iNaturalist2017 and iNaturalist2018 datasets. Adding meta-information, MetaFormer can exceed the current SotA approaches by 5.9 on CUB-200-2011 and NABirds, which significantly outperforms the SotA approaches. The source code and pre-trained models are released athttps://github.com/dqshuai/MetaFormer.

READ FULL TEXT

page 4

page 8

research
07/24/2022

Explored An Effective Methodology for Fine-Grained Snake Recognition

Fine-Grained Visual Classification (FGVC) is a longstanding and fundamen...
research
03/16/2023

ELFIS: Expert Learning for Fine-grained Image Recognition Using Subsets

Fine-Grained Visual Recognition (FGVR) tackles the problem of distinguis...
research
06/12/2019

Presence-Only Geographical Priors for Fine-Grained Image Classification

Appearance information alone is often not sufficient to accurately diffe...
research
09/01/2023

Fine-grained Recognition with Learnable Semantic Data Augmentation

Fine-grained image recognition is a longstanding computer vision challen...
research
09/19/2023

Latent Space Energy-based Model for Fine-grained Open Set Recognition

Fine-grained open-set recognition (FineOSR) aims to recognize images bel...
research
08/31/2018

Hierarchical CVAE for Fine-Grained Hate Speech Classification

Existing work on automated hate speech detection typically focuses on bi...
research
03/23/2019

V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation

We propose V2CNet, a new deep learning framework to automatically transl...

Please sign up or login with your details

Forgot password? Click here to reset