Semantic Contrastive Bootstrapping for Single-positive Multi-label Recognition

07/15/2023
by   Cheng Chen, et al.
0

Learning multi-label image recognition with incomplete annotation is gaining popularity due to its superior performance and significant labor savings when compared to training with fully labeled datasets. Existing literature mainly focuses on label completion and co-occurrence learning while facing difficulties with the most common single-positive label manner. To tackle this problem, we present a semantic contrastive bootstrapping (Scob) approach to gradually recover the cross-object relationships by introducing class activation as semantic guidance. With this learning guidance, we then propose a recurrent semantic masked transformer to extract iconic object-level representations and delve into the contrastive learning problems on multi-label classification tasks. We further propose a bootstrapping framework in an Expectation-Maximization fashion that iteratively optimizes the network parameters and refines semantic guidance to alleviate possible disturbance caused by wrong semantic guidance. Extensive experimental results demonstrate that the proposed joint learning framework surpasses the state-of-the-art models by a large margin on four public multi-label image recognition benchmarks. Codes can be found at https://github.com/iCVTEAM/Scob.

READ FULL TEXT

page 2

page 5

page 6

page 10

page 12

page 13

page 15

research
07/24/2021

Multi-Label Image Classification with Contrastive Learning

Recently, as an effective way of learning latent representations, contra...
research
08/20/2019

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition

Recognizing multiple labels of images is a practical and challenging tas...
research
12/21/2021

Structured Semantic Transfer for Multi-Label Recognition with Partial Labels

Multi-label image recognition is a fundamental yet practical task becaus...
research
11/23/2022

Texts as Images in Prompt Tuning for Multi-Label Image Recognition

Prompt tuning has been employed as an efficient way to adapt large visio...
research
03/10/2022

AGCN: Augmented Graph Convolutional Network for Lifelong Multi-label Image Recognition

The Lifelong Multi-Label (LML) image recognition builds an online class-...
research
10/10/2021

Transformer-based Dual Relation Graph for Multi-label Image Recognition

The simultaneous recognition of multiple objects in one image remains a ...
research
03/24/2017

Improving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition

This work deviates from easy-to-define class boundaries for object inter...

Please sign up or login with your details

Forgot password? Click here to reset