LCCo: Lending CLIP to Co-Segmentation

08/22/2023
by   Xin Duan, et al.
0

This paper studies co-segmenting the common semantic object in a set of images. Existing works either rely on carefully engineered networks to mine the implicit semantic information in visual features or require extra data (i.e., classification labels) for training. In this paper, we leverage the contrastive language-image pre-training framework (CLIP) for the task. With a backbone segmentation network that independently processes each image from the set, we introduce semantics from CLIP into the backbone features, refining them in a coarse-to-fine manner with three key modules: i) an image set feature correspondence module, encoding global consistent semantic information of the image set; ii) a CLIP interaction module, using CLIP-mined common semantics of the image set to refine the backbone feature; iii) a CLIP regularization module, drawing CLIP towards this co-segmentation task, identifying the best CLIP semantic and using it to regularize the backbone feature. Experiments on four standard co-segmentation benchmark datasets show that the performance of our method outperforms state-of-the-art methods.

READ FULL TEXT

page 1

page 4

page 6

page 7

research
07/04/2022

Distilling Ensemble of Explanations for Weakly-Supervised Pre-Training of Image Segmentation Models

While fine-tuning pre-trained networks has become a popular way to train...
research
04/11/2022

Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation

This paper presents a novel framework to integrate both semantic and ins...
research
05/07/2019

Feature-Fused Context-Encoding Network for Neuroanatomy Segmentation

Automatic segmentation of fine-grained brain structures remains a challe...
research
05/03/2023

SimSC: A Simple Framework for Semantic Correspondence with Temperature Learning

We propose SimSC, a remarkably simple framework, to address the problem ...
research
01/05/2021

CycleSegNet: Object Co-segmentation with Cycle Refinement and Region Correspondence

Image co-segmentation is an active computer vision task which aims to se...
research
08/31/2023

Coarse-to-Fine Amodal Segmentation with Shape Prior

Amodal object segmentation is a challenging task that involves segmentin...
research
05/22/2023

Contextualising Implicit Representations for Semantic Tasks

Prior works have demonstrated that implicit representations trained only...

Please sign up or login with your details

Forgot password? Click here to reset