Advancing Referring Expression Segmentation Beyond Single Image

05/21/2023
by Yixuan Wu, et al.

Referring Expression Segmentation (RES) is a widely explored multi-modal task that aims to segment an object, assumed to be present, in a single image given a linguistic expression. In broader real-world scenarios, however, it is not always possible to know in advance whether the described object exists in a specific image. Typically, we have a collection of images, only some of which may contain the described object. The current RES setting limits its practicality in such situations. To overcome this limitation, we propose a more realistic and general setting, named Group-wise Referring Expression Segmentation (GRES), which expands RES to a collection of related images and allows the described objects to be present in only a subset of the input images. To support this new setting, we introduce a carefully compiled dataset named Grouped Referring Dataset (GRD), containing complete group-wise annotations of the target objects described by the given expressions. We also present a baseline method named Grouped Referring Segmenter (GRSer), which explicitly captures language-vision and intra-group vision-vision interactions to achieve state-of-the-art results on the proposed GRES task and on related tasks such as Co-Salient Object Detection and RES. Our dataset and code will be publicly released at https://github.com/yixuan730/group-res.
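To make the difference between RES and GRES concrete, the following minimal Python sketch models one group-wise sample: a single expression, a group of related images, and a ground-truth mask that may be absent for some images. All names here (`GroupSample`, `score_group`, `iou`) are hypothetical and do not come from GRD or GRSer. The scoring rule, which credits an empty prediction on an image without the target, is an illustrative assumption and not the paper's evaluation protocol.

```python
from dataclasses import dataclass
from typing import List, Optional

# A binary mask as nested lists; 1 marks a pixel of the described object.
Mask = List[List[int]]


@dataclass
class GroupSample:
    """Hypothetical GRES sample: one expression over a group of related images."""
    expression: str
    image_ids: List[str]
    # One entry per image; None means the described object is absent there.
    gt_masks: List[Optional[Mask]]


def is_empty(mask: Mask) -> bool:
    """True if the mask contains no foreground pixel."""
    return not any(any(row) for row in mask)


def iou(pred: Mask, gt: Mask) -> float:
    """Intersection over union of two binary masks of equal shape."""
    inter = sum(p & g for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    union = sum(p | g for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    return inter / union if union else 1.0


def score_group(sample: GroupSample, preds: List[Mask]) -> List[float]:
    """Per-image scores (assumed rule, not the paper's metric).

    Images that contain the target are scored by IoU; images that do not
    are scored 1.0 only if the prediction is empty, else 0.0.
    """
    scores = []
    for pred, gt in zip(preds, sample.gt_masks):
        if gt is None:
            scores.append(1.0 if is_empty(pred) else 0.0)
        else:
            scores.append(iou(pred, gt))
    return scores
```

In the classic RES setting every entry of `gt_masks` would be a mask and the group would have length one. The `None` entries are what the group-wise setting adds: a model must also decide which images in the group do not contain the described object.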


research
07/24/2023

Exposing the Troublemakers in Described Object Detection

Detecting objects based on language descriptions is a popular task that ...
research
12/27/2022

Position-Aware Contrastive Alignment for Referring Image Segmentation

Referring image segmentation aims to segment the target object described...
research
03/12/2022

Differentiated Relevances Embedding for Group-based Referring Expression Comprehension

Referring expression comprehension (REC) aims to locate a certain object...
research
11/07/2015

Generation and Comprehension of Unambiguous Object Descriptions

We propose a method that can generate an unambiguous description (known ...
research
08/26/2023

Beyond One-to-One: Rethinking the Referring Image Segmentation

Referring image segmentation aims to segment the target object referred ...
research
06/30/2018

Improved Techniques for Learning to Dehaze and Beyond: A Collective Study

This paper reviews the collective endeavors by the team of authors in ex...
research
04/26/2023

Learnable Ophthalmology SAM

Segmentation is vital for ophthalmology image analysis. But its various ...
