As one of the fundamental functions of autonomous driving system, freesp...
Panoptic Narrative Grounding (PNG) is an emerging task whose goal is to
...
Referring video object segmentation aims to predict foreground labels fo...
Lane detection is a challenging task that requires predicting complex
to...
Recently proposed fine-grained 3D visual grounding is an essential and
c...
Given a natural language expression and an image/video, the goal of refe...
Language-queried video actor segmentation aims to predict the pixel-leve...
Learning to capture dependencies between spatial positions is essential ...
Referring image segmentation aims to predict the foreground mask of the
...
Referring image segmentation aims at segmenting the foreground masks of ...