Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge

11/15/2021
by   Jiyang Qi, et al.

Although deep learning methods have achieved strong video object recognition performance in recent years, perceiving heavily occluded objects in a video remains a very challenging task. To promote the development of occlusion understanding, we collect a large-scale dataset called OVIS for video instance segmentation in occluded scenarios. OVIS consists of 296k high-quality instance masks and 901 occluded scenes. While the human vision system can perceive occluded objects through contextual reasoning and association, our experiments suggest that current video understanding systems cannot. On the OVIS dataset, all baseline methods suffer a significant performance degradation of about 80%, which demonstrates that there is still a long way to go in understanding obscured objects and videos in complex real-world scenarios. To facilitate research on new paradigms for video understanding systems, we launched a challenge based on the OVIS dataset. The top-performing submitted algorithms achieved much higher performance than our baselines. In this paper, we introduce the OVIS dataset and further dissect it by analyzing the results of the baselines and the submitted methods. The OVIS dataset and challenge information can be found at http://songbai.site/ovis .


research
02/02/2021

Occluded Video Instance Segmentation

Can our video understanding systems perceive objects when a heavy occlus...
research
11/17/2020

SeekNet: Improved Human Instance Segmentation via Reinforcement Learning Based Optimized Robot Relocation

Amodal recognition is the ability of the system to detect occluded objec...
research
08/17/2022

DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations

With the recent development of Deep Learning applied to Computer Vision,...
research
08/16/2021

Real-time Human-Centric Segmentation for Complex Video Scenes

Most existing video tasks related to "human" focus on the segmentation o...
research
03/23/2020

Learning Object Permanence from Video

Object Permanence allows people to reason about the location of non-visi...
research
01/20/2023

Occlusion Reasoning for Skeleton Extraction of Self-Occluded Tree Canopies

In this work, we present a method to extract the skeleton of a self-occl...
research
07/29/2020

MessyTable: Instance Association in Multiple Camera Views

We present an interesting and challenging dataset that features a large ...
