re-OBJ: Jointly Learning the Foreground and Background for Object Instance Re-identification

09/17/2019
by   Vaibhav Bansal, et al.
10

Conventional approaches to object instance re-identification rely on matching appearances of the target objects among a set of frames. However, learning appearances of the objects alone might fail when there are multiple objects with similar appearance or multiple instances of same object class present in the scene. This paper proposes that partial observations of the background can be utilized to aid in the object re-identification task for a rigid scene, especially a rigid environment with a lot of reoccurring identical models of objects. Using an extension to the Mask R-CNN architecture, we learn to encode the important and distinct information in the background jointly with the foreground relevant to rigid real-world scenarios such as an indoor environment where objects are static and the camera moves around the scene. We demonstrate the effectiveness of our joint visual feature in the re-identification of objects in the ScanNet dataset and show a relative improvement of around 28.25 in the rank-1 accuracy over the deepSort method.

READ FULL TEXT

page 2

page 9

page 10

research
06/10/2022

Object Instance Identification in Dynamic Environments

We study the problem of identifying object instances in a dynamic enviro...
research
11/28/2016

What Is Around The Camera?

How much does a single image reveal about the environment it was taken i...
research
02/20/2020

BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

We present BlockGAN, an image generative model that learns object-aware ...
research
04/22/2014

Fast Approximate Matching of Cell-Phone Videos for Robust Background Subtraction

We identify a novel instance of the background subtraction problem that ...
research
10/07/2022

Scene-level Tracking and Reconstruction without Object Priors

We present the first real-time system capable of tracking and reconstruc...
research
10/11/2022

AMICO: Amodal Instance Composition

Image composition aims to blend multiple objects to form a harmonized im...
research
02/05/2022

Catch Me if You Can: A Novel Task for Detection of Covert Geo-Locations (CGL)

Most visual scene understanding tasks in the field of computer vision in...

Please sign up or login with your details

Forgot password? Click here to reset