Improving Visual Relation Detection using Depth Maps

05/02/2019
by   Sahand Sharifzadeh, et al.
0

State of the art visual relation detection methods have been relying on features extracted from RGB images including objects' 2D positions. In this paper, we argue that the 3D positions of objects in space can provide additional valuable information about object relations. This information helps not only to detect spatial relations, such as "standing behind", but also non-spatial relations, such as "holding". Since 3D information of a scene is not easily accessible, we propose incorporating a pre-trained RGB-to-Depth model within visual relation detection frameworks. We discuss different feature extraction strategies from depth maps and show their critical role in relation detection. Our experiments confirm that the performance of state-of-the-art visual relation detection approaches can significantly be improved by utilizing depth map information.

READ FULL TEXT

page 2

page 4

page 8

research
02/27/2017

Visual Translation Embedding Network for Visual Relation Detection

Visual relations, such as "person ride bike" and "bike next to car", off...
research
07/29/2017

Weakly-supervised learning of visual relations

This paper introduces a novel approach for modeling visual relations bet...
research
01/12/2021

Predicting Relative Depth between Objects from Semantic Features

Vision and language tasks such as Visual Relation Detection and Visual Q...
research
08/23/2022

DepthFake: a depth-based strategy for detecting Deepfake videos

Fake content has grown at an incredible rate over the past few years. Th...
research
05/10/2019

Support Relation Analysis for Objects in Multiple View RGB-D Images

Understanding physical relations between objects, especially their suppo...
research
12/23/2016

Two-stream convolutional neural network for accurate RGB-D fingertip detection using depth and edge information

Accurate detection of fingertips in depth image is critical for human-co...
research
03/23/2023

ReVersion: Diffusion-Based Relation Inversion from Images

Diffusion models gain increasing popularity for their generative capabil...

Please sign up or login with your details

Forgot password? Click here to reset