HyperDet3D: Learning a Scene-conditioned 3D Object Detector

04/12/2022
by   Yu Zheng, et al.
0

A bathtub in a library, a sink in an office, a bed in a laundry room – the counter-intuition suggests that scene provides important prior knowledge for 3D object detection, which instructs to eliminate the ambiguous detection of similar objects. In this paper, we propose HyperDet3D to explore scene-conditioned prior knowledge for 3D object detection. Existing methods strive for better representation of local elements and their relations without scene-conditioned knowledge, which may cause ambiguity merely based on the understanding of individual points and object candidates. Instead, HyperDet3D simultaneously learns scene-agnostic embeddings and scene-specific knowledge through scene-conditioned hypernetworks. More specifically, our HyperDet3D not only explores the sharable abstracts from various 3D scenes, but also adapts the detector to the given scene at test time. We propose a discriminative Multi-head Scene-specific Attention (MSA) module to dynamically control the layer parameters of the detector conditioned on the fusion of scene-conditioned knowledge. Our HyperDet3D achieves state-of-the-art results on the 3D object detection benchmark of the ScanNet and SUN RGB-D datasets. Moreover, through cross-dataset evaluation, we show the acquired scene-conditioned prior knowledge still takes effect when facing 3D scenes with domain gap.

READ FULL TEXT

page 1

page 3

page 7

page 10

research
10/21/2016

Enhanced Object Detection via Fusion With Prior Beliefs from Image Classification

In this paper, we introduce a novel fusion method that can enhance objec...
research
01/06/2023

Object as Query: Equipping Any 2D Object Detector with 3D Detection Ability

3D object detection from multi-view images has drawn much attention over...
research
08/05/2020

Tiny-YOLO object detection supplemented with geometrical data

We propose a method of improving detection precision (mAP) with the help...
research
04/09/2019

Towards Universal Object Detection by Domain Attention

Despite increasing efforts on universal representations for visual recog...
research
12/02/2022

Prediction of Scene Plausibility

Understanding the 3D world from 2D images involves more than detection a...
research
09/22/2021

Pix2seq: A Language Modeling Framework for Object Detection

This paper presents Pix2Seq, a simple and generic framework for object d...
research
11/19/2020

Classification by Attention: Scene Graph Classification with Prior Knowledge

A main challenge in scene graph classification is that the appearance of...

Please sign up or login with your details

Forgot password? Click here to reset