NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields

09/24/2022
by Jiankai Sun, et al.

Neural Radiance Fields (NeRFs) have been used successfully for scene representation, and recent work has built robotic navigation and manipulation systems on NeRF-based environment representations. Because object localization underpins many robotic applications, we study object localization within a NeRF scene to further unlock the potential of NeRFs in robotic systems. We propose NeRF-Loc, a transformer-based framework that extracts 3D bounding boxes of objects in NeRF scenes. NeRF-Loc takes a pre-trained NeRF model and a camera view as input and produces labeled 3D bounding boxes of objects as output. Concretely, we design a pair of parallel transformer encoder branches, the coarse stream and the fine stream, to encode both the context and the details of target objects. The encoded features are then fused with attention layers to reduce ambiguity and localize objects accurately. In our comparison, our method outperforms the conventional transformer-based method. We also present NeRFLocBench, the first NeRF samples-based object localization benchmark.
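
The abstract says only that coarse-stream and fine-stream features are "fused together with attention layers"; it does not say how. As a rough illustration of one way such a fusion could work, the sketch below lets fine tokens act as queries over coarse tokens using plain scaled dot-product attention with a residual connection. The function names (`cross_attention`, `fuse_streams`), the direction of attention, the residual connection, and the absence of learned projections are all assumptions made for illustration, not details taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each query attends over all keys."""
    dim = len(keys[0])
    out = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(dim).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim)
            for k in keys
        ]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def fuse_streams(coarse_tokens, fine_tokens):
    """Hypothetical fusion (assumption): fine tokens query the coarse
    context, and the attended context is added back residually."""
    attended = cross_attention(fine_tokens, coarse_tokens, coarse_tokens)
    return [
        [f + a for f, a in zip(fine, att)]
        for fine, att in zip(fine_tokens, attended)
    ]


# Toy features: 2 coarse tokens and 3 fine tokens, each of dimension 2.
coarse = [[1.0, 0.0], [0.0, 1.0]]
fine = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused = fuse_streams(coarse, fine)
```

In this toy setup each fine token keeps its own detail while picking up a context vector weighted toward the coarse tokens it most resembles. A real model would add learned query, key, and value projections and multiple heads.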


research
04/25/2019

RepPoints: Point Set Representation for Object Detection

Modern object detectors rely heavily on rectangular bounding boxes, such...
research
09/23/2022

Transformer-Based Microbubble Localization

Ultrasound Localization Microscopy (ULM) is an emerging technique that e...
research
11/24/2020

Multi-Stage CNN-Based Monocular 3D Vehicle Localization and Orientation Estimation

This paper aims to design a 3D object detection model from 2D images tak...
research
03/29/2017

Iterative Object and Part Transfer for Fine-Grained Recognition

The aim of fine-grained recognition is to identify sub-ordinate categori...
research
02/26/2022

An End-to-End Transformer Model for Crowd Localization

Crowd localization, predicting head positions, is a more practical and h...
research
06/26/2023

RVT: Robotic View Transformer for 3D Object Manipulation

For 3D object manipulation, methods that build an explicit 3D representa...
research
04/27/2016

Simultaneous Food Localization and Recognition

The development of automatic nutrition diaries, which would allow to kee...
