Reprojection R-CNN: A Fast and Accurate Object Detector for 360° Images

07/27/2019
by   Pengyu Zhao, et al.
8

360 images are usually represented in either equirectangular projection (ERP) or multiple perspective projections. Different from the flat 2D images, the detection task is challenging for 360 images due to the distortion of ERP and the inefficiency of perspective projections. However, existing methods mostly focus on one of the above representations instead of both, leading to limited detection performance. Moreover, the lack of appropriate bounding-box annotations as well as the annotated datasets further increases the difficulties of the detection task. In this paper, we present a standard object detection framework for 360 images. Specifically, we adapt the terminologies of the traditional object detection task to the omnidirectional scenarios, and propose a novel two-stage object detector, i.e., Reprojection R-CNN by combining both ERP and perspective projection. Owing to the omnidirectional field-of-view of ERP, Reprojection R-CNN first generates coarse region proposals efficiently by a distortion-aware spherical region proposal network. Then, it leverages the distortion-free perspective projection and refines the proposed regions by a novel reprojection network. We construct two novel synthetic datasets for training and evaluation. Experiments reveal that Reprojection R-CNN outperforms the previous state-of-the-art methods on the mAP metric. In addition, the proposed detector could run at 178ms per image in the panoramic datasets, which implies its practicability in real-world applications.

READ FULL TEXT

page 1

page 3

page 5

page 9

research
02/07/2022

Field-of-View IoU for Object Detection in 360° Images

360 cameras have gained popularity over the last few years. In this pape...
research
10/25/2016

mdBrief - A Fast Online Adaptable, Distorted Binary Descriptor for Real-Time Applications Using Calibrated Wide-Angle Or Fisheye Cameras

Fast binary descriptors build the core for many vision based application...
research
08/28/2023

PanoSwin: a Pano-style Swin Transformer for Panorama Understanding

In panorama understanding, the widely used equirectangular projection (E...
research
01/17/2022

Distortion-Aware Brushing for Interactive Cluster Analysis in Multidimensional Projections

Brushing is an everyday interaction in 2D scatterplots, which allows use...
research
09/23/2019

Improving CNN-based Planar Object Detection with Geometric Prior Knowledge

In this paper, we focus on the question: how might mobile robots take ad...
research
08/02/2017

Flat2Sphere: Learning Spherical Convolution for Fast Features from 360° Imagery

While 360 cameras offer tremendous new possibilities in vision, graphics...
research
04/26/2021

Practical Wide-Angle Portraits Correction with Deep Structured Models

Wide-angle portraits often enjoy expanded views. However, they contain p...

Please sign up or login with your details

Forgot password? Click here to reset