Improving CNN-based Planar Object Detection with Geometric Prior Knowledge

09/23/2019
by   Jianxiong Cai, et al.
0

In this paper, we focus on the question: how might mobile robots take advantage of affordable RGB-D sensors for object detection? Although current CNN-based object detectors have achieved impressive results, there are three main drawbacks for practical usage on mobile robots: 1) It is hard and time-consuming to collect and annotate large-scale training sets. 2) It usually needs a long training time. 3) CNN-based object detection shows significant weakness in predicting location. We propose a novel approach for the detection of planar objects, which rectifies images with geometric information to compensate for the perspective distortion before feeding it to the CNN detector module, typically a CNN-based detector like YOLO or MASK RCNN. By dealing with the perspective distortion in advance, we eliminate the need for the CNN detector to learn that. Experiments show that this approach significantly boosts the detection performance. Besides, it effectively reduces the number of training images required. In addition to the novel detection framework proposed, we also release an RGB-D dataset for hazmat sign detection. To the best of our knowledge, this is the first public-available hazmat sign detection dataset with RGB-D sensors.

READ FULL TEXT

page 2

page 7

research
03/23/2018

Object Detection for Comics using Manga109 Annotations

With the growth of digitized comics, image understanding techniques are ...
research
12/04/2019

Object Detection with Convolutional Neural Networks

In this chapter, we present a brief overview of the recent development i...
research
07/27/2019

Reprojection R-CNN: A Fast and Accurate Object Detector for 360° Images

360 images are usually represented in either equirectangular projection ...
research
03/22/2021

Temporal Feature Networks for CNN based Object Detection

For reliable environment perception, the use of temporal information is ...
research
06/23/2015

R-CNN minus R

Deep convolutional neural networks (CNNs) have had a major impact in mos...
research
07/13/2020

DeepHAZMAT: Hazardous Materials Sign Detection and Segmentation with Restricted Computational Resources

One of the most challenging and non-trivial tasks in robot-based rescue ...
research
03/15/2019

Generate What You Can't See - a View-dependent Image Generation

In order to operate autonomously, a robot should explore the environment...

Please sign up or login with your details

Forgot password? Click here to reset