Disparity Sliding Window: Object Proposals From Disparity Images

05/17/2018
by   Julian Müller, et al.
0

Sliding window approaches have been widely used for object recognition tasks in recent years. They guarantee an investigation of the entire input image for the object to be detected and allow a localization of that object. Despite the current trend towards deep neural networks, sliding window methods are still used in combination with convolutional neural networks. The risk of overlooking an object is clearly reduced compared to alternative detection approaches which detect objects based on shape, edges or color. Nevertheless, the sliding window technique strongly increases the computational effort as the classifier has to verify a large number of object candidates. This paper proposes a sliding window approach which also uses depth information from a stereo camera. This leads to a greatly decreased number of object candidates without significantly reducing the detection accuracy. A theoretical investigation of the conventional sliding window approach is presented first. Other publications to date only mentioned rough estimations of the computational cost. A mathematical derivation clarifies the number of object candidates with respect to parameters such as image and object size. Subsequently, the proposed disparity sliding window approach is presented in detail. The approach is evaluated on pedestrian detection with annotations and images from the KITTI object detection benchmark. Furthermore, a comparison with two state-of-the-art methods is made. Code is available in C++ and Python https://github.com/julimueller/ disparity-sliding-window.

READ FULL TEXT
research
10/03/2019

A Stereo Algorithm for Thin Obstacles and Reflective Objects

Stereo cameras are a popular choice for obstacle avoidance for outdoor l...
research
05/25/2023

Sliding Window Sum Algorithms for Deep Neural Networks

Sliding window sums are widely used for string indexing, hashing and tim...
research
03/18/2010

Sliding window approach based Text Binarisation from Complex Textual images

Text binarisation process classifies individual pixels as text or backgr...
research
03/27/2010

Active Testing for Face Detection and Localization

We provide a novel search technique, which uses a hierarchical model and...
research
01/23/2017

Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN

While vehicle license plate recognition (VLPR) is usually done with a sl...
research
06/28/2018

Active query-driven visual search using probabilistic bisection and convolutional neural networks

We present a novel efficient object detection and localization framework...
research
12/11/2014

An active search strategy for efficient object class detection

Object class detectors typically apply a window classifier to all the wi...

Please sign up or login with your details

Forgot password? Click here to reset