Fast Object Localization Using a CNN Feature Map Based Multi-Scale Search

04/12/2016
by   Hyungtae Lee, et al.
0

Object localization is an important task in computer vision but requires a large amount of computational power due mainly to an exhaustive multiscale search on the input image. In this paper, we describe a near real-time multiscale search on a deep CNN feature map that does not use region proposals. The proposed approach effectively exploits local semantic information preserved in the feature map of the outermost convolutional layer. A multi-scale search is performed on the feature map by processing all the sub-regions of different sizes using separate expert units of fully connected layers. Each expert unit receives as input local semantic features only from the corresponding sub-regions of a specific geometric shape. Therefore, it contains more nearly optimal parameters tailored to the corresponding shape. This multi-scale and multi-aspect ratio scanning strategy can effectively localize a potential object of an arbitrary size. The proposed approach is fast and able to localize objects of interest with a frame rate of 4 fps while providing improved detection performance over the state-of-the art on the PASCAL VOC 12 and MSCOCO data sets.

READ FULL TEXT

page 2

page 8

page 10

page 14

research
09/24/2019

Multi-scale discriminative Region Discovery for Weakly-Supervised Object Localization

Localizing objects with weak supervision in an image is a key problem of...
research
04/23/2018

Multi-scale prediction for robust hand detection and classification

In this paper, we present a multi-scale Fully Convolutional Networks (MS...
research
11/01/2017

Single Multi-feature detector for Amodal 3D Object Detection in RGB-D Images

This paper aims at fast and high-accuracy amodal 3D object detections in...
research
10/23/2014

Density-Based Region Search with Arbitrary Shape for Object Localization

Region search is widely used for object localization. Typically, the reg...
research
11/18/2014

Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture

In this paper we address three different computer vision tasks using a s...
research
08/13/2021

CNN-based Two-Stage Parking Slot Detection Using Region-Specific Multi-Scale Feature Extraction

Autonomous parking systems start with the detection of available parking...
research
02/11/2016

Generating Discriminative Object Proposals via Submodular Ranking

A multi-scale greedy-based object proposal generation approach is presen...

Please sign up or login with your details

Forgot password? Click here to reset