DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection

09/11/2014
by   Wanli Ouyang, et al.
0

In this paper, we propose multi-stage and deformable deep convolutional neural networks for object detection. This new deep learning object detection diagram has innovations in multiple aspects. In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty. With the proposed multi-stage training strategy, multiple classifiers are jointly optimized to process samples at different difficulty levels. A new pre-training strategy is proposed to learn feature representations more suitable for the object detection task and with good generalization capability. By changing the net structures, training strategies, adding and removing some key components in the detection pipeline, a set of models with large diversity are obtained, which significantly improves the effectiveness of modeling averaging. The proposed approach ranked #2 in ILSVRC 2014. It improves the mean averaged precision obtained by RCNN, which is the state-of-the-art of object detection, from 31% to 45%. Detailed component-wise analysis is also provided through extensive experimental evaluation.

READ FULL TEXT

page 7

page 8

research
12/17/2014

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

In this paper, we propose deformable deep convolutional neural networks ...
research
04/16/2014

Generic Object Detection With Dense Neural Patterns and Regionlets

This paper addresses the challenge of establishing a bridge between deep...
research
04/18/2012

Convolutional Neural Networks Applied to House Numbers Digit Classification

We classify digits of real-world house numbers using convolutional neura...
research
04/25/2019

HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object Detection

Object detection has been a challenging task in computer vision. Althoug...
research
06/10/2020

Condensing Two-stage Detection with Automatic Object Key Part Discovery

Modern two-stage object detectors generally require excessively large mo...
research
11/27/2018

Deformable ConvNets v2: More Deformable, Better Results

The superior performance of Deformable Convolutional Networks arises fro...
research
12/02/2019

DeepLofargram: A Deep Learning based Fluctuating Dim Frequency Line Detection and Recovery

This paper investigates the problem of dim frequency line detection and ...

Please sign up or login with your details

Forgot password? Click here to reset