Improving Object Detection with Region Similarity Learning

03/01/2017
by   Feng Gao, et al.
0

Object detection aims to identify instances of semantic objects of a certain class in images or videos. The success of state-of-the-art approaches is attributed to the significant progress of object proposal and convolutional neural networks (CNNs). Most promising detectors involve multi-task learning with an optimization objective of softmax loss and regression loss. The first is for multi-class categorization, while the latter is for improving localization accuracy. However, few of them attempt to further investigate the hardness of distinguishing different sorts of distracting background regions (i.e., negatives) from true object regions (i.e., positives). To improve the performance of classifying positive object regions vs. a variety of negative background regions, we propose to incorporate triplet embedding into learning objective. The triplet units are formed by assigning each negative region to a meaningful object class and establishing class- specific negatives, followed by triplets construction. Over the benchmark PASCAL VOC 2007, the proposed triplet em- bedding has improved the performance of well-known FastRCNN model with a mAP gain of 2.1 from the triplet embedding and has achieved a mAP improvement of 1.2

READ FULL TEXT

page 1

page 2

research
01/24/2019

Object Detection based on Region Decomposition and Assembly

Region-based object detection infers object regions for one or more cate...
research
04/11/2023

A Comprehensive Study on Object Detection Techniques in Unconstrained Environments

Object detection is a crucial task in computer vision that aims to ident...
research
02/11/2020

Object Detection as a Positive-Unlabeled Problem

As with other deep learning methods, label quality is important for lear...
research
05/07/2015

Object detection via a multi-region & semantic segmentation-aware CNN model

We propose an object detection system that relies on a multi-region deep...
research
05/11/2019

Triplet Distillation for Deep Face Recognition

Convolutional neural networks (CNNs) have achieved a great success in fa...
research
05/19/2017

Quadruplet Network with One-Shot Learning for Visual Tracking

As a discriminative method of one-shot learning, Siamese deep network al...
research
05/30/2018

Collaborative Human-AI (CHAI): Evidence-Based Interpretable Melanoma Classification in Dermoscopic Images

Automated dermoscopic image analysis has witnessed rapid growth in diagn...

Please sign up or login with your details

Forgot password? Click here to reset