DeepAI AI Chat
Log In Sign Up

Spot the Difference by Object Detection

by   Junhui Wu, et al.
Microsoft, Inc.

In this paper, we propose a simple yet effective solution to a change detection task that detects the difference between two images, which we call "spot the difference". Our approach uses CNN-based object detection by stacking two aligned images as input and considering the differences between the two images as objects to detect. An early-merging architecture is used as the backbone network. Our method is accurate, fast and robust while using very cheap annotation. We verify the proposed method on the task of change detection between the digital design and its photographic image of a book. Compared to verification based methods, our object detection based method outperforms other methods by a large margin and gives extra information of location. We compress the network and achieve 24 times acceleration while keeping the accuracy. Besides, as we synthesize the training data for detection using weakly labeled images, our method does not need expensive bounding box annotation.


page 1

page 2

page 3

page 6

page 8

page 9


Gliding vertex on the horizontal bounding box for multi-oriented object detection

Object detection has recently experienced substantial progress. Yet, the...

Object Detection for Comics using Manga109 Annotations

With the growth of digitized comics, image understanding techniques are ...

Stereo R-CNN based 3D Object Detection for Autonomous Driving

We propose a 3D object detection method for autonomous driving by fully ...

Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection

Modern leading object detectors are either two-stage or one-stage networ...

AttentionNet: Aggregating Weak Directions for Accurate Object Detection

We present a novel detection method using a deep convolutional neural ne...

An Object Detection by using Adaptive Structural Learning of Deep Belief Network

Deep learning forms a hierarchical network structure for representation ...

GaTector: A Unified Framework for Gaze Object Prediction

Gaze object prediction (GOP) is a newly proposed task that aims to disco...