Mining Mid-level Visual Patterns with Deep CNN Activations

06/21/2015
by   Yao Li, et al.
0

The purpose of mid-level visual element discovery is to find clusters of image patches that are both representative and discriminative. Here we study this problem from the prospective of pattern mining while relying on the recently popularized Convolutional Neural Networks (CNNs). We observe that a fully-connected CNN activation extracted from an image patch typically possesses two appealing properties that enable its seamless integration with pattern mining techniques. The marriage between CNN activations and association rule mining, a well-known pattern mining technique in the literature, leads to fast and effective discovery of representative and discriminative patterns from a huge number of image patches. When we retrieve and visualize image patches with the same pattern, surprisingly, they are not only visually similar but also semantically consistent, and thus give rise to a mid-level visual element in our work. Given the patterns and retrieved mid-level visual elements, we propose two methods to generate image feature representations for each. The first method is to use the patterns as codewords in a dictionary, similar to the Bag-of-Visual-Words model, we compute a Bag-of-Patterns representation. The second one relies on the retrieved mid-level visual elements to construct a Bag-of-Elements representation. We evaluate the two encoding methods on scene and object classification tasks, and demonstrate that our approach outperforms or matches recent works using CNN activations for these tasks.

READ FULL TEXT

page 2

page 6

page 8

page 9

page 13

page 14

page 15

page 16

research
11/24/2014

Mid-level Deep Pattern Mining

Mid-level visual element discovery aims to find clusters of image patche...
research
03/31/2016

Modeling Visual Compatibility through Hierarchical Mid-level Elements

In this paper we present a hierarchical method to discover mid-level ele...
research
03/18/2017

PatternNet: Visual Pattern Mining with Deep Neural Network

Visual patterns represent the discernible regularity in the visual world...
research
05/14/2012

Unsupervised Discovery of Mid-Level Discriminative Patches

The goal of this paper is to discover a set of discriminative patches wh...
research
12/31/2015

Event Specific Multimodal Pattern Mining with Image-Caption Pairs

In this paper we describe a novel framework and algorithms for discoveri...
research
05/29/2017

Ensemble of Part Detectors for Simultaneous Classification and Localization

Part-based representation has been proven to be effective for a variety ...
research
12/04/2014

Fisher Kernel for Deep Neural Activations

Compared to image representation based on low-level local descriptors, d...

Please sign up or login with your details

Forgot password? Click here to reset