DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns

08/10/2016
by   Ali Diba, et al.
0

The recognition of human actions and the determination of human attributes are two tasks that call for fine-grained classification. Indeed, often rather small and inconspicuous objects and features have to be detected to tell their classes apart. In order to deal with this challenge, we propose a novel convolutional neural network that mines mid-level image patches that are sufficiently dedicated to resolve the corresponding subtleties. In particular, we train a newly de- signed CNN (DeepPattern) that learns discriminative patch groups. There are two innovative aspects to this. On the one hand we pay attention to contextual information in an origi- nal fashion. On the other hand, we let an iteration of feature learning and patch clustering purify the set of dedicated patches that we use. We validate our method for action clas- sification on two challenging datasets: PASCAL VOC 2012 Action and Stanford 40 Actions, and for attribute recogni- tion we use the Berkeley Attributes of People dataset. Our discriminative mid-level mining CNN obtains state-of-the- art results on these datasets, without a need for annotations about parts and poses.

READ FULL TEXT

page 1

page 5

page 7

research
11/29/2016

Weakly-supervised Discriminative Patch Learning via CNN for Fine-grained Recognition

Research on fine-grained recognition has recently shifted from multistag...
research
05/05/2015

Contextual Action Recognition with R*CNN

There are multiple cues in an image which reveal what action a person is...
research
12/08/2014

Actions and Attributes from Wholes and Parts

We investigate the importance of parts for the tasks of action and attri...
research
11/10/2018

Multi-label Object Attribute Classification using a Convolutional Neural Network

Objects of different classes can be described using a limited number of ...
research
08/01/2023

Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis

Human body-pose estimation is a complex problem in computer vision. Rece...
research
05/14/2012

Unsupervised Discovery of Mid-Level Discriminative Patches

The goal of this paper is to discover a set of discriminative patches wh...
research
09/14/2015

Expanded Parts Model for Semantic Description of Humans in Still Images

We introduce an Expanded Parts Model (EPM) for recognizing human attribu...

Please sign up or login with your details

Forgot password? Click here to reset