Cultural Event Recognition with Visual ConvNets and Temporal Models

04/24/2015
by   Amaia Salvador, et al.
0

This paper presents our contribution to the ChaLearn Challenge 2015 on Cultural Event Classification. The challenge in this task is to automatically classify images from 50 different cultural events. Our solution is based on the combination of visual features extracted from convolutional neural networks with temporal information using a hierarchical classifier scheme. We extract visual features from the last three fully connected layers of both CaffeNet (pretrained with ImageNet) and our fine tuned version for the ChaLearn challenge. We propose a late fusion strategy that trains a separate low-level SVM on each of the extracted neural codes. The class predictions of the low-level SVMs form the input to a higher level SVM, which gives the final event scores. We achieve our best result by adding a temporal refinement step into our classification scheme, which is applied directly to the output of each low-level SVM. Our approach penalizes high classification scores based on visual features when their time stamp does not match well an event-specific temporal distribution learned from the training and validation data. Our system achieved the second best result in the ChaLearn Challenge 2015 on Cultural Event Classification with a mean average precision of 0.767 on the test set.

READ FULL TEXT

page 1

page 3

page 4

research
09/18/2017

Continuous Multimodal Emotion Recognition Approach for AVEC 2017

This paper reports the analysis of audio and visual features in predicti...
research
02/09/2018

Video Event Recognition and Anomaly Detection by Combining Gaussian Process and Hierarchical Dirichlet Process Models

In this paper, we present an unsupervised learning framework for analyzi...
research
11/10/2017

Depression Severity Estimation from Multiple Modalities

Depression is a major debilitating disorder which can affect people from...
research
09/18/2017

Depression Scale Recognition from Audio, Visual and Text Analysis

Depression is a major mental health disorder that is rapidly affecting l...
research
10/14/2015

Better Exploiting OS-CNNs for Better Event Recognition in Images

Event recognition from still images is one of the most important problem...
research
05/05/2015

Visual Summary of Egocentric Photostreams by Representative Keyframes

Building a visual summary from an egocentric photostream captured by a l...
research
02/15/2021

RMS-Net: Regression and Masking for Soccer Event Spotting

The recently proposed action spotting task consists in finding the exact...

Please sign up or login with your details

Forgot password? Click here to reset