Deep Fishing: Gradient Features from Deep Nets

07/23/2015
by   Albert Gordo, et al.
0

Convolutional Networks (ConvNets) have recently improved image recognition performance thanks to end-to-end learning of deep feed-forward models from raw pixels. Deep learning is a marked departure from the previous state of the art, the Fisher Vector (FV), which relied on gradient-based encoding of local hand-crafted features. In this paper, we discuss a novel connection between these two approaches. First, we show that one can derive gradient representations from ConvNets in a similar fashion to the FV. Second, we show that this gradient representation actually corresponds to a structured matrix that allows for efficient similarity computation. We experimentally study the benefits of transferring this representation over the outputs of ConvNet layers, and find consistent improvements on the Pascal VOC 2007 and 2012 datasets.

READ FULL TEXT
research
12/07/2013

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks

Most phoneme recognition state-of-the-art systems rely on a classical ne...
research
06/08/2016

Structured Convolution Matrices for Energy-efficient Deep learning

We derive a relationship between network representation in energy-effici...
research
10/04/2021

WaveBeat: End-to-end beat and downbeat tracking in the time domain

Deep learning approaches for beat and downbeat tracking have brought adv...
research
03/22/2020

Audio Impairment Recognition Using a Correlation-Based Feature Representation

Audio impairment recognition is based on finding noise in audio files an...
research
06/13/2016

Deep Image Homography Estimation

We present a deep convolutional neural network for estimating the relati...
research
03/16/2017

End-to-End Learning for Structured Prediction Energy Networks

Structured Prediction Energy Networks (SPENs) are a simple, yet expressi...

Please sign up or login with your details

Forgot password? Click here to reset