Describing Multimedia Content using Attention-based Encoder--Decoder Networks

07/04/2015
by   Kyunghyun Cho, et al.
0

Whereas deep neural networks were first mostly used for classification tasks, they are rapidly expanding in the realm of structured output problems, where the observed target is composed of multiple random variables that have a rich joint distribution, given the input. We focus in this paper on the case where the input also has a rich structure and the input and output structures are somehow related. We describe systems that learn to attend to different places in the input, for each element of the output, for a variety of tasks: machine translation, image caption generation, video clip description and speech recognition. All these systems are based on a shared set of building blocks: gated recurrent neural networks and convolutional neural networks, along with trained attention mechanisms. We report on experimental results with these systems, showing impressively good performance and the advantage of the attention mechanism.

READ FULL TEXT

page 5

page 8

research
12/04/2014

End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

We replace the Hidden Markov Model (HMM) which is traditionally used in ...
research
05/23/2017

Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing

Recently, encoder-decoder neural networks have shown impressive performa...
research
06/16/2017

One Model To Learn Them All

Deep learning yields great results across many fields, from speech recog...
research
01/25/2016

Survey on the attention based RNN model and its applications in computer vision

The recurrent neural networks (RNN) can be used to solve the sequence to...
research
12/15/2017

Pre-training Attention Mechanisms

Recurrent neural networks with differentiable attention mechanisms have ...
research
02/15/2022

The Quarks of Attention

Attention plays a fundamental role in both natural and artificial intell...
research
03/26/2021

Modeling the Nonsmoothness of Modern Neural Networks

Modern neural networks have been successful in many regression-based tas...

Please sign up or login with your details

Forgot password? Click here to reset