Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

02/04/2019
by   Andrea Galassi, et al.
0

Attention is an increasingly popular mechanism used in a wide range of neural architectures. Because of the fast-paced advances in this domain, a systematic overview of attention is still missing. In this article, we define a unified model for attention architectures for natural language processing, with a focus on architectures designed to work with vector representation of the textual data. We discuss the dimensions along which proposals differ, the possible uses of attention, and chart the major research activities and open challenges in the area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2016

Exploring Different Dimensions of Attention for Uncertainty Detection

Neural networks with attention have proven effective for many natural la...
research
12/05/2018

Attention Boosted Sequential Inference Model

Attention mechanism has been proven effective on natural language proces...
research
02/18/2023

Transformadores: Fundamentos teoricos y Aplicaciones

Transformers are a neural network architecture originally designed for n...
research
11/10/2019

Location Attention for Extrapolation to Longer Sequences

Neural networks are surprisingly good at interpolating and perform remar...
research
03/22/2021

BERT: A Review of Applications in Natural Language Processing and Understanding

In this review, we describe the application of one of the most popular d...
research
05/25/2021

Extending the Abstraction of Personality Types based on MBTI with Machine Learning and Natural Language Processing

A data-centric approach with Natural Language Processing (NLP) to predic...
research
07/22/2016

Syntax-based Attention Model for Natural Language Inference

Introducing attentional mechanism in neural network is a powerful concep...

Please sign up or login with your details

Forgot password? Click here to reset