Pruning and Sparsemax Methods for Hierarchical Attention Networks

04/08/2020
by   João G. Ribeiro, et al.
0

This paper introduces and evaluates two novel Hierarchical Attention Network models [Yang et al., 2016] - i) Hierarchical Pruned Attention Networks, which remove the irrelevant words and sentences from the classification process in order to reduce potential noise in the document classification accuracy and ii) Hierarchical Sparsemax Attention Networks, which replace the Softmax function used in the attention mechanism with the Sparsemax [Martins and Astudillo, 2016], capable of better handling importance distributions where a lot of words or sentences have very low probabilities. Our empirical evaluation on the IMDB Review for sentiment analysis datasets shows both approaches to be able to match the results obtained by the current state-of-the-art (without, however, any significant benefits). All our source code is made available athttps://github.com/jmribeiro/dsl-project.

READ FULL TEXT
research
10/20/2016

Lexicon Integrated CNN Models with Attention for Sentiment Analysis

With the advent of word embeddings, lexicons are no longer fully utilize...
research
11/08/2019

Question Generation from Paragraphs: A Tale of Two Hierarchical Models

Automatic question generation from paragraphs is an important and challe...
research
09/24/2019

TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots

We consider the importance of different utterances in the context for se...
research
09/05/2019

A Better Way to Attend: Attention with Trees for Video Question Answering

We propose a new attention model for video question answering. The main ...
research
03/05/2021

Enhanced Aspect-Based Sentiment Analysis Models with Progressive Self-supervised Attention Learning

In aspect-based sentiment analysis (ABSA), many neural models are equipp...
research
08/16/2019

Bidirectional Context-Aware Hierarchical Attention Network for Document Understanding

The Hierarchical Attention Network (HAN) has made great strides, but it ...
research
09/11/2022

Learning When to Say "I Don't Know"

We propose a new Reject Option Classification technique to identify and ...

Please sign up or login with your details

Forgot password? Click here to reset