Gating Mechanisms for Combining Character and Word-level Word Representations: An Empirical Study

04/11/2019
by Jorge A. Balazs, et al.

In this paper we study how different ways of combining character-level and word-level representations affect the quality of the final word and sentence representations. We provide strong empirical evidence that modeling characters improves the learned representations at both the word and sentence levels, and that doing so is particularly useful when representing less frequent words. We further show that a feature-wise sigmoid gating mechanism is a robust method for creating representations that encode semantic similarity, as it performed reasonably well on several word similarity datasets. Finally, our findings suggest that properly capturing semantic similarity at the word level does not consistently yield improved performance in downstream sentence-level tasks. Our code is available at https://github.com/jabalazs/gating
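
As a rough illustration of what a feature-wise sigmoid gate can look like, the sketch below mixes a word-level vector and a character-level vector one dimension at a time. It is not taken from the authors' repository. The function names, the choice to compute the gate from the word-level vector, and the convention that the gate weights the character-level vector are all assumptions made for this example.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def feature_wise_gate(word_vec, char_vec, weights, bias):
    """Combine two d-dimensional vectors with one sigmoid gate per dimension.

    Assumed formulation (hypothetical, for illustration only):
        g = sigmoid(W @ word_vec + b)
        out[i] = g[i] * char_vec[i] + (1 - g[i]) * word_vec[i]
    """
    d = len(word_vec)
    if not (len(char_vec) == d and len(weights) == d and len(bias) == d):
        raise ValueError("all inputs must share the same dimensionality")

    combined = []
    for i in range(d):
        # Pre-activation for dimension i: row i of W times the word vector, plus bias.
        pre = sum(weights[i][j] * word_vec[j] for j in range(d)) + bias[i]
        g = sigmoid(pre)
        # Each feature is a convex combination of the two representations.
        combined.append(g * char_vec[i] + (1.0 - g) * word_vec[i])
    return combined


if __name__ == "__main__":
    word = [1.0, 2.0]
    char = [3.0, 6.0]
    zero_w = [[0.0, 0.0], [0.0, 0.0]]
    # With zero weights and zero bias every gate is 0.5, so the result is the average.
    print(feature_wise_gate(word, char, zero_w, [0.0, 0.0]))
```

Because the gate is a vector rather than a single scalar, each feature can lean toward the character-level or the word-level representation independently. In a trained model the weights and bias would be learned jointly with the rest of the network.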


