Word Embedding Visualization Via Dictionary Learning

10/09/2019
by Juexiao Zhang, et al.

Word embedding techniques based on co-occurrence statistics have proved very useful for extracting semantic and syntactic representations of words as low-dimensional continuous vectors. In this work, we find that dictionary learning can decompose these word vectors into linear combinations of more elementary word factors. We demonstrate that many of the learned factors have surprisingly strong semantic or syntactic meaning, corresponding to factors previously identified manually by human inspection. Dictionary learning thus provides a powerful visualization tool for understanding word embedding representations. Furthermore, we show that the word factors help identify key semantic and syntactic differences in word analogy tasks and improve upon state-of-the-art word embedding techniques in these tasks by a large margin.
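
To make the idea of a word vector as a linear combination of a few elementary factors concrete, here is a minimal sketch of the sparse coding step only. It is not the authors' implementation: the dictionary is random rather than learned, the "word vector" is synthetic, and the function name `sparse_code`, the penalty `lam`, the non-negativity constraint, and the ISTA solver are all assumptions made for illustration. A full dictionary learning procedure would also update the dictionary itself, typically by alternating with a step like this one.

```python
import numpy as np

def sparse_code(x, D, lam=0.1, n_iter=500):
    """Non-negative sparse coding of x over a fixed dictionary D.

    Approximately minimizes 0.5 * ||x - D a||^2 + lam * sum(a) subject to
    a >= 0, using ISTA (proximal gradient descent). Columns of D are factors.
    """
    # Lipschitz constant of the gradient of the quadratic term.
    L = np.linalg.norm(D, 2) ** 2
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)
        # Gradient step, then shrink by lam / L and clip at zero.
        a = np.maximum(a - (grad + lam) / L, 0.0)
    return a

# Hypothetical setup: 20 random unit-norm "word factors" in 50 dimensions.
rng = np.random.default_rng(0)
dim, n_factors = 50, 20
D = rng.standard_normal((dim, n_factors))
D /= np.linalg.norm(D, axis=0)

# A synthetic "word vector" built from exactly two factors.
true_coeffs = np.zeros(n_factors)
true_coeffs[[3, 11]] = [1.0, 0.7]
x = D @ true_coeffs

coeffs = sparse_code(x, D)
# Indices of the two factors with the largest coefficients.
top_factors = sorted(int(i) for i in np.argsort(coeffs)[-2:])
print(top_factors, np.round(coeffs[top_factors], 2))
```

In this toy setting, inspecting which factors receive the largest coefficients plays the role that visualizing a word's active factors plays in the abstract's description. The recovered coefficients are somewhat smaller than the true ones because the sparsity penalty shrinks them.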


research
11/27/2015

Category Enhanced Word Embedding

Distributed word representations have been demonstrated to be effective ...
research
03/29/2021

Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors

Transformer networks have revolutionized NLP representation learning sin...
research
06/10/2016

Unsupervised Learning of Word-Sequence Representations from Scratch via Convolutional Tensor Decomposition

Unsupervised text embeddings extraction is crucial for text understandin...
research
11/14/2017

Modeling Semantic Relatedness using Global Relation Vectors

Word embedding models such as GloVe rely on co-occurrence statistics fro...
research
11/06/2015

Towards a Better Understanding of Predict and Count Models

In a recent paper, Levy and Goldberg pointed out an interesting connecti...
research
05/13/2022

IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations

What is the relation between a word and its description, or a word and i...
research
07/02/2019

Obj-GloVe: Scene-Based Contextual Object Embedding

Recently, with the prevalence of large-scale image dataset, the co-occur...
