GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations

06/14/2018
by Zhilin Yang, et al.

Modern deep transfer learning approaches have mainly focused on learning generic feature vectors from one task that are transferable to other tasks, such as word embeddings in language and pretrained convolutional features in vision. However, these approaches usually transfer unary features and largely ignore more structured graphical representations. This work explores the possibility of learning generic latent relational graphs that capture dependencies between pairs of data units (e.g., words or pixels) from large-scale unlabeled data and transferring the graphs to downstream tasks. Our proposed transfer learning framework improves performance on various tasks including question answering, natural language inference, sentiment analysis, and image classification. We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden units), and to embedding-free units such as image pixels.
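
To make the idea of transferring a graph rather than feature vectors concrete, here is a minimal sketch in plain Python. The abstract does not say how the graph is computed, so the affinity function (a squared ReLU of a dot product, normalized per column) and the names `affinity_graph` and `apply_graph` are assumptions made for illustration, not the paper's implementation. The point it illustrates is that the graph is a T x T matrix over data units, so it can be applied to features of any dimensionality, including ones it was never trained on.

```python
def affinity_graph(keys, queries):
    """Build a T x T relational graph from per-unit key and query vectors.

    Assumption for illustration: affinity is relu(key_i . query_j) ** 2,
    normalized so that every column j sums to 1.
    """
    t = len(keys)
    raw = [
        [max(0.0, sum(k * q for k, q in zip(keys[i], queries[j]))) ** 2
         for j in range(t)]
        for i in range(t)
    ]
    graph = [[0.0] * t for _ in range(t)]
    for j in range(t):
        col_sum = sum(raw[i][j] for i in range(t))
        for i in range(t):
            # Fall back to uniform weights if unit j has no positive affinity.
            graph[i][j] = raw[i][j] / col_sum if col_sum > 0 else 1.0 / t
    return graph


def apply_graph(graph, features):
    """Mix downstream features: out[j] = sum_i graph[i][j] * features[i]."""
    t = len(features)
    d = len(features[0])
    return [
        [sum(graph[i][j] * features[i][k] for i in range(t)) for k in range(d)]
        for j in range(t)
    ]


# Hypothetical toy input: three units with 2-d keys and queries.
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
queries = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
graph = affinity_graph(keys, queries)

# The same graph is applied to 1-d features it was not built from.
downstream = [[1.0], [2.0], [3.0]]
mixed = apply_graph(graph, downstream)
```

Because `apply_graph` only requires one feature row per unit, the same `graph` could be applied to word embeddings, hidden states, or raw pixel values, which is the kind of transfer the abstract describes.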

research
05/05/2017

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Many modern NLP systems rely on word embeddings, previously trained in a...
research
05/11/2018

Domain Adapted Word Embeddings for Improved Sentiment Classification

Generic word embeddings are trained on large-scale generic corpora; Doma...
research
04/01/2021

Evaluating Neural Word Embeddings for Sanskrit

Recently, the supervised learning paradigm's surprisingly remarkable per...
research
05/29/2017

Neural Embeddings of Graphs in Hyperbolic Space

Neural embeddings have been used with great success in Natural Language ...
research
10/08/2015

Mapping Unseen Words to Task-Trained Embedding Spaces

We consider the supervised training setting in which we learn task-speci...
research
11/15/2016

Intrinsic Geometric Information Transfer Learning on Multiple Graph-Structured Datasets

Graphs provide a powerful means for representing complex interactions be...
research
08/30/2019

Single Training Dimension Selection for Word Embedding with PCA

In this paper, we present a fast and reliable method based on PCA to sel...
