Bayesian Paragraph Vectors

11/10/2017
by   Geng Ji, et al.
0

Word2vec (Mikolov et al., 2013) has proven to be successful in natural language processing by capturing the semantic relationships between different words. Built on top of single-word embeddings, paragraph vectors (Le and Mikolov, 2014) find fixed-length representations for pieces of text with arbitrary lengths, such as documents, paragraphs, and sentences. In this work, we propose a novel interpretation for neural-network-based paragraph vectors by developing an unsupervised generative model whose maximum likelihood solution corresponds to traditional paragraph vectors. This probabilistic formulation allows us to go beyond point estimates of parameters and to perform Bayesian posterior inference. We find that the entropy of paragraph vectors decreases with the length of documents, and that information about posterior uncertainty improves performance in supervised learning tasks such as sentiment analysis and paraphrase detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2017

Spherical Paragraph Model

Representing texts as fixed-length vectors is central to many language p...
research
04/18/2021

Variational Weakly Supervised Sentiment Analysis with Posterior Regularization

Sentiment analysis is an important task in natural language processing (...
research
04/16/2021

Word2rate: training and evaluating multiple word embeddings as statistical transitions

Using pretrained word embeddings has been shown to be a very effective w...
research
08/01/2017

Learned in Translation: Contextualized Word Vectors

Computer vision has benefited from initializing multiple deep layers wit...
research
01/14/2020

Balancing the composition of word embeddings across heterogenous data sets

Word embeddings capture semantic relationships based on contextual infor...
research
11/24/2019

Causally Denoise Word Embeddings Using Half-Sibling Regression

Distributional representations of words, also known as word vectors, hav...
research
12/29/2020

Bayesian analysis of seasonally cointegrated VAR model

The paper aims at developing the Bayesian seasonally cointegrated model ...

Please sign up or login with your details

Forgot password? Click here to reset