SenGen: Sentence Generating Neural Variational Topic Model

08/01/2017
by   Ramesh Nallapati, et al.
0

We present a new topic model that generates documents by sampling a topic for one whole sentence at a time, and generating the words in the sentence using an RNN decoder that is conditioned on the topic of the sentence. We argue that this novel formalism will help us not only visualize and model the topical discourse structure in a document better, but also potentially lead to more interpretable topics since we can now illustrate topics by sampling representative sentences instead of bag of words or phrases. We present a variational auto-encoder approach for learning in which we use a factorized variational encoder that independently models the posterior over topical mixture vectors of documents using a feed-forward network, and the posterior over topic assignments to sentences using an RNN. Our preliminary experiments on two different datasets indicate early promise, but also expose many challenges that remain to be addressed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2016

Sentence Level Recurrent Topic Model: Letting Topics Speak for Themselves

We propose Sentence Level Recurrent Topic Model (SLRTM), a new topic mod...
research
11/25/2019

Discovering topics with neural topic models built from PLSA assumptions

In this paper we present a model for unsupervised topic discovery in tex...
research
10/14/2021

Neural Attention-Aware Hierarchical Topic Model

Neural topic models (NTMs) apply deep neural networks to topic modelling...
research
12/07/2015

Jointly Modeling Topics and Intents with Global Order Structure

Modeling document structure is of great importance for discourse analysi...
research
08/20/2019

Learning document embeddings along with their uncertainties

Majority of the text modelling techniques yield only point estimates of ...
research
12/12/2018

Structured Neural Topic Models for Reviews

We present Variational Aspect-based Latent Topic Allocation (VALTA), a f...
research
12/12/2018

Structured Representations for Reviews:Aspect-Based Variational Hidden Factor Models

We present Variational Aspect-Based Latent Dirichlet Allocation (VALDA),...

Please sign up or login with your details

Forgot password? Click here to reset