Generative Adversarial Nets for Multiple Text Corpora

12/25/2017
by   Baiyang Wang, et al.
0

Generative adversarial nets (GANs) have been successfully applied to the artificial generation of image data. In terms of text data, much has been done on the artificial generation of natural language from a single corpus. We consider multiple text corpora as the input data, for which there can be two applications of GANs: (1) the creation of consistent cross-corpus word embeddings given different word embeddings per corpus; (2) the generation of robust bag-of-words document embeddings for each corpora. We demonstrate our GAN models on real-world text data sets from different corpora, and show that embeddings from both models lead to improvements in supervised learning problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/14/2017

Sentiment Analysis by Joint Learning of Word Embeddings and Classifier

Word embeddings are representations of individual words of a text docume...
research
08/27/2018

Generating Text through Adversarial Training using Skip-Thought Vectors

In the past few years, various advancements have been made in generative...
research
12/07/2018

Asynchronous Training of Word Embeddings for Large Text Corpora

Word embeddings are a powerful approach for analyzing language and have ...
research
06/22/2022

Understanding the Properties of Generated Corpora

Models for text generation have become focal for many research tasks and...
research
02/14/2020

Semantic Relatedness and Taxonomic Word Embeddings

This paper connects a series of papers dealing with taxonomic word embed...
research
11/13/2020

Learning language variations in news corpora through differential embeddings

There is an increasing interest in the NLP community in capturing variat...
research
10/22/2018

Proactive Security: Embedded AI Solution for Violent and Abusive Speech Recognition

Violence is an epidemic in Brazil and a problem on the rise world-wide. ...

Please sign up or login with your details

Forgot password? Click here to reset