Russian Natural Language Generation: Creation of a Language Modelling Dataset and Evaluation with Modern Neural Architectures

05/05/2020
by   Zein Shaheen, et al.
0

Generating coherent, grammatically correct, and meaningful text is very challenging, however, it is crucial to many modern NLP systems. So far, research has mostly focused on English language, for other languages both standardized datasets, as well as experiments with state-of-the-art models, are rare. In this work, we i) provide a novel reference dataset for Russian language modeling, ii) experiment with popular modern methods for text generation, namely variational autoencoders, and generative adversarial networks, which we trained on the new dataset. We evaluate the generated text regarding metrics such as perplexity, grammatical correctness and lexical diversity.

READ FULL TEXT
research
01/02/2019

Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation

Recent advances in deep learning have resulted in a resurgence in the po...
research
09/25/2020

Controllable Text Generation with Focused Variation

This work introduces Focused-Variation Network (FVN), a novel model to c...
research
05/18/2022

GPoeT-2: A GPT-2 Based Poem Generator

This project aims to produce the next volume of machine-generated poetry...
research
07/08/2021

HinGE: A Dataset for Generation and Evaluation of Code-Mixed Hinglish Text

Text generation is a highly active area of research in the computational...
research
09/12/2022

Lexical Simplification Benchmarks for English, Portuguese, and Spanish

Even in highly-developed countries, as many as 15-30% of the population ...
research
02/15/2023

NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation

Translating natural language into Bash Commands is an emerging research ...
research
07/25/2022

Innovations in Neural Data-to-text Generation

The neural boom that has sparked natural language processing (NLP) resea...

Please sign up or login with your details

Forgot password? Click here to reset