Reliable automatic evaluation of summarization systems is challenging du...
Evaluation metrics that are not robust to dialect variation make it
impo...
Much of text-to-speech research relies on human evaluation, which incurs...
Evaluation practices in natural language generation (NLG) have many know...
Recent developments in machine translation and multilingual text generat...
Experiments with pretrained models such as BERT are often based on a sin...
We introduce GEM, a living benchmark for natural language Generation (NL...
The quality of machine translation systems has dramatically improved ove...
Text generation has made significant advances in the last few years. Yet...
We present a probabilistic framework for multilingual neural machine
tra...
Neural conditional text generation systems have achieved significant pro...
Although deep learning models perform remarkably across a range of tasks...
Interactive tools make data analysis both more efficient and more access...