
-
ToTTo: A Controlled Table-To-Text Generation Dataset
We present ToTTo, an open-domain English table-to-text dataset with over...
read it
-
How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions
We present a large-scale dataset for the task of rewriting an ill-formed...
read it
-
Attention Interpretability Across NLP Tasks
The attention layer in a neural network model provides insights into the...
read it
-
Handling Divergent Reference Texts when Evaluating Table-to-Text Generation
Automatically constructed datasets for generating text from semi-structu...
read it
-
Text Generation with Exemplar-based Adaptive Decoding
We propose a novel conditioned text generation model. It draws inspirati...
read it
-
UniMorph 2.0: Universal Morphology
The Universal Morphology UniMorph project is a collaborative effort to i...
read it
-
Learning To Split and Rephrase From Wikipedia Edit History
Split and rephrase is the task of breaking down a sentence into shorter ...
read it
-
WikiAtomicEdits: A Multilingual Corpus of Wikipedia Edits for Modeling Language and Discourse
We release a corpus of 43 million atomic edits across 8 languages. These...
read it
-
Identifying Well-formed Natural Language Questions
Understanding search queries is a hard problem as it involves dealing wi...
read it
-
CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages
The CoNLL-SIGMORPHON 2017 shared task on supervised morphological genera...
read it
-
DyNet: The Dynamic Neural Network Toolkit
We describe DyNet, a toolkit for implementing neural network models base...
read it
-
Correlation-based Intrinsic Evaluation of Word Vector Representations
We introduce QVEC-CCA--an intrinsic evaluation metric for word vector re...
read it
-
Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning
We use Bayesian optimization to learn curricula for word representation ...
read it
-
Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning
We introduce polyglot language models, recurrent neural network models t...
read it
-
Problems With Evaluation of Word Embeddings Using Word Similarity Tasks
Lacking standardized extrinsic evaluation methods for vector representat...
read it
-
Cross-lingual Models of Word Embeddings: An Empirical Comparison
Despite interest in using cross-lingual knowledge to learn word embeddin...
read it
-
Morphological Inflection Generation Using Character Sequence to Sequence Learning
Morphological inflection generation is the task of generating the inflec...
read it
-
Morpho-syntactic Lexicon Generation Using Graph-based Semi-supervised Learning
Morpho-syntactic lexicons provide information about the morphological an...
read it
-
Non-distributional Word Vector Representations
Data-driven representation learning for words is a technique of central ...
read it
-
Multilingual Open Relation Extraction Using Cross-lingual Projection
Open domain relation extraction systems identify relation and argument p...
read it