Contextualization of Morphological Inflection

05/04/2019
by Ekaterina Vylomova, et al.

Critical to natural language generation is the production of correctly inflected text. In this paper, we isolate the task of predicting a fully inflected sentence from its partially lemmatized version. Unlike traditional morphological inflection or surface realization, our task input does not provide "gold" tags that specify what morphological features to realize on each lemmatized word; rather, such features must be inferred from sentential context. We develop a neural hybrid graphical model that explicitly reconstructs morphological features before predicting the inflected forms, and compare this to a system that directly predicts the inflected forms without relying on any morphological annotation. We experiment on several typologically diverse languages from the Universal Dependencies treebanks, showing the utility of incorporating linguistically motivated latent variables into NLP models.
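To make the task format concrete, the following toy sketch shows the input/output shape and the two-step idea of first inferring morphological features from context and then inflecting each lemma. Everything in it is an illustrative assumption: the lexicon, the cue words, the feature strings, and the function names `infer_features` and `inflect` are hypothetical, and the hand-written rules merely stand in for the neural model described in the abstract.

```python
from typing import Dict, List, Tuple

# Hypothetical toy lexicon: (lemma, feature bundle) -> inflected form.
LEXICON: Dict[Tuple[str, str], str] = {
    ("dog", "N;SG"): "dog",
    ("dog", "N;PL"): "dogs",
    ("bark", "V;PRS;3;SG"): "barks",
    ("bark", "V;PRS;PL"): "bark",
}

# Hypothetical word classes and context cues for the toy rules below.
NOUNS = {"dog"}
VERBS = {"bark"}
PLURAL_CUES = {"two", "these"}


def infer_features(tokens: List[str]) -> List[str]:
    """Step 1 (toy): guess a feature bundle per token from sentence context.

    No gold tags are given; number is inferred from a preceding cue word,
    and the verb agrees with the most recent noun.
    """
    feats: List[str] = []
    number = "SG"
    for i, tok in enumerate(tokens):
        if tok in NOUNS:
            number = "PL" if i > 0 and tokens[i - 1] in PLURAL_CUES else "SG"
            feats.append(f"N;{number}")
        elif tok in VERBS:
            feats.append("V;PRS;PL" if number == "PL" else "V;PRS;3;SG")
        else:
            feats.append("")  # token is not lemmatized; leave unchanged
    return feats


def inflect(tokens: List[str]) -> List[str]:
    """Step 2 (toy): realize each lemma under its inferred features."""
    feats = infer_features(tokens)
    return [LEXICON.get((tok, f), tok) for tok, f in zip(tokens, feats)]


if __name__ == "__main__":
    print(" ".join(inflect(["two", "dog", "bark"])))
    print(" ".join(inflect(["this", "dog", "bark"])))
```

The same lemma sequence "dog bark" is realized differently depending on the surrounding words, which is the sense in which the features must be inferred from sentential context rather than supplied as input.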


research
04/22/2020

Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection

Universal Dependencies is an open community effort to create cross-lingu...
research
09/06/2018

82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models

We present the Uppsala system for the CoNLL 2018 Shared Task on universa...
research
04/04/2019

A Simple Joint Model for Improved Contextual Neural Lemmatization

English verbs have multiple forms. For instance, talk may also appear as...
research
10/05/2019

Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging

Semitic languages can be highly ambiguous, having several interpretation...
research
12/11/2019

Two Birds with One Stone: Investigating Invertible Neural Networks for Inverse Problems in Morphology

Most problems in natural language processing can be approximated as inve...
research
11/07/2019

Transition-Based Deep Input Linearization

Traditional methods for deep NLG adopt pipeline approaches comprising st...
research
10/26/2022

Eeny, meeny, miny, moe. How to choose data for morphological inflection

Data scarcity is a widespread problem in numerous natural language proce...
