Multi Task Deep Morphological Analyzer: Context Aware Joint Morphological Tagging and Lemma Prediction

11/21/2018
by Saurav Jha, et al.

Morphological analysis is an important first step in downstream tasks such as machine translation and dependency parsing of morphologically rich languages (MRLs), including those of the Indo-Aryan and Dravidian families. However, the recombination of morphemes yields several possible inflections for a word, and the resulting ambiguities make the prediction of syntactic traits a notoriously complicated task for MRLs. We propose a character-level neural morphological analyzer, the Multi Task Deep Morphological Analyzer (MT-DMA), based on multitask learning of word-level tag markers for Hindi. To show the portability of our system to other related languages, we also present results on Urdu. MT-DMA predicts the complete set of morphological tags for words of Indo-Aryan languages: part of speech (POS), gender (G), number (N), person (P), case (C), and tense-aspect-modality (TAM) marker, as well as the lemma (L), by learning all of these jointly in a single end-to-end framework. We show the effectiveness of training such deep neural networks by simultaneously optimizing multiple loss functions and sharing initial parameters for context-aware morphological analysis. Using a set of character-level features in phonological space, optimized for each tag through a multi-objective genetic algorithm, together with effective training strategies, our model outperforms the state-of-the-art analyzers for Hindi and Urdu and establishes a new state-of-the-art accuracy on all seven tasks for both languages. MT-DMA is publicly accessible at http://35.154.251.44/.
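
The idea of optimizing several loss functions over shared parameters can be illustrated with a minimal sketch. It is not the MT-DMA architecture: the tag inventories, the two-dimensional shared vector, the linear heads, and every function name below are assumptions made for illustration, and lemma prediction (a sequence generation task) is left out. Each tag gets its own classifier head on top of one shared representation, and the training objective is a weighted sum of the per-tag cross-entropy losses.

```python
import math

# Illustrative tag inventories (assumed, not the label sets used in the paper).
TASKS = {
    "POS": ["NOUN", "VERB", "ADJ"],
    "Gender": ["m", "f", "any"],
    "Number": ["sg", "pl"],
}


def softmax(scores):
    """Convert raw scores into a probability distribution."""
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, bias, x):
    """Apply one task-specific linear head to the shared vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def multitask_loss(shared, heads, gold, task_weights=None):
    """Return the weighted sum of per-task cross-entropy losses.

    shared: representation produced once by the shared layers
    heads:  task name -> (weight matrix, bias vector)
    gold:   task name -> index of the correct label
    """
    task_weights = task_weights or {}
    per_task = {}
    total = 0.0
    for task, (weights, bias) in heads.items():
        probs = softmax(linear(weights, bias, shared))
        per_task[task] = -math.log(probs[gold[task]])
        total += task_weights.get(task, 1.0) * per_task[task]
    return total, per_task


# Hypothetical shared representation for one word in context.
shared = [0.5, -1.0]
# Zero-initialized heads, so every head starts from a uniform distribution.
heads = {
    task: ([[0.0, 0.0] for _ in labels], [0.0 for _ in labels])
    for task, labels in TASKS.items()
}
gold = {"POS": 0, "Gender": 1, "Number": 0}

total, per_task = multitask_loss(shared, heads, gold)
print(total, per_task)
```

Because the total is a single scalar, one backward pass through it would update both the task-specific heads and the shared layers, which is how training signal from one tag can help the others. The optional `task_weights` argument is a hypothetical knob for rebalancing tasks; the abstract does not say whether the paper weights its losses.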

research
09/06/2018

82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models

We present the Uppsala system for the CoNLL 2018 Shared Task on universa...
research
08/28/2018

What do character-level models learn about morphology? The case of dependency parsing

When parsing morphologically-rich languages with neural models, it is be...
research
04/11/2017

What do Neural Machine Translation Models Learn about Morphology?

Neural machine translation (MT) models obtain state-of-the-art performan...
research
06/07/2020

A Multitask Learning Approach for Diacritic Restoration

In many languages like Arabic, diacritics are used to specify pronunciat...
research
08/20/2017

LSTM Network for Inflected Abbreviation Expansion

In this paper, the problem of recovery of morphological information lost...
research
06/21/2016

Neural Morphological Tagging from Characters for Morphologically Rich Languages

This paper investigates neural character-based morphological tagging for...
research
03/08/2015

An Unsupervised Method for Uncovering Morphological Chains

Most state-of-the-art systems today produce morphological analysis based...
