Indowordnets help in Indian Language Machine Translation

10/05/2017
by   Sreelekha S, et al.
0

Being less resource languages, Indian-Indian and English-Indian language MT system developments faces the difficulty to translate various lexical phenomena. In this paper, we present our work on a comparative study of 440 phrase-based statistical trained models for 110 language pairs across 11 Indian languages. We have developed 110 baseline Statistical Machine Translation systems. Then we have augmented the training corpus with Indowordnet synset word entries of lexical database and further trained 110 models on top of the baseline system. We have done a detailed performance comparison using various evaluation metrics such as BLEU score, METEOR and TER. We observed significant improvement in evaluations of translation quality across all the 440 models after using the Indowordnet. These experiments give a detailed insight in two ways : (1) usage of lexical database with synset mapping for resource poor languages (2) efficient usage of Indowordnet sysnset mapping. More over, synset mapped lexical entries helped the SMT system to handle the ambiguity to a great extent during the translation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2017

Phrase Pair Mappings for Hindi-English Statistical Machine Translation

In this paper, we present our work on the creation of lexical resources ...
research
10/05/2017

Bilingual Words and Phrase Mappings for Marathi and Hindi SMT

Lack of proper linguistic resources is the major challenges faced by the...
research
03/04/2017

Lexical Resources for Hindi Marathi MT

In this paper we describe some ways to utilize various lexical resources...
research
01/02/2023

Statistical Machine Translation for Indic Languages

Machine Translation (MT) system generally aims at automatic representati...
research
04/06/2020

Building a Norwegian Lexical Resource for Medical Entity Recognition

We present a large Norwegian lexical resource of categorized medical ter...
research
08/03/2016

To Swap or Not to Swap? Exploiting Dependency Word Pairs for Reordering in Statistical Machine Translation

Reordering poses a major challenge in machine translation (MT) between t...
research
02/21/2019

Development of a classifiers/quantifiers dictionary towards French-Japanese MT

Although classifiers/quantifiers (CQs) expressions appear frequently in ...

Please sign up or login with your details

Forgot password? Click here to reset