A Rule-Based Approach For Aligning Japanese-Spanish Sentences From A Comparable Corpora

11/19/2012
by Jessica C. Ramírez, et al.

The performance of a Statistical Machine Translation (SMT) system is directly related to the quality and size of the parallel corpus it uses. However, for some language pairs such corpora are scarce. Our long-term goal is to construct a Japanese-Spanish parallel corpus for SMT, since useful Japanese-Spanish parallel corpora are lacking. To address this problem, in this study we propose a method for extracting Japanese-Spanish parallel sentences from Wikipedia using POS tagging and a rule-based approach. The approach focuses on the syntactic features of both languages. Human evaluation on a sample shows promising results in comparison with the baseline.


