On the Use of Machine Translation-Based Approaches for Vietnamese Diacritic Restoration

09/20/2017
by Thai Hoang Pham, et al.

This paper presents an empirical study of two machine translation-based approaches to the Vietnamese diacritic restoration problem: phrase-based and neural-based machine translation models. This is the first work to apply a neural-based machine translation method to this problem, and it gives a thorough comparison with the phrase-based machine translation method, which is the current state-of-the-art method for this problem. On a large dataset, the phrase-based approach has an accuracy of 97.32%, while that of the neural-based approach is 96.15%. Although the neural-based method is slightly less accurate, it is about twice as fast as the phrase-based method in terms of inference speed. Moreover, the neural-based machine translation method has much room for future improvement, such as incorporating pre-trained word embeddings and collecting more training data.
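Framing diacritic restoration as translation means treating text without diacritics as the source language and the original text as the target language. The sketch below is a minimal illustration of that framing, not the authors' pipeline: it builds a (source, target) training pair by stripping diacritics from a Vietnamese sentence and computes a token-level accuracy. The function names `strip_diacritics`, `make_parallel_pair`, and `token_accuracy` are hypothetical, and the accuracy definition is an assumption, since the abstract does not spell out how its metric is computed.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Remove Vietnamese diacritics, producing the 'source side' text."""
    # The letter d-with-stroke has no canonical decomposition, so map it by hand.
    text = text.replace("đ", "d").replace("Đ", "D")
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop every combining mark (tone marks, circumflex, breve, horn).
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", stripped)


def make_parallel_pair(sentence: str) -> tuple[str, str]:
    """Return (source, target): text without diacritics and the original text."""
    return strip_diacritics(sentence), sentence


def token_accuracy(predicted: str, reference: str) -> float:
    """Fraction of whitespace-separated tokens restored exactly.

    Assumed metric for illustration; the paper may define accuracy differently.
    """
    pred_tokens = unicodedata.normalize("NFC", predicted).split()
    ref_tokens = unicodedata.normalize("NFC", reference).split()
    if len(pred_tokens) != len(ref_tokens):
        raise ValueError("prediction and reference must have the same token count")
    if not ref_tokens:
        return 1.0
    correct = sum(p == r for p, r in zip(pred_tokens, ref_tokens))
    return correct / len(ref_tokens)


if __name__ == "__main__":
    source, target = make_parallel_pair("Tiếng Việt")
    print(source, "->", target)
```

Because restoration changes only the marks on each letter and leaves the words in place, source and target have the same number of tokens, which is why this sketch compares them position by position.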


