An Evaluation of Neural Machine Translation Models on Historical Spelling Normalization

06/13/2018
by Gongbo Tang, et al.

In this paper, we apply different neural machine translation (NMT) models to the problem of historical spelling normalization for five languages: English, German, Hungarian, Icelandic, and Swedish. The models differ in their level of granularity (character or subword), their attention mechanisms, and their neural network architectures. Our results show that NMT models achieve a much lower character error rate than statistical machine translation (SMT) models. Vanilla RNNs are competitive with GRUs and LSTMs on this task. Transformer models perform better only when provided with more training data. We also find that subword-level models with a small subword vocabulary outperform character-level models. In addition, we propose a hybrid method that further improves normalization performance.
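
The abstract compares systems by character error rate without defining it. As a minimal sketch, assuming the common definition (total Levenshtein edit distance between predicted and reference spellings, divided by the total number of reference characters), the metric could be computed as follows. The function names and the example word pair are illustrative assumptions and are not taken from the paper, whose exact evaluation script may differ.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(predictions, references):
    """Assumed definition: summed edit distance / summed reference length."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have equal length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(p, r)
                      for p, r in zip(predictions, references))
    return total_edits / total_chars


# Hypothetical example: a system leaves the historical spelling "vnto"
# unchanged where the reference normalization is "unto".
cer = character_error_rate(["vnto"], ["unto"])
```

Under this definition, a lower value means the system output is closer to the reference normalization, and a value of 0 means every word was normalized exactly.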

research
08/08/2023

Character-level NMT and language similarity

We explore the effectiveness of character-level neural machine translati...
research
04/03/2019

A Large-Scale Comparison of Historical Text Normalization Systems

There is no consensus on the state-of-the-art approach to historical tex...
research
10/25/2016

Improving historical spelling normalization with bi-directional LSTMs and multi-task learning

Natural-language processing of historical documents is complicated by th...
research
04/07/2018

Evaluating historical text normalization systems: How well do they generalize?

We highlight several issues in the evaluation of historical text normali...
research
05/25/2020

Dialect Text Normalization to Normative Standard Finnish

We compare different LSTMs and transformer models in terms of their effe...
research
10/02/2018

Optimally Segmenting Inputs for NMT Shows Preference for Character-Level Processing

Most modern neural machine translation (NMT) systems rely on presegmente...
research
11/17/2019

The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction

Accurate streamflow prediction largely relies on historical records of b...
