Efficient Convolutional Neural Networks for Diacritic Restoration

12/14/2019
by   Sawsan Alqahtani, et al.
0

Diacritic restoration has gained importance with the growing need for machines to understand written texts. The task is typically modeled as a sequence labeling problem and currently Bidirectional Long Short Term Memory (BiLSTM) models provide state-of-the-art results. Recently, Bai et al. (2018) show the advantages of Temporal Convolutional Neural Networks (TCN) over Recurrent Neural Networks (RNN) for sequence modeling in terms of performance and computational resources. As diacritic restoration benefits from both previous as well as subsequent timesteps, we further apply and evaluate a variant of TCN, Acausal TCN (A-TCN), which incorporates context from both directions (previous and future) rather than strictly incorporating previous context as in the case of TCN. A-TCN yields significant improvement over TCN for diacritization in three different languages: Arabic, Yoruba, and Vietnamese. Furthermore, A-TCN and BiLSTM have comparable performance, making A-TCN an efficient alternative over BiLSTM since convolutions can be trained in parallel. A-TCN is significantly faster than BiLSTM at inference time (270

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/06/2020

Romanian Diacritics Restoration Using Recurrent Neural Networks

Diacritics restoration is a mandatory step for adequately processing Rom...
research
07/23/2020

Deep Learning based, end-to-end metaphor detection in Greek language with Recurrent and Convolutional Neural Networks

This paper presents and benchmarks a number of end-to-end Deep Learning ...
research
01/18/2022

Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration

Diacritics restoration has become a ubiquitous task in the Latin-alphabe...
research
06/07/2020

A Multitask Learning Approach for Diacritic Restoration

In many languages like Arabic, diacritics are used to specify pronunciat...
research
04/23/2021

3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces

Silent speech interfaces (SSI) aim to reconstruct the speech signal from...
research
03/04/2020

Restoration of Fragmentary Babylonian Texts Using Recurrent Neural Networks

The main source of information regarding ancient Mesopotamian history an...
research
02/28/2019

No Padding Please: Efficient Neural Handwriting Recognition

Neural handwriting recognition (NHR) is the recognition of handwritten t...

Please sign up or login with your details

Forgot password? Click here to reset