Supervised Grapheme-to-Phoneme Conversion of Orthographic Schwas in Hindi and Punjabi

04/22/2020
by   Aryaman Arora, et al.
0

Hindi grapheme-to-phoneme (G2P) conversion is mostly trivial, with one exception: whether a schwa represented in the orthography is pronounced or unpronounced (deleted). Previous work has attempted to predict schwa deletion in a rule-based fashion using prosodic or phonetic analysis. We present the first statistical schwa deletion classifier for Hindi, which relies solely on the orthography as the input and outperforms previous approaches. We trained our model on a newly-compiled pronunciation lexicon extracted from various online dictionaries. Our best Hindi model achieves state of the art performance, and also achieves good performance on a closely related language, Punjabi, without modification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2022

Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models

Neural network models have achieved state-of-the-art performance on grap...
research
02/08/2021

Constructios of l-adic t-deletion-correcting quantum codes

We propose two systematic constructions of deletion-correcting codes for...
research
04/03/2018

Graph2Seq: Graph to Sequence Learning with Attention-based Neural Networks

Celebrated Sequence to Sequence learning (Seq2Seq) and its fruitful vari...
research
10/29/2020

Multiple Sclerosis Severity Classification From Clinical Text

Multiple Sclerosis (MS) is a chronic, inflammatory and degenerative neur...
research
03/28/2020

BiLingUNet: Image Segmentation by Modulating Top-Down and Bottom-Up Visual Processing with Referring Expressions

We present BiLingUNet, a state-of-the-art model for image segmentation u...
research
05/04/2017

A Finite State and Rule-based Akshara to Prosodeme (A2P) Converter in Hindi

This article describes a software module called Akshara to Prosodeme (A2...
research
12/30/2019

"Hinglish" Language – Modeling a Messy Code-Mixed Language

With a sharp rise in fluency and users of "Hinglish" in linguistically d...

Please sign up or login with your details

Forgot password? Click here to reset