Learning to Pronounce Chinese Without a Pronunciation Dictionary

10/09/2020
by Christopher Chu, et al.

We demonstrate a program that learns to pronounce Chinese text in Mandarin without a pronunciation dictionary. From non-parallel streams of Chinese characters and Chinese pinyin syllables, it establishes a many-to-many mapping between characters and pronunciations. Using unsupervised methods, the program effectively deciphers writing into speech. Its token-level character-to-syllable accuracy is 89%, which exceeds the accuracy of prior work.
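To make the reported metric concrete, here is a minimal sketch of how a token-level character-to-syllable accuracy could be computed. It is not the authors' evaluation code. The function name `token_accuracy`, the toy data, and the simplification of a single predicted syllable per character (the paper describes a many-to-many mapping) are all assumptions made for illustration.

```python
from typing import Dict, List


def token_accuracy(chars: List[str], gold: List[str], mapping: Dict[str, str]) -> float:
    """Fraction of character tokens whose predicted syllable matches the gold syllable.

    Hypothetical helper: `mapping` assigns one syllable per character, which
    simplifies the many-to-many mapping described in the abstract.
    """
    if len(chars) != len(gold):
        raise ValueError("chars and gold must have the same length")
    if not chars:
        return 0.0
    # Count tokens where the learned mapping agrees with the reference pinyin.
    correct = sum(1 for c, g in zip(chars, gold) if mapping.get(c) == g)
    return correct / len(chars)


# Toy example (invented data): two of the three tokens are mapped correctly.
chars = ["中", "国", "人"]
gold = ["zhong", "guo", "ren"]
learned = {"中": "zhong", "国": "guo", "人": "min"}
score = token_accuracy(chars, gold, learned)
```

Because the score is computed per token rather than per character type, frequent characters weigh more heavily than rare ones.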


