GIPFA: Generating IPA Pronunciation from Audio

06/13/2020
by   Xavier Marjou, et al.
0

Transcribing spoken audio samples into International Phonetic Alphabet (IPA) has long been reserved for experts. In this study, we instead examined the use of an Artificial Neural Network (ANN) model to automatically extract the IPA pronunciation of a word based on its audio pronunciation, hence its name Generating IPA Pronunciation From Audio (GIPFA). Based on the French Wikimedia dictionary, we trained our model which then correctly predicted 75 pronunciations tested. Interestingly, by studying inference errors, the model made it possible to highlight possible errors in the dataset as well as identifying the closest phonemes in French.

READ FULL TEXT
research
12/31/2019

OTEANN: Estimating the Transparency of Orthographies with an Artificial Neural Network

To transcribe spoken language to written medium, most alphabets enable a...
research
12/17/2018

Persian Vowel recognition with MFCC and ANN on PCVC speech dataset

In this paper a new method for recognition of consonant-vowel phonemes c...
research
12/22/2016

SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

In this paper we propose a novel model for unconditional audio generatio...
research
09/12/2016

WaveNet: A Generative Model for Raw Audio

This paper introduces WaveNet, a deep neural network for generating raw ...
research
03/30/2022

Generative Spoken Dialogue Language Modeling

We introduce dGSLM, the first "textless" model able to generate audio sa...
research
06/11/2021

Catch-A-Waveform: Learning to Generate Audio from a Single Short Example

Models for audio generation are typically trained on hours of recordings...
research
05/10/2020

Audio and Contact Microphones for Cough Detection

In the framework of assessing the pathology severity in chronic cough di...

Please sign up or login with your details

Forgot password? Click here to reset