Automatic Measurement of Pre-aspiration

04/05/2017
by   Yaniv Sheena, et al.
0

Pre-aspiration is defined as the period of glottal friction occurring in sequences of vocalic/consonantal sonorants and phonetically voiceless obstruents. We propose two machine learning methods for automatic measurement of pre-aspiration duration: a feedforward neural network, which works at the frame level; and a structured prediction model, which relies on manually designed feature functions, and works at the segment level. The input for both algorithms is a speech signal of an arbitrary length containing a single obstruent, and the output is a pair of times which constitutes the pre-aspiration boundaries. We train both models on a set of manually annotated examples. Results suggest that the structured model is superior to the frame-based model as it yields higher accuracy in predicting the boundaries and generalizes to new speakers and new languages. Finally, we demonstrate the applicability of our structured prediction algorithm by replicating linguistic analysis of pre-aspiration in Aberystwyth English with high correlation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2016

Automatic measurement of vowel duration via structured prediction

A key barrier to making phonetic studies scalable and replicable is the ...
research
11/01/2022

Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features

Current state-of-the-art methods for automatic synthetic speech evaluati...
research
03/22/2018

Structured Output Learning with Abstention: Application to Accurate Opinion Prediction

Motivated by Supervised Opinion Analysis, we propose a novel framework d...
research
10/27/2019

Dr.VOT : Measuring Positive and Negative Voice Onset Time in the Wild

Voice Onset Time (VOT), a key measurement of speech for basic research a...
research
10/25/2016

Sequence Segmentation Using Joint RNN and Structured Prediction Models

We describe and analyze a simple and effective algorithm for sequence se...
research
04/02/2021

HMM-Free Encoder Pre-Training for Streaming RNN Transducer

This work describes an encoder pre-training procedure using frame-wise l...
research
07/23/2019

A system for efficient 3D printed stop-motion face animation

Computer animation in conjunction with 3D printing has the potential to ...

Please sign up or login with your details

Forgot password? Click here to reset