
-
Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition
Automatic speech recognition (ASR) systems for young children are needed...
-
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR
We present a bidirectional unsupervised model pre-training (UPT) method ...
-
Analysis of Disfluency in Children's Speech
Disfluencies are prevalent in spontaneous speech, as shown in many studi...
-
Speaker discrimination in humans and machines: Effects of speaking style variability
Does speaking style variation affect humans' ability to distinguish indi...
-
Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
The effects of speaking-style variability on automatic speaker verificat...
-
Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
In this paper, we propose a novel way of addressing text-dependent autom...
-
Glottal Source Processing: from Analysis to Applications
The great majority of current voice technology applications relies on ac...
-
Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics
This paper focuses on the problem of pitch tracking in noisy conditions....
-
Deep neural network based i-vector mapping for speaker verification using short utterances
Text-independent speaker recognition using short utterances is a highly ...