-
Data Augmentation For Children's Speech Recognition – The "Ethiopian" System For The SLT 2021 Children Speech Recognition Challenge
This paper presents the "Ethiopian" system for the SLT 2021 Children Spe...
read it
-
Automatic recognition of child speech for robotic applications in noisy environments
Automatic speech recognition (ASR) allows a natural and intuitive interf...
read it
-
IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
The IEEE Spoken Language Technology Workshop (SLT) 2021 Alpha-mini Speec...
read it
-
Open Challenge for Correcting Errors of Speech Recognition Systems
The paper announces the new long-term challenge for improving the perfor...
read it
-
Analysis of Disfluency in Children's Speech
Disfluencies are prevalent in spontaneous speech, as shown in many studi...
read it
-
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
The variety of accents has posed a big challenge to speech recognition. ...
read it
-
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
In this paper, we describe the outcomes of the challenge organized and r...
read it
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines
Automatic speech recognition (ASR) has been significantly advanced with the use of deep learning and big data. However improving robustness, including achieving equally good performance on diverse speakers and accents, is still a challenging problem. In particular, the performance of children speech recognition (CSR) still lags behind due to 1) the speech and language characteristics of children's voice are substantially different from those of adults and 2) sizable open dataset for children speech is still not available in the research community. To address these problems, we launch the Children Speech Recognition Challenge (CSRC), as a flagship satellite event of IEEE SLT 2021 workshop. The challenge will release about 400 hours of Mandarin speech data for registered teams and set up two challenge tracks and provide a common testbed to benchmark the CSR performance. In this paper, we introduce the datasets, rules, evaluation method as well as baselines.
READ FULL TEXT
Comments
There are no comments yet.