-
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment
This paper describes the design and development of CUCHILD, a large-scal...
read it
-
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
This paper introduces a new speech corpus called "LibriTTS" designed for...
read it
-
FT Speech: Danish Parliament Speech Corpus
This paper introduces FT Speech, a new speech corpus created from the re...
read it
-
RadioTalk: a large-scale corpus of talk radio transcripts
We introduce RadioTalk, a corpus of speech recognition transcripts sampl...
read it
-
Examining a hate speech corpus for hate speech detection and popularity prediction
As research on hate speech becomes more and more relevant every day, mos...
read it
-
The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
This paper introduces the contents and the possible usage of the DIRHA-E...
read it
-
Pronunciation recognition of English phonemes /@/, /æ/, /A:/ and /2/ using Formants and Mel Frequency Cepstral Coefficients
The Vocal Joystick Vowel Corpus, by Washington University, was used to s...
read it
Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus
Speech recognition has received a less attention in Bengali literature due to the lack of a comprehensive dataset. In this paper, we describe the development process of the first comprehensive Bengali speech dataset on real numbers. It comprehends all the possible words that may arise in uttering any Bengali real number. The corpus has ten speakers from the different regions of Bengali native people. It comprises of more than two thousands of speech samples in a total duration of closed to four hours. We also provide a deep analysis of our corpus, highlight some of the notable features of it, and finally evaluate the performances of two of the notable Bengali speech recognizers on it.
READ FULL TEXT
Comments
There are no comments yet.