The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

10/06/2017
by   Mirco Ravanelli, et al.
0

This paper introduces the contents and the possible usage of the DIRHA-ENGLISH multi-microphone corpus, recently realized under the EC DIRHA project. The reference scenario is a domestic environment equipped with a large number of microphones and microphone arrays distributed in space. The corpus is composed of both real and simulated material, and it includes 12 US and 12 UK English native speakers. Each speaker uttered different sets of phonetically-rich sentences, newspaper articles, conversational speech, keywords, and commands. From this material, a large set of 1-minute sequences was generated, which also includes typical domestic background noise as well as inter/intra-room reverberation effects. Dev and test sets were derived, which represent a very precious material for different studies on multi-microphone speech processing and distant-speech recognition. Various tasks and corresponding Kaldi recipes have already been developed. The paper reports a first set of baseline results obtained using different techniques, including Deep Neural Networks (DNN), aligned with the state-of-the-art at international level.

READ FULL TEXT
research
04/20/2020

CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges...
research
04/13/2018

Voices Obscured in Complex Environmental Settings (VOICES) corpus

This paper introduces the Voices Obscured In Complex Environmental Setti...
research
04/03/2021

speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

This paper introduces a new open-source speech corpus named "speechocean...
research
11/26/2017

Realistic multi-microphone data simulation for distant speech recognition

The availability of realistic simulated corpora is of key importance for...
research
09/25/2018

Non-native children speech recognition through transfer learning

This work deals with non-native children's speech and investigates both ...
research
02/23/2017

Pronunciation recognition of English phonemes /@/, /æ/, /A:/ and /2/ using Formants and Mel Frequency Cepstral Coefficients

The Vocal Joystick Vowel Corpus, by Washington University, was used to s...
research
01/30/2017

Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus

Automatic speech recognition (ASR) and Text to speech (TTS) are two prom...

Please sign up or login with your details

Forgot password? Click here to reset