DiPCo – Dinner Party Corpus

09/30/2019
by   Maarten Van Segbroeck, et al.
0

We present a speech data corpus that simulates a "dinner party" scenario taking place in an everyday home environment. The corpus was created by recording multiple groups of four Amazon employee volunteers having a natural conversation in English around a dining table. The participants were recorded by a single-channel close-talk microphone and by five far-field 7-microphone array devices positioned at different locations in the recording room. The dataset contains the audio recordings and human labeled transcripts of a total of 10 sessions with a duration between 15 and 45 minutes. The corpus was created to advance in the field of noise robust and distant speech processing and is intended to serve as a public research and benchmarking data set.

READ FULL TEXT

page 1

page 2

page 3

research
04/13/2018

Voices Obscured in Complex Environmental Settings (VOICES) corpus

This paper introduces the Voices Obscured In Complex Environmental Setti...
research
05/25/2020

FT Speech: Danish Parliament Speech Corpus

This paper introduces FT Speech, a new speech corpus created from the re...
research
08/21/2023

LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices

We present LibriWASN, a data set whose design follows closely the LibriC...
research
05/22/2023

EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels

The increasing adoption of text-to-speech technologies has led to a grow...
research
02/14/2023

Detecting human and non-human vocal productions in large scale audio recordings

We propose an automatic data processing pipeline to extract vocal produc...
research
04/29/2020

Robust Phonetic Segmentation Using Spectral Transition measure for Non-Standard Recording Environments

Phone level localization of mis-articulation is a key requirement for an...
research
12/19/2019

Developing a Multi-Platform Speech Recording System Toward Open Service of Building Large-Scale Speech Corpora

This paper briefly reports our ongoing attempt at the development of a m...

Please sign up or login with your details

Forgot password? Click here to reset