Developing a Multi-Platform Speech Recording System Toward Open Service of Building Large-Scale Speech Corpora

12/19/2019
by   Keita Ishizuka, et al.
0

This paper briefly reports our ongoing attempt at the development of a multi-platform browser-based speech recording system. We designed the system toward a service of providing open service of building large-scale speech corpora at a low-cost for any researchers and developers related to speech processing. The recent increase in the use of crowdsourcing services, e.g., Amazon Mechanical Turk, enable us to reduce the cost of collecting speakers in the web, and there have been many attempts to develop the automated speech collecting platforms or application that is designed for the use the crowdsourcing. However, one of the major problems in the previous studies and developments for the attempts is that most of the systems are not a form of common service of speech recording and corpus building, and each corpus builder is necessary to develop the system in their own environment including a web server. For this problem, we develope a new platform where both the corpus builders and recording participants can commonly use a single system and service by creating their user accounts. A brief introduction of the system is given in this paper as the start of this challenge.

READ FULL TEXT

page 1

page 2

page 3

research
08/07/2020

CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment

This paper describes the design and development of CUCHILD, a large-scal...
research
05/19/2022

SDS-200: A Swiss German Speech to Standard German Text Corpus

We present SDS-200, a corpus of Swiss German dialectal speech with Stand...
research
11/22/2018

Creating a contemporary corpus of similes in Serbian by using natural language processing

Simile is a figure of speech that compares two things through the use of...
research
04/06/2021

EasyCall corpus: a dysarthric speech dataset

This paper introduces a new dysarthric speech command dataset in Italian...
research
06/07/2023

RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction

The detection of shouted speech is crucial in audio surveillance and mon...
research
09/30/2019

DiPCo – Dinner Party Corpus

We present a speech data corpus that simulates a "dinner party" scenario...
research
04/24/2020

TeleCrowd: A Crowdsourcing Approach to Create Informal to Formal Text Corpora

Crowdsourcing has been widely used recently as an alternative to traditi...

Please sign up or login with your details

Forgot password? Click here to reset