SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain

01/08/2023
by   Heli Qi, et al.
0

This paper introduces SpeeChain, an open-source Pytorch-based toolkit designed to develop the machine speech chain for large-scale use. This first release focuses on the TTS-to-ASR chain, a core component of the machine speech chain, that refers to the TTS data augmentation by unspoken text for ASR. To build an efficient pipeline for the large-scale TTS-to-ASR chain, we implement easy-to-use multi-GPU batch-level model inference, multi-dataloader batch generation, and on-the-fly data selection techniques. In this paper, we first explain the overall procedure of the TTS-to-ASR chain and the difficulties of each step. Then, we present a detailed ablation study on different types of unlabeled data, data filtering thresholds, batch composition, and real-synthetic data ratios. Our experimental results on train_clean_460 of LibriSpeech demonstrate that our TTS-to-ASR chain can significantly improve WER in a semi-supervised setting.

READ FULL TEXT
research
10/24/2019

ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit

This paper introduces a new end-to-end text-to-speech (E2E-TTS) toolkit ...
research
06/03/2019

From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning

The most common way for humans to communicate is by speech. But perhaps ...
research
11/04/2020

Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework

Previous research has proposed a machine speech chain to enable automati...
research
03/13/2023

The System Description of dun_oscar team for The ICPR MSR Challenge

This paper introduces the system submitted by dun_oscar team for the ICP...
research
07/16/2017

Listening while Speaking: Speech Chain by Deep Learning

Despite the close relationship between speech perception and production,...
research
12/20/2022

Efficient L2 Batch Posting Strategy on L1

We design efficient algorithms for the batch posting of Layer 2 chain ca...
research
06/08/2016

DefExt: A Semi Supervised Definition Extraction Tool

We present DefExt, an easy to use semi supervised Definition Extraction ...

Please sign up or login with your details

Forgot password? Click here to reset