PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

05/20/2022
by   Hui Zhang, et al.
0

PaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly avaiable at https://github.com/PaddlePaddle/PaddleSpeech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2021

SpeechBrain: A General-Purpose Speech Toolkit

SpeechBrain is an open-source and all-in-one speech toolkit. It is desig...
research
05/04/2020

ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents

We present ADVISER - an open-source, multi-domain dialog system toolkit ...
research
04/07/2021

EXPATS: A Toolkit for Explainable Automated Text Scoring

Automated text scoring (ATS) tasks, such as automated essay scoring and ...
research
10/21/2020

NeuSpell: A Neural Spelling Correction Toolkit

We introduce NeuSpell, an open-source toolkit for spelling correction in...
research
12/18/2020

NeurST: Neural Speech Translation Toolkit

NeurST is an open-source toolkit for neural speech translation developed...
research
10/15/2021

ESPnet2-TTS: Extending the Edge of TTS Research

This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS)...
research
12/10/2021

Shennong: a Python toolbox for audio speech features extraction

We introduce Shennong, a Python toolbox and command-line utility for spe...

Please sign up or login with your details

Forgot password? Click here to reset