ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit

04/10/2023
by   Brian Yan, et al.
0

ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitated by the broadening interests of the spoken language translation community. ESPnet-ST-v2 supports 1) offline speech-to-text translation (ST), 2) simultaneous speech-to-text translation (SST), and 3) offline speech-to-speech translation (S2ST) – each task is supported with a wide variety of approaches, differentiating ESPnet-ST-v2 from other open source spoken language translation toolkits. This toolkit offers state-of-the-art architectures such as transducers, hybrid CTC/attention, multi-decoders with searchable intermediates, time-synchronous blockwise CTC/attention, Translatotron models, and direct discrete unit models. In this paper, we describe the overall design, example models for each task, and performance benchmarking behind ESPnet-ST-v2, which is publicly available at https://github.com/espnet/espnet.

READ FULL TEXT
research
11/29/2021

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

As Automatic Speech Processing (ASR) systems are getting better, there i...
research
12/18/2020

NeurST: Neural Speech Translation Toolkit

NeurST is an open-source toolkit for neural speech translation developed...
research
02/14/2023

TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments

The evidence is growing that machine and deep learning methods can learn...
research
10/11/2020

fairseq S2T: Fast Speech-to-Text Modeling with fairseq

We introduce fairseq S2T, a fairseq extension for speech-to-text (S2T) m...
research
10/24/2022

Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation

For real-life applications, it is crucial that end-to-end spoken languag...
research
10/31/2016

RNN Approaches to Text Normalization: A Challenge

This paper presents a challenge to the community: given a large corpus o...
research
09/20/2021

StreamSide: A Fully-Customizable Open-Source Toolkit for Efficient Annotation of Meaning Representations

This demonstration paper presents StreamSide, an open-source toolkit for...

Please sign up or login with your details

Forgot password? Click here to reset