wav2letter++: The Fastest Open-source Speech Recognition System

12/18/2018
by   Vineel Pratap, et al.
0

This paper introduces wav2letter++, the fastest open-source deep learning speech recognition framework. wav2letter++ is written entirely in C++, and uses the ArrayFire tensor library for maximum efficiency. Here we explain the architecture and design of the wav2letter++ system and compare it to other major open-source speech recognition systems. In some cases wav2letter++ is more than 2x faster than other optimized frameworks for training end-to-end neural networks for speech recognition. We also show that wav2letter++'s training times scale linearly to 64 GPUs, the highest we tested, for models with 100 million parameters. High-performance frameworks enable fast iteration, which is often a crucial factor in successful research and model tuning on new datasets and tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2020

KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition

We present KoSpeech, an open-source software, which is modular and exten...
research
03/30/2018

ESPnet: End-to-End Speech Processing Toolkit

This paper introduces a new open source platform for end-to-end speech p...
research
11/03/2019

Onssen: an open-source speech separation and enhancement library

Speech separation is an essential task for multi-talker speech recogniti...
research
09/30/2019

MIOpen: An Open Source Library For Deep Learning Primitives

Deep Learning has established itself to be a common occurrence in the bu...
research
07/25/2022

AMLB: an AutoML Benchmark

Comparing different AutoML frameworks is notoriously challenging and oft...
research
05/18/2023

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

This paper introduces FunASR, an open-source speech recognition toolkit ...
research
06/05/2018

LSTM Benchmarks for Deep Learning Frameworks

This study provides benchmarks for different implementations of LSTM uni...

Please sign up or login with your details

Forgot password? Click here to reset