SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

04/18/2019
by Daniel S. Park, et al.

We present SpecAugment, a simple data augmentation method for speech recognition. SpecAugment is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients). The augmentation policy consists of warping the features, masking blocks of frequency channels, and masking blocks of time steps. We apply SpecAugment on Listen, Attend and Spell networks for end-to-end speech recognition tasks. We achieve state-of-the-art performance on the LibriSpeech 960h and Switchboard 300h tasks, outperforming all prior work. On LibriSpeech, we achieve 6.8% WER on test-other without the use of a language model, and 5.8% WER with shallow fusion with a language model. This compares to the previous state-of-the-art hybrid system of 7.5% WER. For Switchboard, we achieve 7.2% WER on the Switchboard portion of the Hub5'00 test set without the use of a language model, and 6.8% WER with shallow fusion, which compares to the previous state-of-the-art hybrid system at 8.3% WER.
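
The two masking steps of the policy described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the default mask widths and counts, the zero fill value, and the (time, frequency) array layout are all assumptions made for illustration. The warping step is omitted because it needs an interpolation routine that the abstract does not specify.

```python
import numpy as np


def mask_blocks(features, axis, max_width, num_masks, rng, fill_value=0.0):
    """Return a copy of `features` with contiguous blocks along `axis` overwritten.

    Hypothetical helper: each block's width is drawn uniformly from
    [0, max_width] and its start position uniformly from the valid range.
    """
    out = features.copy()
    size = out.shape[axis]
    for _ in range(num_masks):
        width = min(int(rng.integers(0, max_width + 1)), size)
        start = int(rng.integers(0, size - width + 1))
        index = [slice(None)] * out.ndim
        index[axis] = slice(start, start + width)
        out[tuple(index)] = fill_value
    return out


def spec_augment_sketch(features, max_freq_width=8, max_time_width=20,
                        num_freq_masks=1, num_time_masks=1, seed=None):
    """Apply frequency masking, then time masking, to a (time, frequency) array.

    All default values are illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    # Mask blocks of frequency channels (axis 1 under the assumed layout).
    out = mask_blocks(features, 1, max_freq_width, num_freq_masks, rng)
    # Mask blocks of time steps (axis 0 under the assumed layout).
    out = mask_blocks(out, 0, max_time_width, num_time_masks, rng)
    return out


if __name__ == "__main__":
    # Stand-in for 100 frames of 40 filter bank coefficients.
    fbank = np.random.default_rng(0).normal(size=(100, 40))
    augmented = spec_augment_sketch(fbank, seed=1)
    print(augmented.shape)
```

Because the masks are drawn at random on each call, a training loop would typically apply a function like this to every batch so that the network sees a differently masked copy of each utterance on every pass.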

research
04/08/2022

Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition

End-to-end models have achieved significant improvement on automatic spe...
research
11/20/2019

On Using SpecAugment for End-to-End Speech Translation

This work investigates a simple data augmentation technique, SpecAugment...
research
03/18/2021

TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation

Automatic augmentation methods have recently become a crucial pillar for...
research
12/22/2019

power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition

In this paper, we describe the Maximum Uniformity of Distribution (MUD) ...
research
10/08/2020

Population Based Training for Data Augmentation and Regularization in Speech Recognition

Varying data augmentation policies and regularization over the course of...
research
10/04/2021

WaveBeat: End-to-end beat and downbeat tracking in the time domain

Deep learning approaches for beat and downbeat tracking have brought adv...
research
07/22/2021

CarneliNet: Neural Mixture Model for Automatic Speech Recognition

End-to-end automatic speech recognition systems have achieved great accu...
