Efficient keyword spotting using time delay neural networks

07/11/2018
by   Samuel Myer, et al.
0

This paper describes a novel method of live keyword spotting using a two-stage time delay neural network. The model is trained using transfer learning: initial training with phone targets from a large speech corpus is followed by training with keyword targets from a smaller data set. The accuracy of the system is evaluated on two separate tasks. The first is the freely available Google Speech Commands dataset. The second is an in-house task specifically developed for keyword spotting. The results show significant improvements in false accept and false reject rates in both clean and noisy environments when compared with previously known techniques. Furthermore, we investigate various techniques to reduce computation in terms of multiplications per second of audio. Compared to recently published work, the proposed system provides up to 89

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2017

Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting

We describe Honk, an open-source PyTorch reimplementation of convolution...
research
04/06/2023

To Wake-up or Not to Wake-up: Reducing Keyword False Alarm by Successive Refinement

Keyword spotting systems continuously process audio streams to detect ke...
research
10/05/2017

Semantic keyword spotting by learning from images and speech

We consider the problem of representing semantic concepts in speech by l...
research
11/01/2018

End-to-end Models with auditory attention in Multi-channel Keyword Spotting

In this paper, we propose an attention-based end-to-end model for multi-...
research
08/04/2023

N-gram Boosting: Improving Contextual Biasing with Normalized N-gram Targets

Accurate transcription of proper names and technical terms is particular...
research
05/24/2022

Boosting Tail Neural Network for Realtime Custom Keyword Spotting

In this paper, we propose a Boosting Tail Neural Network (BTNN) for impr...
research
09/23/2022

UniKW-AT: Unified Keyword Spotting and Audio Tagging

Within the audio research community and the industry, keyword spotting (...

Please sign up or login with your details

Forgot password? Click here to reset