Very Fast Keyword Spotting System with Real Time Factor below 0.01

07/21/2020
by Jan Nouza, et al.

In this paper, we present an architecture for a keyword spotting (KWS) system that is based on modern neural networks, performs well on various types of speech data, and runs very fast. We focus mainly on the last aspect and propose optimizations for all the steps required in a KWS design: signal processing and likelihood computation, Viterbi decoding, spot candidate detection, and confidence calculation. We present time- and memory-efficient modelling with bidirectional feedforward sequential memory networks (an alternative to recurrent nets), using either standard triphones or so-called quasi-monophones, and an entirely forward decoding of speech frames (with minimal need for look-back). Several variants of the proposed scheme are evaluated on three large Czech datasets (broadcast, internet, and telephone; 17 hours in total), and their performance is compared using Detection Error Tradeoff (DET) diagrams and real-time (RT) factors. We demonstrate that the complete system can run in a single pass with an RT factor close to 0.001 if all optimizations (including a GPU for likelihood computation) are applied.
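To give a sense of scale for the RT factors quoted in the abstract, the sketch below uses the usual definition of the real-time factor: processing time divided by the duration of the audio processed. The function name and the timing values are illustrative assumptions, not measurements or code from the paper.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return processing time divided by audio duration (the RT factor)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


# 17 hours of audio, matching the total size of the three datasets.
audio_seconds = 17 * 3600

# Hypothetical processing budget implied by an RT factor of 0.001;
# this is arithmetic on the quoted figure, not a reported runtime.
budget_seconds = 0.001 * audio_seconds

print(f"Audio duration: {audio_seconds} s")
print(f"Budget at RT factor 0.001: {budget_seconds:.1f} s")
```

Under this definition, an RT factor close to 0.001 means the whole 17-hour collection could be searched in roughly one minute of processing time.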


research
11/26/2018

Fast Gaussian Process Occupancy Maps

In this paper, we demonstrate our work on Gaussian Process Occupancy Map...
research
05/23/2023

EfficientSpeech: An On-Device Text to Speech Model

State of the art (SOTA) neural text to speech (TTS) models can generate ...
research
10/14/2019

OmniTrack: Real-time detection and tracking of objects, text and logos in video

The automatic detection and tracking of general objects (like persons, a...
research
03/04/2018

Deep-FSMN for Large Vocabulary Continuous Speech Recognition

In this paper, we present an improved feedforward sequential memory netw...
research
04/21/2023

Small-footprint slimmable networks for keyword spotting

In this work, we present Slimmable Neural Networks applied to the proble...
research
11/06/2018

Hierarchical Neural Network Architecture In Keyword Spotting

Keyword Spotting (KWS) provides the start signal of ASR problem, and thu...
