Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting

10/18/2017
by   Raphael Tang, et al.
0

We describe Honk, an open-source PyTorch reimplementation of convolutional neural networks for keyword spotting that are included as examples in TensorFlow. These models are useful for recognizing "command triggers" in speech-based interfaces (e.g., "Hey Siri"), which serve as explicit cues for audio recordings of utterances that are sent to the cloud for full speech recognition. Evaluation on Google's recently released Speech Commands Dataset shows that our reimplementation is comparable in accuracy and provides a starting point for future work on the keyword spotting task.

READ FULL TEXT
research
10/28/2017

Deep Residual Learning for Small-Footprint Keyword Spotting

We explore the application of deep residual learning and dilated convolu...
research
07/11/2018

Efficient keyword spotting using time delay neural networks

This paper describes a novel method of live keyword spotting using a two...
research
04/14/2019

SpeechYOLO: Detection and Localization of Speech Objects

In this paper, we propose to apply object detection methods from the vis...
research
04/20/2020

ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric

Estimation of perceptual quality in audio and speech is possible using a...
research
03/10/2018

Speech Recognition: Keyword Spotting Through Image Recognition

The problem of identifying voice commands has always been a challenge du...
research
10/30/2018

JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis

Used for simple commands recognition on devices from smart routers to mo...
research
04/21/2023

Small-footprint slimmable networks for keyword spotting

In this work, we present Slimmable Neural Networks applied to the proble...

Please sign up or login with your details

Forgot password? Click here to reset