An Anchor-Free Detector for Continuous Speech Keyword Spotting

08/09/2022
by Zhiyuan Zhao, et al.
University of Technology Sydney
Microsoft

Continuous Speech Keyword Spotting (CSKWS) is the task of detecting predefined keywords in continuous speech. In this paper, we regard CSKWS as a one-dimensional object detection task and propose a novel anchor-free detector, named AF-KWS, to solve the problem. AF-KWS directly regresses the center locations and lengths of the keywords through a single-stage deep neural network. In particular, AF-KWS is tailored for this speech task: we introduce an auxiliary unknown class to separate other (non-keyword) words from non-speech or silent background. We have built two benchmark datasets for CSKWS: LibriTop-20 and the continuous meeting analysis keywords (CMAK) dataset. Evaluations on these two datasets show that our proposed AF-KWS outperforms reference schemes by a large margin, and therefore provides a decent baseline for future research.
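
To make the one-dimensional detection view concrete, the following is a minimal sketch of how per-frame center scores and length predictions could be decoded into keyword intervals. It is an illustration only, not the AF-KWS implementation: the function name `decode_detections`, the output layout (one score row per class, one length value per frame), the threshold, and the convention that the last class is the auxiliary unknown class are all assumptions that the abstract does not specify.

```python
from typing import List, Tuple

# (class index, start frame, end frame, score)
Detection = Tuple[int, float, float, float]


def decode_detections(
    heatmap: List[List[float]],
    lengths: List[float],
    threshold: float = 0.5,
    unknown_class: int = -1,
) -> List[Detection]:
    """Turn per-frame center scores and lengths into keyword intervals.

    heatmap[c][t] is the (assumed) center score of class c at frame t,
    lengths[t] is the (assumed) keyword length in frames predicted at t.
    """
    num_classes = len(heatmap)
    # Resolve a negative index for the hypothetical "unknown" class.
    skip = unknown_class % num_classes if num_classes else -1
    detections: List[Detection] = []
    for cls, scores in enumerate(heatmap):
        if cls == skip:
            continue  # unknown-class peaks are never reported as keywords
        for t, score in enumerate(scores):
            if score < threshold:
                continue
            left = scores[t - 1] if t > 0 else float("-inf")
            right = scores[t + 1] if t + 1 < len(scores) else float("-inf")
            # Keep local maxima only; strict on the right to break ties.
            if score >= left and score > right:
                half = lengths[t] / 2.0
                start = max(0.0, t - half)
                end = min(float(len(scores) - 1), t + half)
                detections.append((cls, start, end, score))
    return detections
```

Each reported interval is centered on a peak frame and extends half the predicted length to either side, clipped to the utterance boundaries. No anchor windows of predefined sizes are needed, which is the sense in which such a decoder is anchor-free.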


01/12/2019

Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data

Continuous Speech Keyword Spotting (CSKS) is the problem of spotting key...
05/18/2020

Metric Learning for Keyword Spotting

The goal of this work is to train effective representations for keyword ...
05/28/2021

Augmenting Anchors by the Detector Itself

It is difficult to determine the scale and aspect ratio of anchors for a...
12/31/2020

EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting

Keyword spotting is a process of finding some specific words or phrases ...
06/25/2018

Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

We use dynamic time warping (DTW) as supervision for training a convolut...
03/28/2022

Filler Word Detection and Classification: A Dataset and Benchmark

Filler words such as `uh' or `um' are sounds or words people use to sign...
11/01/2022

Metric Learning for User-defined Keyword Spotting

The goal of this work is to detect new spoken terms defined by users. Wh...