Keyword Spotting for Hearing Assistive Devices Robust to External Speakers

06/22/2019
by   Iván López-Espejo, et al.
0

Keyword spotting (KWS) is experiencing an upswing due to the pervasiveness of small electronic devices that allow interaction with them via speech. Often, KWS systems are speaker-independent, which means that any person -- user or not -- might trigger them. For applications like KWS for hearing assistive devices this is unacceptable, as only the user must be allowed to handle them. In this paper we propose KWS for hearing assistive devices that is robust to external speakers. A state-of-the-art deep residual network for small-footprint KWS is regarded as a basis to build upon. By following a multi-task learning scheme, this system is extended to jointly perform KWS and users' own-voice/external speaker detection with a negligible increase in the number of parameters. For experiments, we generate from the Google Speech Commands Dataset a speech corpus emulating hearing aids as a capturing device. Our results show that this multi-task deep residual network is able to achieve a KWS accuracy relative improvement of around 32 external speakers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2021

Multi-task Voice Activated Framework using Self-supervised Learning

Self-supervised learning methods such as wav2vec 2.0 have shown promisin...
research
06/28/2022

Personalized Keyword Spotting through Multi-task Learning

Keyword spotting (KWS) plays an essential role in enabling speech-based ...
research
01/26/2020

Multi-task Learning for Speaker Verification and Voice Trigger Detection

Automatic speech transcription and speaker recognition are usually treat...
research
06/08/2021

Broadcasted Residual Learning for Efficient Keyword Spotting

Keyword spotting is an important research field because it plays a key r...
research
10/14/2021

FedSpeech: Federated Text-to-Speech with Continual Learning

Federated learning enables collaborative training of machine learning mo...
research
05/08/2020

Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention

Keyword spotting (KWS) and speaker verification (SV) have been studied i...
research
05/30/2020

Exploring Filterbank Learning for Keyword Spotting

Despite their great performance over the years, handcrafted speech featu...

Please sign up or login with your details

Forgot password? Click here to reset