End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

08/15/2023
by   Bolaji Yusuf, et al.
0

Conventional keyword search systems operate on automatic speech recognition (ASR) outputs, which causes them to have a complex indexing and search pipeline. This has led to interest in ASR-free approaches to simplify the search procedure. We recently proposed a neural ASR-free keyword search model which achieves competitive performance while maintaining an efficient and simplified pipeline, where queries and documents are encoded with a pair of recurrent neural network encoders and the encodings are combined with a dot-product. In this article, we extend this work with multilingual pretraining and detailed analysis of the model. Our experiments show that the proposed multilingual training significantly improves the model performance and that despite not matching a strong ASR-based conventional keyword search system for short queries and queries comprising in-vocabulary words, the proposed model outperforms the ASR-based system for long queries and queries that do not appear in the training data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/13/2017

End-to-End ASR-free Keyword Search from Speech

End-to-end (E2E) systems have achieved competitive results compared to c...
research
07/23/2018

Zero-shot keyword spotting for visual speech recognition in-the-wild

Visual keyword spotting (KWS) is the problem of estimating whether a tex...
research
08/23/2021

End-to-End Open Vocabulary Keyword Search

Recently, neural approaches to spoken content retrieval have become popu...
research
03/28/2022

Filler Word Detection and Classification: A Dataset and Benchmark

Filler words such as `uh' or `um' are sounds or words people use to sign...
research
05/01/2023

Contextual Multilingual Spellchecker for User Queries

Spellchecking is one of the most fundamental and widely used search feat...
research
12/03/2021

BBS-KWS:The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge

This paper introduces the system submitted by the Yidun NISP team to the...
research
10/27/2019

Induced Inflection-Set Keyword Search in Speech

We investigate the problem of searching for a lexeme-set in speech by se...

Please sign up or login with your details

Forgot password? Click here to reset