WSRNet: Joint Spotting and Recognition of Handwritten Words

08/17/2020
by   George Retsinas, et al.
0

In this work, we present a unified model that can handle both Keyword Spotting and Word Recognition with the same network architecture. The proposed network is comprised of a non-recurrent CTC branch and a Seq2Seq branch that is further augmented with an Autoencoding module. The related joint loss leads to a boost in recognition performance, while the Seq2Seq branch is used to create efficient word representations. We show how to further process these representations with binarization and a retraining scheme to provide compact and highly efficient descriptors, suitable for keyword spotting. Numerical results validate the usefulness of the proposed architecture, as our method outperforms the previous state-of-the-art in keyword spotting, and provides results in the ballpark of the leading methods for word recognition.

READ FULL TEXT
research
09/02/2020

Seeing wake words: Audio-visual Keyword Spotting

The goal of this work is to automatically determine whether and when a w...
research
12/31/2020

EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting

Keyword spotting is a process of finding some specific words or phrases ...
research
03/31/2022

Learning Decoupling Features Through Orthogonality Regularization

Keyword spotting (KWS) and speaker verification (SV) are two important t...
research
06/30/2021

An Integrated Framework for Two-pass Personalized Voice Trigger

In this paper, we present the XMUSPEECH system for Task 1 of 2020 Person...
research
11/06/2018

Hierarchical Neural Network Architecture In Keyword Spotting

Keyword Spotting (KWS) provides the start signal of ASR problem, and thu...
research
10/30/2019

Temporal Feedback Convolutional Recurrent Neural Networks for Keyword Spotting

While end-to-end learning has become a trend in deep learning, the model...

Please sign up or login with your details

Forgot password? Click here to reset