Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

09/14/2023
by Peng Wang, et al.

Despite the strides made by end-to-end (E2E) models in speech recognition in recent years, recognizing named entities remains challenging yet critical for semantic understanding. To improve named entity recognition in E2E models, previous studies have mainly focused on rule-based or attention-based contextual biasing algorithms. However, their performance can be sensitive to the biasing weight or degraded by excessive attention to the named entity list, and they carry a risk of false triggering. Inspired by the success of the class-based language model (LM) for named entity recognition in conventional hybrid systems, and by the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose a novel E2E model that incorporates class-based LMs into FNT, which we refer to as C-FNT. In C-FNT, the language model score of a named entity can be associated with its name class instead of its surface form. Experimental results show that the proposed C-FNT significantly reduces errors on named entities without hurting performance on general word recognition.
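
To make the idea of scoring a name by its class rather than its surface form concrete, here is a minimal sketch of the standard class-based LM factorization, P(word | history) = P(class | history) × P(word | class). It is not the C-FNT model itself, whose details are not given in this abstract. The class tag `<NAME>`, the bigram table, the entity list, and all probability values are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical class tag that stands in for every contact name in the LM.
CLASS_TAG = "<NAME>"

# Toy bigram LM over class-tagged text: P(token | previous token).
# The values are invented for illustration, not taken from the paper.
BIGRAM = {
    ("call", CLASS_TAG): 0.5,
    ("call", "me"): 0.3,
}

# P(surface form | class): uniform over a hypothetical entity list.
NAME_LIST = ["alice", "bob", "carol", "dave"]
IN_CLASS = {name: 1.0 / len(NAME_LIST) for name in NAME_LIST}


def log_prob(prev: str, token: str) -> float:
    """Log-probability of `token` following `prev` under the class-based LM."""
    if token in IN_CLASS:
        # Named entity: score the class in context, then the member within the class.
        return math.log(BIGRAM[(prev, CLASS_TAG)]) + math.log(IN_CLASS[token])
    # Ordinary word: plain bigram score.
    return math.log(BIGRAM[(prev, token)])


if __name__ == "__main__":
    for word in ("alice", "bob", "me"):
        print(word, round(math.exp(log_prob("call", word)), 4))
```

In this sketch every name in the list receives the same contextual score after "call", because the context only ever sees the class tag. Adding a new name changes only the in-class distribution, not the contextual LM.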


