Named Entity Detection and Injection for Direct Speech Translation

10/21/2022
by   Marco Gaido, et al.
1

In a sentence, certain words are critical for its semantic. Among them, named entities (NEs) are notoriously challenging for neural models. Despite their importance, their accurate handling has been neglected in speech-to-text (S2T) translation research, and recent work has shown that S2T models perform poorly for locations and notably person names, whose spelling is challenging unless known in advance. In this work, we explore how to leverage dictionaries of NEs known to likely appear in a given context to improve S2T model outputs. Our experiments show that we can reliably detect NEs likely present in an utterance starting from S2T encoder outputs. Indeed, we demonstrate that the current detection quality is sufficient to improve NE accuracy in the translation with a 31

READ FULL TEXT
research
05/13/2022

Who Are We Talking About? Handling Person Names in Speech Translation

Recent work has shown that systems for speech translation (ST) – similar...
research
10/21/2022

Joint Speech Translation and Named Entity Recognition

Modern automatic translation systems aim at place the human at the cente...
research
05/12/2023

Improving the Quality of Neural Machine Translation Through Proper Translation of Name Entities

In this paper, we have shown a method of improving the quality of neural...
research
09/15/2021

Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation

Automatic translation systems are known to struggle with rare words. Amo...
research
05/29/2023

Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation

End-to-end automatic speech recognition (E2E-ASR) has the potential to i...
research
07/01/2022

Multi-features based Semantic Augmentation Networks for Named Entity Recognition in Threat Intelligence

Extracting cybersecurity entities such as attackers and vulnerabilities ...
research
03/01/2023

DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction

Personal Digital Assistants (PDAs) - such as Siri, Alexa and Google Assi...

Please sign up or login with your details

Forgot password? Click here to reset