End-to-end named entity extraction from speech

05/30/2018
by   Sahar Ghannay, et al.
0

Named entity recognition (NER) is among SLU tasks that usually extract semantic information from textual documents. Until now, NER from speech is made through a pipeline process that consists in processing first an automatic speech recognition (ASR) on the audio and then processing a NER on the ASR outputs. Such approach has some disadvantages (error propagation, metric to tune ASR systems sub-optimal in regards to the final task, reduced space search at the ASR output level...) and it is known that more integrated approaches outperform sequential ones, when they can be applied. In this paper, we present a first study of end-to-end approach that directly extracts named entities from speech, though a unique neural architecture. On a such way, a joint optimization is able for both ASR and NER. Experiments are carried on French data easily accessible, composed of data distributed in several evaluation campaign. Experimental results show that this end-to-end approach provides better results (F-measure=0.69 on test data) than a classical pipeline approach to detect named entity categories (F-measure=0.65).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2020

End-to-end Named Entity Recognition from English Speech

Named entity recognition (NER) from text has been a widely studied probl...
research
04/02/2022

End-to-end model for named entity recognition from speech without paired training data

Recent works showed that end-to-end neural approaches tend to become ver...
research
05/29/2023

Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation

End-to-end automatic speech recognition (E2E-ASR) has the potential to i...
research
04/26/2022

Named Entity Recognition for Audio De-Identification

Data anonymization is often a task carried out by humans. Automating it ...
research
10/21/2022

Joint Speech Translation and Named Entity Recognition

Modern automatic translation systems aim at place the human at the cente...
research
02/08/2022

A two-step approach to leverage contextual data: speech recognition in air-traffic communications

Automatic Speech Recognition (ASR), as the assistance of speech communic...
research
04/22/2021

Earnings-21: A Practical Benchmark for ASR in the Wild

Commonly used speech corpora inadequately challenge academic and commerc...

Please sign up or login with your details

Forgot password? Click here to reset