Exploring Spoken Named Entity Recognition: A Cross-Lingual Perspective

07/03/2023
by   Moncef Benaicha, et al.
0

Recent advancements in Named Entity Recognition (NER) have significantly improved the identification of entities in textual data. However, spoken NER, a specialized field of spoken document retrieval, lags behind due to its limited research and scarce datasets. Moreover, cross-lingual transfer learning in spoken NER has remained unexplored. This paper utilizes transfer learning across Dutch, English, and German using pipeline and End-to-End (E2E) schemes. We employ Wav2Vec2-XLS-R models on custom pseudo-annotated datasets and investigate several architectures for the adaptability of cross-lingual systems. Our results demonstrate that End-to-End spoken NER outperforms pipeline-based alternatives over our limited annotations. Notably, transfer learning from German to Dutch surpasses the Dutch E2E system by 7 Dutch pipeline system by 4 transfer learning in spoken NER but also sets promising outcomes for future evaluations, hinting at the need for comprehensive data collection to augment the results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2020

Neural Cross-Lingual Transfer and Limited Annotated Data for Named Entity Recognition in Danish

Named Entity Recognition (NER) has greatly advanced by the introduction ...
research
09/09/2019

What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis

Building named entity recognition (NER) models for languages that do not...
research
11/22/2019

Zero-Resource Cross-Lingual Named Entity Recognition

Recently, neural methods have achieved state-of-the-art (SOTA) results i...
research
05/24/2021

DaN+: Danish Nested Named Entities and Lexical Normalization

This paper introduces DaN+, a new multi-domain corpus and annotation gui...
research
12/14/2021

On the Use of External Data for Spoken Named Entity Recognition

Spoken language understanding (SLU) tasks involve mapping from speech au...
research
12/13/2018

Dynamic Transfer Learning for Named Entity Recognition

State-of-the-art named entity recognition (NER) systems have been improv...
research
10/08/2018

Cross Script Hindi English NER Corpus from Wikipedia

The text generated on social media platforms is essentially a mixed ling...

Please sign up or login with your details

Forgot password? Click here to reset