Inflection-Tolerant Ontology-Based Named Entity Recognition for Real-Time Applications

12/05/2018
by   Christian Jilek, et al.
0

A growing number of applications users daily interact with have to operate in (near) real-time: chatbots, digital companions, knowledge work support systems -- just to name a few. To perform the services desired by the user, these systems have to analyze user activity logs or explicit user input extremely fast. In particular, text content (e.g. in form of text snippets) needs to be processed in an information extraction task. Regarding the aforementioned temporal requirements, this has to be accomplished in just a few milliseconds, which limits the number of methods that can be applied. Practically, only very fast methods remain, which on the other hand deliver worse results than slower but more sophisticated Natural Language Processing (NLP) pipelines. In this paper, we investigate and propose methods for real-time capable Named Entity Recognition (NER). As a first improvement step we address are word variations induced by inflection, for example present in the German language. Our approach is ontology-based and makes use of several language information sources like Wiktionary. We evaluated it using the German Wikipedia (about 9.4B characters), for which the whole NER process took considerably less than an hour. Since precision and recall are higher than with comparably fast methods, we conclude that the quality gap between high speed methods and sophisticated NLP pipelines can be narrowed a bit more without losing too much runtime performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2023

German BERT Model for Legal Named Entity Recognition

The use of BERT, one of the most popular language models, has led to imp...
research
10/04/2021

Protagonists' Tagger in Literary Domain – New Datasets and a Method for Person Entity Linkage

Semantic annotation of long texts, such as novels, remains an open chall...
research
11/07/2018

microNER: A Micro-Service for German Named Entity Recognition based on BiLSTM-CRF

For named entity recognition (NER), bidirectional recurrent neural netwo...
research
06/28/2023

Sentence-to-Label Generation Framework for Multi-task Learning of Japanese Sentence Classification and Named Entity Recognition

Information extraction(IE) is a crucial subfield within natural language...
research
07/18/2023

Mutual Reinforcement Effects in Japanese Sentence Classification and Named Entity Recognition Tasks

Information extraction(IE) is a crucial subfield within natural language...
research
05/30/2023

Examining risks of racial biases in NLP tools for child protective services

Although much literature has established the presence of demographic bia...
research
12/23/2018

Water quality information dissemination at real-time in South Africa using language modelling

We present a conversational model to apprise users with limited access t...

Please sign up or login with your details

Forgot password? Click here to reset