PAMPO: using pattern matching and pos-tagging for effective Named Entities recognition in Portuguese

12/30/2016
by   Conceição Rocha, et al.
0

This paper deals with the entity extraction task (named entity recognition) of a text mining process that aims at unveiling non-trivial semantic structures, such as relationships and interaction between entities or communities. In this paper we present a simple and efficient named entity extraction algorithm. The method, named PAMPO (PAttern Matching and POs tagging based algorithm for NER), relies on flexible pattern matching, part-of-speech tagging and lexical-based rules. It was developed to process texts written in Portuguese, however it is potentially applicable to other languages as well. We compare our approach with current alternatives that support Named Entity Recognition (NER) for content written in Portuguese. These are Alchemy, Zemanta and Rembrandt. Evaluation of the efficacy of the entity extraction method on several texts written in Portuguese indicates a considerable improvement on recall and F_1 measures.

READ FULL TEXT
research
05/16/2017

NeuroNER: an easy-to-use program for named-entity recognition based on neural networks

Named-entity recognition (NER) aims at identifying entities of interest ...
research
06/20/2020

Named Entity Extraction with Finite State Transducers

We describe a named entity tagging system that requires minimal linguist...
research
03/13/2022

ProtagonistTagger – a Tool for Entity Linkage of Persons in Texts from Various Languages and Domains

Named entities recognition (NER) and disambiguation (NED) can add semant...
research
11/09/2016

Old Content and Modern Tools - Searching Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910

Named Entity Recognition (NER), search, classification and tagging of na...
research
04/25/2020

Towards Discourse Parsing-inspired Semantic Storytelling

Previous work of ours on Semantic Storytelling uses text analytics proce...
research
06/01/2021

Discontinuous Named Entity Recognition as Maximal Clique Discovery

Named entity recognition (NER) remains challenging when entity mentions ...

Please sign up or login with your details

Forgot password? Click here to reset