Improving Named Entity Recognition in Tor Darknet with Local Distance Neighbor Feature

05/18/2020
by   Mhd Wesam Al-Nabki, et al.
0

Name entity recognition in noisy user-generated texts is a difficult task usually enhanced by incorporating an external resource of information, such as gazetteers. However, gazetteers are task-specific, and they are expensive to build and maintain. This paper adopts and improves the approach of Aguilar et al. by presenting a novel feature, called Local Distance Neighbor, which substitutes gazetteers. We tested the new approach on the W-NUT-2017 dataset, obtaining state-of-the-art results for the Group, Person and Product categories of Named Entities. Next, we added 851 manually labeled samples to the W-NUT-2017 dataset to account for named entities in the Tor Darknet related to weapons and drug selling. Finally, our proposal achieved an entity and surface F1 scores of 52.96 usefulness for Law Enforcement Agencies to detect named entities in the Tor hidden services.

READ FULL TEXT

page 1

page 2

research
04/08/2020

Entity-Switched Datasets: An Approach to Auditing the In-Domain Robustness of Named Entity Recognition Models

Named entity recognition systems perform well on standard datasets compr...
research
05/10/2023

PAI at SemEval-2023 Task 2: A Universal System for Named Entity Recognition with External Entity Information

The MultiCoNER II task aims to detect complex, ambiguous, and fine-grain...
research
10/29/2020

May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance

We investigate using Named Entity Recognition on a new type of user-gene...
research
07/02/2022

A Biomedical Pipeline to Detect Clinical and Non-Clinical Named Entities

There are a few challenges related to the task of biomedical named entit...
research
04/01/2019

Recognizing Musical Entities in User-generated Content

Recognizing Musical Entities is important for Music Information Retrieva...
research
09/27/2022

DAMO-NLP at NLPCC-2022 Task 2: Knowledge Enhanced Robust NER for Speech Entity Linking

Speech Entity Linking aims to recognize and disambiguate named entities ...
research
08/13/2022

Self-Contained Entity Discovery from Captioned Videos

This paper introduces the task of visual named entity discovery in video...

Please sign up or login with your details

Forgot password? Click here to reset