USFD: Twitter NER with Drift Compensation and Linked Data

11/10/2015
by   Leon Derczynski, et al.
0

This paper describes a pilot NER system for Twitter, comprising the USFD system entry to the W-NUT 2015 NER shared task. The goal is to correctly label entities in a tweet dataset, using an inventory of ten types. We employ structured learning, drawing on gazetteers taken from Linked Data, and on unsupervised clustering features, and attempting to compensate for stylistic and topic drift - a key challenge in social media text. Our result is competitive; we provide an analysis of the components of our methodology, and an examination of the target dataset in the context of this task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2019

Named Entity Recognition on Code-Switched Data: Overview of the CALCS 2018 Shared Task

In the third shared task of the Computational Approaches to Linguistic C...
research
10/30/2017

Named Entity Recognition in Twitter using Images and Text

Named Entity Recognition (NER) is an important subtask of information ex...
research
07/24/2017

CAp 2017 challenge: Twitter Named Entity Recognition

The paper describes the CAp 2017 challenge. The challenge concerns the p...
research
09/03/2018

Named Entity Recognition on Noisy Data using Images and Text (1-page abstract)

Named Entity Recognition (NER) is an important subtask of information ex...
research
09/24/2018

Recognizing Film Entities in Podcasts

In this paper, we propose a Named Entity Recognition (NER) system to ide...
research
04/20/2021

Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

Performance of neural models for named entity recognition degrades over ...
research
10/27/2021

Towards Realistic Single-Task Continuous Learning Research for NER

There is an increasing interest in continuous learning (CL), as data pri...

Please sign up or login with your details

Forgot password? Click here to reset