Domain Adaptation for Named Entity Recognition in Online Media with Word Embeddings

12/01/2016
by Vivek Kulkarni, et al.

Content on the Internet is heterogeneous and arises from various domains such as News, Entertainment, Finance, and Technology. Identifying named entities (persons, places, and organizations) is one of the key steps in understanding such content. Traditionally, Named Entity Recognition (NER) systems have been built using available annotated datasets (like CoNLL and MUC) and demonstrate excellent performance. However, these models fail to generalize to other domains like Sports and Finance, where conventions and language use can differ significantly. Furthermore, several domains do not have large amounts of annotated data for training robust NER models. A key step towards addressing this challenge is to adapt models learned on domains where large amounts of annotated training data are available to domains with scarce annotated data. In this paper, we propose methods to effectively adapt models learned on one domain to other domains using distributed word representations. First, we analyze the linguistic variation present across domains to identify key linguistic insights that can boost performance across domains. We propose methods to capture domain-specific semantics of word usage in addition to global semantics. We then demonstrate how to effectively use such domain-specific knowledge to learn NER models that outperform previous baselines in the domain adaptation setting.
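
As a rough illustration of what "domain-specific semantics in addition to global semantics" could look like as input features, the sketch below concatenates a global word vector with a domain-specific one. This is an assumption made for illustration only, not the method described in the paper: the function name `combined_features`, the zero-vector fallback for unseen words, and all vector values are hypothetical.

```python
from typing import Dict, List

Vector = List[float]


def combined_features(
    word: str,
    global_emb: Dict[str, Vector],
    domain_emb: Dict[str, Vector],
    dim_global: int,
    dim_domain: int,
) -> Vector:
    """Concatenate a global vector with a domain-specific vector.

    Hypothetical helper: a word missing from either table falls back
    to a zero vector of the matching dimension.
    """
    g = global_emb.get(word, [0.0] * dim_global)
    d = domain_emb.get(word, [0.0] * dim_domain)
    return g + d  # list concatenation, length dim_global + dim_domain


# Toy, made-up embeddings (not taken from any real model or dataset).
global_emb = {"apple": [0.1, 0.2], "goal": [0.5, 0.5]}
finance_emb = {"apple": [0.9, 0.0]}  # "apple" as used in a Finance corpus

apple_finance = combined_features("apple", global_emb, finance_emb, 2, 2)
goal_finance = combined_features("goal", global_emb, finance_emb, 2, 2)
```

Under this assumption, a downstream NER tagger would see both the general meaning of a word and how it is used in the target domain; words never observed in the domain corpus contribute only their global part.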

research
11/24/2020

Domain-Transferable Method for Named Entity Recognition Task

Named Entity Recognition (NER) is a fundamental task in the fields of na...
research
03/05/2020

Neural Cross-Lingual Transfer and Limited Annotated Data for Named Entity Recognition in Danish

Named Entity Recognition (NER) has greatly advanced by the introduction ...
research
07/05/2019

Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings

Chemical patents are an important resource for chemical information. How...
research
05/22/2020

Bootstrapping Named Entity Recognition in E-Commerce with Positive Unlabeled Learning

Named Entity Recognition (NER) in domains like e-commerce is an understu...
research
06/07/2018

Embedding Transfer for Low-Resource Medical Named Entity Recognition: A Case Study on Patient Mobility

Functioning is gaining recognition as an important indicator of global h...
research
08/31/2019

Named Entity Recognition Only from Word Embeddings

Deep neural network models have helped named entity (NE) recognition ach...
research
01/31/2017

Robust Multilingual Named Entity Recognition with Shallow Semi-Supervised Features

We present a multilingual Named Entity Recognition approach based on a r...
