BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision

06/28/2020
by   Chen Liang, et al.

We study the open-domain named entity recognition (NER) problem under distant supervision. Although distant supervision does not require large amounts of manual annotation, it yields highly incomplete and noisy labels derived from external knowledge bases. To address this challenge, we propose a new computational framework, BOND, which leverages the power of pre-trained language models (e.g., BERT and RoBERTa) to improve the prediction performance of NER models. Specifically, we propose a two-stage training algorithm. In the first stage, we adapt the pre-trained language model to the NER task using the distant labels, which can significantly improve both recall and precision. In the second stage, we drop the distant labels and propose a self-training approach to further improve model performance. Thorough experiments on 5 benchmark datasets demonstrate the superiority of BOND over existing distantly supervised NER methods. The code and distantly labeled data have been released at https://github.com/cliang1453/BOND.
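
To make the notion of incomplete distant labels concrete, the following is a minimal sketch of dictionary-based distant labeling. It is not taken from the BOND code: the function name `distant_label`, the toy `GAZETTEER`, the BIO tag scheme, and the greedy longest-match strategy are all illustrative assumptions about how a knowledge base might be matched against text.

```python
from typing import Dict, List, Tuple

# Hypothetical toy knowledge base: lower-cased token tuples -> entity type.
GAZETTEER: Dict[Tuple[str, ...], str] = {
    ("new", "york"): "LOC",
    ("google",): "ORG",
}


def distant_label(
    tokens: List[str],
    gazetteer: Dict[Tuple[str, ...], str],
    max_len: int = 3,
) -> List[str]:
    """Assign BIO tags by greedy longest match against a gazetteer.

    Tokens that match no entry are tagged "O", even if they are
    entities, which is one source of incomplete distant labels.
    """
    labels = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate span first.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            span = tuple(t.lower() for t in tokens[i:i + n])
            if span in gazetteer:
                entity_type = gazetteer[span]
                labels[i] = f"B-{entity_type}"
                for j in range(i + 1, i + n):
                    labels[j] = f"I-{entity_type}"
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return labels


tokens = "Alice Smith joined Google in New York".split()
labels = distant_label(tokens, GAZETTEER)
for token, label in zip(tokens, labels):
    print(f"{token}\t{label}")
```

In this toy example the person name is absent from the gazetteer and is therefore left as "O". A model trained naively on such labels would be penalized for predicting that entity, which is the kind of incompleteness the two-stage training described above is meant to cope with.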

research
09/10/2021

Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

We study the problem of training named entity recognition (NER) models u...
research
09/10/2018

Learning Named Entity Tagger using Domain-Specific Dictionary

Recent advances in deep neural models allow us to build reliable named e...
research
05/26/2021

BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition

We study the problem of learning a named entity recognition (NER) tagger...
research
04/09/2021

Noisy-Labeled NER with Confidence Estimation

Recent studies in deep learning have shown significant progress in named...
research
05/04/2019

Learning to Denoise Distantly-Labeled Data for Entity Typing

Distantly-labeled data can be used to scale up training of statistical m...
research
06/30/2023

DeepTagger: Knowledge Enhanced Named Entity Recognition for Web-Based Ads Queries

Named entity recognition (NER) is a crucial task for online advertisemen...
research
06/17/2021

Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model

Denoising is the essential step for distant supervision based named enti...
