Log In Sign Up

Using Unlabeled Texts for Named-Entity Recognition

by   Katharina Morik, et al.

Named Entity Recognition (NER) poses the problem of learning with multiple views in two ways. First, there are not enough labeled texts so that the exploitation of unlabeled texts becomes necessary. Second, words and word sequences offer several aspects for representation, each reflecting another aspect of them. Instead of choosing the most promising representation as done in feature selection, the cooperation of different features enhances learning NER. In this paper, we investigate the bootstrapping of features. From labeled and unlabeled texts, features are determined which in turn are exploited to recognize names automatically. The SVM is used as the learning engine. Results on German texts and on biomedical texts show that the approach is promising.


page 1

page 2

page 3

page 4


NeuroNER: an easy-to-use program for named-entity recognition based on neural networks

Named-entity recognition (NER) aims at identifying entities of interest ...

Biomedical Named Entity Recognition via Reference-Set Augmented Bootstrapping

We present a weakly-supervised data augmentation approach to improve Nam...

Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings

Recently, due to the increasing popularity of social media, the necessit...

Adversarial Adaptation for French Named Entity Recognition

Named Entity Recognition (NER) is the task of identifying and classifyin...

Domain-Specific NER via Retrieving Correlated Samples

Successful Machine Learning based Named Entity Recognition models could ...

CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor

Extracting key information from documents, such as receipts or invoices,...

50 Ways to Bake a Cookie: Mapping the Landscape of Procedural Texts

The web is full of guidance on a wide variety of tasks, from changing th...