Recognizing Extended Spatiotemporal Expressions by Actively Trained Average Perceptron Ensembles

08/19/2015
by Wei Zhang, et al.
Precise geocoding and time normalization for text require that location and time phrases be identified. Many state-of-the-art geoparsers and temporal parsers suffer from low recall. Categories commonly missed by parsers are: nouns used in a non-spatiotemporal sense, adjectival and adverbial phrases, prepositional phrases, and numerical phrases. We collected and annotated a dataset by querying a commercial web search API with spatiotemporal expressions that were missed by state-of-the-art parsers. Due to the high cost of sentence annotation, active learning was used to label training data, and a new strategy was designed to better select training examples and reduce labeling cost. For the learning algorithm, we applied a Featurized Hidden Markov Model (FHMM) trained with the average perceptron. Five FHMM instances were used to create an ensemble, with the output phrase selected by voting. Our ensemble model was tested on a range of sequential labeling tasks and showed competitive performance. Our contributions include (1) a new dataset annotated with named entities and expanded spatiotemporal expressions; (2) a comparison of inference algorithms for ensemble models showing the superior accuracy of Belief Propagation over Viterbi Decoding; (3) a new example re-weighting method for active ensemble learning that 'memorizes' the latest examples trained; (4) a spatiotemporal parser that jointly recognizes expanded spatiotemporal expressions as well as named entities.
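
The voting step mentioned above can be illustrated with a minimal sketch. The abstract does not specify the voting rule, so this assumes a simple majority vote over exact phrase spans; the `Span` type, the `vote_spans` function, and the example predictions are hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import List, Optional, Set, Tuple

# Hypothetical span representation: (start token, end token exclusive, label).
Span = Tuple[int, int, str]


def vote_spans(predictions: List[Set[Span]],
               min_votes: Optional[int] = None) -> List[Span]:
    """Keep the spans proposed by at least `min_votes` ensemble members.

    Assumption: a strict majority is required by default. The voting rule
    actually used in the paper may differ.
    """
    if min_votes is None:
        min_votes = len(predictions) // 2 + 1
    # Each model contributes a set, so it casts at most one vote per span.
    counts = Counter(span for spans in predictions for span in spans)
    return sorted(span for span, n in counts.items() if n >= min_votes)


# Made-up outputs from five taggers for one sentence.
predictions = [
    {(0, 2, "TIME"), (5, 7, "LOC")},
    {(0, 2, "TIME"), (5, 7, "LOC")},
    {(0, 2, "TIME")},
    {(0, 1, "TIME"), (5, 7, "LOC")},
    {(5, 6, "LOC")},
]
voted = vote_spans(predictions)
print(voted)
```

With five members the default threshold is three votes, so a span that only one or two taggers propose is dropped. Voting on exact spans is the strictest choice; a per-token vote would be more lenient on boundary disagreements.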

research
01/30/2020

Fase-AL – Adaptation of Fast Adaptive Stacking of Ensembles for Supporting Active Learning

Classification algorithms to mine data stream have been extensively stud...
research
02/26/2020

Streaming Active Deep Forest for Evolving Data Stream Classification

In recent years, Deep Neural Networks (DNNs) have gained progressive mom...
research
06/21/2021

Phrase-level Active Learning for Neural Machine Translation

Neural machine translation (NMT) is sensitive to domain shift. In this p...
research
10/01/2020

RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation

The task of video object segmentation with referring expressions (langua...
research
12/13/2022

Pixel is All You Need: Adversarial Trajectory-Ensemble Active Learning for Salient Object Detection

Although weakly-supervised techniques can reduce the labeling effort, it...
research
08/09/2016

TweeTime: A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter

We describe TweeTIME, a temporal tagger for recognizing and normalizing ...
research
07/12/2023

Reconstructing Spatiotemporal Data with C-VAEs

The continuous representation of spatiotemporal data commonly relies on ...
