Automatic Rule Generation for Time Expression Normalization

08/31/2021
by   Wentao Ding, et al.
0

The understanding of time expressions includes two sub-tasks: recognition and normalization. In recent years, significant progress has been made in the recognition of time expressions while research on normalization has lagged behind. Existing SOTA normalization methods highly rely on rules or grammars designed by experts, which limits their performance on emerging corpora, such as social media texts. In this paper, we model time expression normalization as a sequence of operations to construct the normalized temporal value, and we present a novel method called ARTime, which can automatically generate normalization rules from training data without expert interventions. Specifically, ARTime automatically captures possible operation sequences from annotated data and generates normalization rules on time expressions with common surface forms. The experimental results show that ARTime can significantly surpass SOTA methods on the Tweets benchmark, and achieves competitive results with existing expert-engineered rule methods on the TempEval-3 benchmark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2015

On TimeML-Compliant Temporal Expression Extraction in Turkish

It is commonly acknowledged that temporal expression extractors are impo...
research
05/20/2022

Multilingual Normalization of Temporal Expressions with Masked Language Models

The detection and normalization of temporal expressions is an important ...
research
04/27/2023

A Modular Approach for Multilingual Timex Detection and Normalization using Deep Learning and Grammar-based methods

Detecting and normalizing temporal expressions is an essential step for ...
research
08/09/2016

TweeTime: A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter

We describe TweeTIME, a temporal tagger for recognizing and normalizing ...
research
11/05/2021

Adaptive Warden Strategy for Countering Network Covert Storage Channels

The detection and elimination of covert channels are performed by a netw...
research
11/28/2018

Sequence Learning with RNNs for Medical Concept Normalization in User-Generated Texts

In this work, we consider the medical concept normalization problem, i.e...
research
03/31/2023

Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions

Temporal and numerical expression understanding is of great importance i...

Please sign up or login with your details

Forgot password? Click here to reset