TopoBERT: Plug and Play Toponym Recognition Module Harnessing Fine-tuned BERT

01/31/2023
by   Bing Zhou, et al.
0

Extracting precise geographical information from textual contents is crucial in a plethora of applications. For example, during hazardous events, a robust and unbiased toponym extraction framework can provide an avenue to tie the location concerned to the topic discussed by news media posts and pinpoint humanitarian help requests or damage reports from social media. Early studies have leveraged rule-based, gazetteer-based, deep learning, and hybrid approaches to address this problem. However, the performance of existing tools is deficient in supporting operations like emergency rescue, which relies on fine-grained, accurate geographic information. The emerging pretrained language models can better capture the underlying characteristics of text information, including place names, offering a promising pathway to optimize toponym recognition to underpin practical applications. In this paper, TopoBERT, a toponym recognition module based on a one dimensional Convolutional Neural Network (CNN1D) and Bidirectional Encoder Representation from Transformers (BERT), is proposed and fine-tuned. Three datasets (CoNLL2003-Train, Wikipedia3000, WNUT2017) are leveraged to tune the hyperparameters, discover the best training strategy, and train the model. Another two datasets (CoNLL2003-Test and Harvey2017) are used to evaluate the performance. Three distinguished classifiers, linear, multi-layer perceptron, and CNN1D, are benchmarked to determine the optimal model architecture. TopoBERT achieves state-of-the-art performance (f1-score=0.865) compared to the other five baseline models and can be applied to diverse toponym recognition tasks without additional training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2020

I-AID: Identifying Actionable Information from Disaster-related Tweets

Social media data plays a significant role in modern disaster management...
research
08/22/2020

HinglishNLP: Fine-tuned Language Models for Hinglish Sentiment Detection

Sentiment analysis for code-mixed social media text continues to be an u...
research
06/06/2023

CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental Fine-Tuning and Multi-Task Learning with Label Descriptions

The widespread popularity of social media has led to an increase in hate...
research
07/28/2020

YNU-HPCC at SemEval-2020 Task 8: Using a Parallel-Channel Model for Memotion Analysis

In recent years, the growing ubiquity of Internet memes on social media ...
research
11/30/2021

Automatic Extraction of Medication Names in Tweets as Named Entity Recognition

Social media posts contain potentially valuable information about medica...
research
08/15/2023

Finding Stakeholder-Material Information from 10-K Reports using Fine-Tuned BERT and LSTM Models

All public companies are required by federal securities law to disclose ...
research
03/19/2023

PACO: Provocation Involving Action, Culture, and Oppression

In India, people identify with a particular group based on certain attri...

Please sign up or login with your details

Forgot password? Click here to reset