Language Independent Acquisition of Abbreviations

09/23/2017
by Michael R. Glass, et al.

This paper addresses the automatic extraction of abbreviations (including acronyms and initialisms) and their corresponding long-form expansions from plain unstructured text. We create, and will release, a multilingual resource of abbreviations and their expansions, built automatically from Wikipedia redirect and disambiguation pages, that can serve as a benchmark for evaluation. This addresses a shortcoming of previous work, which used only redirect pages and therefore assigned every abbreviation a single expansion, even though many abbreviations have several possible expansions. We also develop a principled machine-learning approach to scoring expansion candidates that combines indicators of near synonymy, topical relatedness, and surface similarity. Relative to strong baselines, we show improved performance across seven languages, including two with a non-Latin alphabet.
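To make the idea of a surface-similarity indicator concrete, here is a minimal sketch in Python. It is not the paper's feature set or model, which the abstract does not specify. The function name `surface_similarity` and the scoring rule (the fraction of abbreviation letters matched, in order, by word-initial letters of the candidate expansion) are assumptions chosen purely for illustration.

```python
def surface_similarity(abbrev: str, expansion: str) -> float:
    """Hypothetical surface-similarity score in [0, 1].

    Returns the fraction of the abbreviation's letters that can be
    matched, in order, against the initial letters of the words in the
    candidate expansion. This is an illustrative heuristic only.
    """
    # casefold() gives case-insensitive comparison beyond ASCII,
    # so the same code handles non-Latin scripts such as Cyrillic.
    letters = [c.casefold() for c in abbrev if c.isalnum()]
    words = expansion.replace("-", " ").split()
    initials = [w[0].casefold() for w in words]
    if not letters:
        return 0.0

    matched = 0
    pos = 0  # next word-initial that is still available for matching
    for ch in letters:
        # Skip words whose initial does not match (e.g. "and", "of").
        while pos < len(initials) and initials[pos] != ch:
            pos += 1
        if pos == len(initials):
            break  # ran out of words; remaining letters stay unmatched
        matched += 1
        pos += 1
    return matched / len(letters)


# Rank several candidate expansions for one abbreviation.
candidates = ["National Football League", "Natural Language Processing"]
ranked = sorted(candidates, key=lambda e: surface_similarity("NLP", e), reverse=True)
```

A score like this would be only one input among several. The abstract describes combining it with near-synonymy and topical-relatedness indicators in a learned scorer, which this sketch does not attempt.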

