LIDIOMS: A Multilingual Linked Idioms Data Set

02/22/2018
by   Diego Moussallem, et al.
0

In this paper, we describe the LIDIOMS data set, a multilingual RDF representation of idioms currently containing five languages: English, German, Italian, Portuguese, and Russian. The data set is intended to support natural language processing applications by providing links between idioms across languages. The underlying data was crawled and integrated from various sources. To ensure the quality of the crawled data, all idioms were evaluated by at least two native speakers. Herein, we present the model devised for structuring the data. We also provide the details of linking LIDIOMS to well-known multilingual data sets such as BabelNet. The resulting data set complies with best practices according to Linguistic Linked Open Data Community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2023

How Different Is Stereotypical Bias Across Languages?

Recent studies have demonstrated how to assess the stereotypical bias in...
research
08/28/2022

Adapting the LodView RDF Browser for Navigation over the Multilingual Linguistic Linked Open Data Cloud

The paper is dedicated to the use of LodView for navigation over the mul...
research
06/12/2019

Linking geospatial data with Geo-L – analysis and experiments of big data readiness of common technologies

Geospatial Linked Data is an emerging domain, with growing interest in r...
research
09/02/2019

Blended Integrated Open Data: dados abertos públicos integrados

While several public institutions provide its data openly, the effort re...
research
07/19/2019

Linked Crunchbase: A Linked Data API and RDF Data Set About Innovative Companies

Crunchbase is an online platform collecting information about startups a...
research
10/14/2022

The State of Profanity Obfuscation in Natural Language Processing

Work on hate speech has made the consideration of rude and harmful examp...
research
10/24/2022

Universal and Independent: Multilingual Probing Framework for Exhaustive Model Interpretation and Evaluation

Linguistic analysis of language models is one of the ways to explain and...

Please sign up or login with your details

Forgot password? Click here to reset