Top-Rank-Focused Adaptive Vote Collection for the Evaluation of Domain-Specific Semantic Models

10/09/2020
by   Pierangelo Lombardo, et al.
0

The growth of domain-specific applications of semantic models, boosted by the recent achievements of unsupervised embedding learning algorithms, demands domain-specific evaluation datasets. In many cases, content-based recommenders being a prime example, these models are required to rank words or texts according to their semantic relatedness to a given concept, with particular focus on top ranks. In this work, we give a threefold contribution to address these requirements: (i) we define a protocol for the construction, based on adaptive pairwise comparisons, of a relatedness-based evaluation dataset tailored on the available resources and optimized to be particularly accurate in top-rank evaluation; (ii) we define appropriate metrics, extensions of well-known ranking correlation coefficients, to evaluate a semantic model via the aforementioned dataset by taking into account the greater significance of top ranks. Finally, (iii) we define a stochastic transitivity model to simulate semantic-driven pairwise comparisons, which confirms the effectiveness of the proposed dataset construction protocol.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/12/2021

Kicktionary-LOME: A Domain-Specific Multilingual Frame Semantic Parsing Model for Football Language

This technical report introduces an adapted version of the LOME frame se...
research
02/13/2023

Evaluation of Word Embeddings for the Social Sciences

Word embeddings are an essential instrument in many NLP tasks. Most avai...
research
07/20/2023

MediaGPT : A Large Language Model For Chinese Media

Large language models (LLMs) have shown remarkable capabilities in gener...
research
07/28/2023

ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation

This paper presents the development and evaluation of ChatHome, a domain...
research
02/06/2020

Towards Semantic Noise Cleansing of Categorical Data based on Semantic Infusion

Semantic Noise affects text analytics activities for the domain-specific...
research
06/17/2021

IFCNet: A Benchmark Dataset for IFC Entity Classification

Enhancing interoperability and information exchange between domain-speci...
research
05/11/2023

A maturity model for catalogues of semantic artefacts

The work presented in this paper is twofold. On the one hand, we aim to ...

Please sign up or login with your details

Forgot password? Click here to reset