OYXOY: A Modern NLP Test Suite for Modern Greek

09/13/2023
by   Konstantinos Kogkalidis, et al.
0

This paper serves as a foundational step towards the development of a linguistically motivated and technically relevant evaluation suite for Greek NLP. We initiate this endeavor by introducing four expert-verified evaluation tasks, specifically targeted at natural language inference, word sense disambiguation (through example comparison or sense selection) and metaphor detection. More than language-adapted replicas of existing tasks, we contribute two innovations which will resonate with the broader resource and evaluation community. Firstly, our inference dataset is the first of its kind, marking not just one, but rather all possible inference labels, accounting for possible shifts due to e.g. ambiguity or polysemy. Secondly, we demonstrate a cost-efficient method to obtain datasets for under-resourced languages. Using ChatGPT as a language-neutral parser, we transform the Dictionary of Standard Modern Greek into a structured format, from which we derive the other three tasks through simple projections. Alongside each task, we conduct experiments using currently available state of the art machinery. Our experimental baselines affirm the challenging nature of our tasks and highlight the need for expedited progress in order for the Greek NLP ecosystem to keep pace with contemporary mainstream research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2022

Not another Negation Benchmark: The NaN-NLI Test Suite for Sub-clausal Negation

Negation is poorly captured by current language models, although the ext...
research
09/18/2020

FarsTail: A Persian Natural Language Inference Dataset

Natural language inference (NLI) is known as one of the central tasks in...
research
02/13/2023

Linguistic ambiguity analysis in ChatGPT

Linguistic ambiguity is and has always been one of the main challenges i...
research
06/16/2023

No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference

Natural Language Inference (NLI) has been a cornerstone task in evaluati...
research
05/02/2020

Predicting Performance for Natural Language Processing Tasks

Given the complexity of combinations of tasks, languages, and domains in...
research
07/19/2023

Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation

Rising computational demands of modern natural language processing (NLP)...
research
09/15/2021

Comparing Text Representations: A Theory-Driven Approach

Much of the progress in contemporary NLP has come from learning represen...

Please sign up or login with your details

Forgot password? Click here to reset