Extending an Event-type Ontology: Adding Verbs and Classes Using Fine-tuned LLMs Suggestions

06/03/2023
by   Jana Straková, et al.
0

In this project, we have investigated the use of advanced machine learning methods, specifically fine-tuned large language models, for pre-annotating data for a lexical extension task, namely adding descriptive words (verbs) to an existing (but incomplete, as of yet) ontology of event types. Several research questions have been focused on, from the investigation of a possible heuristics to provide at least hints to annotators which verbs to include and which are outside the current version of the ontology, to the possible use of the automatic scores to help the annotators to be more efficient in finding a threshold for identifying verbs that cannot be assigned to any existing class and therefore they are to be used as seeds for a new class. We have also carefully examined the correlation of the automatic scores with the human annotation. While the correlation turned out to be strong, its influence on the annotation proper is modest due to its near linearity, even though the mere fact of such pre-annotation leads to relatively short annotation times.

READ FULL TEXT
research
05/23/2021

OntoED: Low-resource Event Detection with Ontology Embedding

Event Detection (ED) aims to identify event trigger words from a given t...
research
06/20/2018

Word Tagging with Foundational Ontology Classes: Extending the WordNet-DOLCE Mapping to Verbs

Semantic annotation is fundamental to deal with large-scale lexical info...
research
07/31/2023

Ontology engineering with Large Language Models

We tackle the task of enriching ontologies by automatically translating ...
research
02/13/2023

Machine Learning Model Attribution Challenge

We present the findings of the Machine Learning Model Attribution Challe...
research
06/01/2023

Column Type Annotation using ChatGPT

Column type annotation is the task of annotating the columns of a relati...
research
10/23/2022

Lexical Generalization Improves with Larger Models and Longer Training

While fine-tuned language models perform well on many tasks, they were a...
research
08/07/2023

WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset

A fundamental challenge in the current NLP context, dominated by languag...

Please sign up or login with your details

Forgot password? Click here to reset