Approaching Peak Ground Truth

12/31/2022
by   Florian Kofler, et al.
0

Machine learning models are typically evaluated by computing similarity with reference annotations and trained by maximizing similarity with such. Especially in the bio-medical domain, annotations are subjective and suffer from low inter- and intra-rater reliability. Since annotations only reflect the annotation entity's interpretation of the real world, this can lead to sub-optimal predictions even though the model achieves high similarity scores. Here, the theoretical concept of Peak Ground Truth (PGT) is introduced. PGT marks the point beyond which an increase in similarity with the reference annotation stops translating to better Real World Model Performance (RWMP). Additionally, a quantitative technique to approximate PGT by computing inter- and intra-rater reliability is proposed. Finally, three categories of PGT-aware strategies to evaluate and improve model performance are reviewed.

READ FULL TEXT
research
08/19/2021

Czech News Dataset for Semantic Textual Similarity

This paper describes a novel dataset consisting of sentences with semant...
research
10/14/2022

The Invariant Ground Truth of Affect

Affective computing strives to unveil the unknown relationship between a...
research
11/11/2016

Improving Reliability of Word Similarity Evaluation by Redesigning Annotation Task and Performance Measure

We suggest a new method for creating and using gold-standard datasets fo...
research
02/19/2021

Subjective Assessments of Legibility in Ancient Manuscript Images – The SALAMI Dataset

The research field concerned with the digital restoration of degraded wr...
research
03/12/2019

Noisy Supervision for Correcting Misaligned Cadaster Maps Without Perfect Ground Truth Data

In machine learning the best performance on a certain task is achieved b...
research
02/28/2020

Neural Network Segmentation of Interstitial Fibrosis, Tubular Atrophy, and Glomerulosclerosis in Renal Biopsies

Glomerulosclerosis, interstitial fibrosis, and tubular atrophy (IFTA) ar...
research
01/04/2023

Learning Ambiguity from Crowd Sequential Annotations

Most crowdsourcing learning methods treat disagreement between annotator...

Please sign up or login with your details

Forgot password? Click here to reset