A Visual Analytics Framework for Composing a Hierarchical Classification for Medieval Illuminations

08/20/2022
by Christofer Meinecke, et al.

Annotated data is a requirement for applying supervised machine learning methods, and the quality of the annotations is crucial for the result. Especially when working with cultural heritage collections, which carry many uncertainties, annotating data remains a manual, arduous task carried out by domain experts. Our project started with two already annotated sets of medieval manuscript images, which were, however, incomplete and contained conflicting metadata arising from scholarly and linguistic differences. Our aims were to create (1) a uniform set of descriptive labels for the combined data set, and (2) a high-quality hierarchical classification that can serve as input for supervised machine learning. To reach these goals, we developed a visual analytics system that enables medievalists to combine, regularize, and extend the vocabulary used to describe these data sets. Visual interfaces for word and image embeddings, as well as for co-occurrences of the annotations across the data sets, enable annotating multiple images at the same time, recommend annotation label candidates, and support composing a hierarchical classification of labels. The system itself implements a semi-supervised method, as it updates visual representations based on the medievalists' feedback, and a series of usage scenarios documents its value for the target community.


