Using Natural Language Processing to Predict Costume Core Vocabulary of Historical Artifacts

Historic dress artifacts are a valuable source for human studies. In particular, they can provide important insights into the social aspects of their corresponding era. These insights are commonly drawn from garment pictures as well as the accompanying descriptions and are usually stored in a standardized and controlled vocabulary that accurately describes garments and costume items, called the Costume Core Vocabulary. Building an accurate Costume Core from garment descriptions can be challenging because the historic garment items are often donated, and the accompanying descriptions can be based on untrained individuals and use a language common to the period of the items. In this paper, we present an approach to use Natural Language Processing (NLP) to map the free-form text descriptions of the historic items to that of the controlled vocabulary provided by the Costume Core. Despite the limited dataset, we were able to train an NLP model based on the Universal Sentence Encoder to perform this mapping with more than 90 of the Costume Core vocabulary. We describe our methodology, design choices, and development of our approach, and show the feasibility of predicting the Costume Core for unseen descriptions. With more garment descriptions still being curated to be used for training, we expect to have higher accuracy for better generalizability.

READ FULL TEXT
research
08/04/2022

Core Challenge 2022: Solver and Graph Descriptions

This paper collects all descriptions of solvers and ISR instances submit...
research
01/21/2021

Challenges Encountered in Turkish Natural Language Processing Studies

Natural language processing is a branch of computer science that combine...
research
09/09/2019

General Fragment Model for Information Artifacts

The use of semantic descriptions in data intensive domains require a sys...
research
08/04/2022

Vocabulary Transfer for Medical Texts

Vocabulary transfer is a transfer learning subtask in which language mod...
research
03/28/2023

Exploring Natural Language Processing Methods for Interactive Behaviour Modelling

Analysing and modelling interactive behaviour is an important topic in h...
research
05/29/2017

Dynamics of core of language vocabulary

Studies of the overall structure of vocabulary and its dynamics became p...
research
03/21/2021

Automated Software Vulnerability Assessment with Concept Drift

Software Engineering researchers are increasingly using Natural Language...

Please sign up or login with your details

Forgot password? Click here to reset