Unsilencing Colonial Archives via Automated Entity Recognition

10/03/2022
by   Mrinalini Luthra, et al.
0

Colonial archives are at the center of increased interest from a variety of perspectives, as they contain traces of historically marginalized people. Unfortunately, like most archives, they remain difficult to access due to significant persisting barriers. We focus here on one of them: the biases to be found in historical findings aids, such as indexes of person names, which remain in use to this day. In colonial archives, indexes can perpetuate silences by omitting to include mentions of historically marginalized persons. In order to overcome such limitations and pluralize the scope of existing finding aids, we propose using automated entity recognition. To this end, we contribute a fit-for-purpose annotation typology and apply it on the colonial archive of the Dutch East India Company (VOC). We release a corpus of nearly 70,000 annotations as a shared task, for which we provide baselines using state-of-the-art neural network models. Our work intends to stimulate further contributions in the direction of broadening access to (colonial) archives, integrating automation as a possible means to this end.

READ FULL TEXT

page 5

page 14

research
10/29/2020

May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance

We investigate using Named Entity Recognition on a new type of user-gene...
research
12/20/2019

TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

In the last years, the consolidation of deep neural network architecture...
research
07/31/2020

Robust Benchmarking for Machine Learning of Clinical Entity Extraction

Clinical studies often require understanding elements of a patient's nar...
research
06/23/2017

Named Entity Recognition with stack residual LSTM and trainable bias decoding

Recurrent Neural Network models are the state-of-the-art for Named Entit...
research
04/09/2020

Interpretability Analysis for Named Entity Recognition to Understand System Predictions and How They Can Improve

Named Entity Recognition systems achieve remarkable performance on domai...
research
08/16/2021

Partially Supervised Named Entity Recognition via the Expected Entity Ratio Loss

We study learning named entity recognizers in the presence of missing en...
research
03/30/2023

Yes but.. Can ChatGPT Identify Entities in Historical Documents?

Large language models (LLMs) have been leveraged for several years now, ...

Please sign up or login with your details

Forgot password? Click here to reset