Kratt: Developing an Automatic Subject Indexing Tool for The National Library of Estonia

03/24/2022
by Marit Asula, et al.

Manual subject indexing in libraries is a time-consuming and costly process, and the quality of the assigned subjects depends on the cataloguer's knowledge of the specific topics covered in the book. To address these issues, we used artificial intelligence to develop Kratt, a prototype of an automatic subject indexing tool. Kratt can subject index a book, regardless of its length and genre, with a set of keywords drawn from the Estonian Subject Thesaurus. Kratt takes approximately 1 minute to subject index a book, which makes it 10 to 15 times faster than a human cataloguer. Although the cataloguers did not consider the resulting keywords satisfactory, ratings from a small sample of regular library users showed more promise. We also argue that the results can be improved by training the model on a larger corpus and applying more careful preprocessing techniques.


research
06/16/2021

Sentiment Progression based Searching and Indexing of Literary Textual Artefacts

Literary artefacts are generally indexed and searched based on titles, m...
research
08/17/2019

Comparison-Based Indexing From First Principles

Basic assumptions about comparison-based indexing are laid down and a ge...
research
04/28/2022

MeSHup: A Corpus for Full Text Biomedical Document Indexing

Medical Subject Heading (MeSH) indexing refers to the problem of assigni...
research
09/08/2017

FAST: Frequency-Aware Spatio-Textual Indexing for In-Memory Continuous Filter Query Processing

Many applications need to process massive streams of spatio-textual data...
research
05/13/2020

MeSH descriptors indicate the knowledge growth in the SARS-CoV-2/COVID-19 pandemic

The scientific papers dealing with the novel betacoronavirus SARS-CoV-2 ...
research
10/02/2019

BookQA: Stories of Challenges and Opportunities

We present a system for answering questions based on the full text of bo...
research
01/19/2018

Information and Environment: IoT-Powered Recommender Systems

Internet of Things (IoT) infrastructure within the physical library envi...
