Publisher References in Bibliographic Entity Descriptions

by   Jim Hahn, et al.

This paper describes a method for improved access to publisher references in linked data RDF editors using data mining techniques and a large set of library metadata encoded in the MARC21 standard. The corpus is comprised of clustered sets of publishers and publisher locations from the library MARC21 records found in the POD Data Lake, an Ivy+ Library Consortium metadata sharing initiative. The POD Data Lake contains seventy million MARC21 records, forty million of which are unique. The discovery of publisher entity sets described forms the basis for the streamlined description of BIBFRAME Instance entities. This study resulted in two major outputs: 1) A prediction database and 2) sets of publisher location and name association rules. The association rules are the basis of a prototype autosuggestion feature of BIBFRAME Instance entity description properties designed specifically to support the autopopulation of publisher entities in linked data RDF editors.



There are no comments yet.


page 1

page 2

page 3

page 4


Communicating and resolving entity references

Statements about entities occur everywhere, from newspapers and web page...

GeoVectors: A Linked Open Corpus of OpenStreetMap Embeddings on World Scale

OpenStreetMap (OSM) is currently the richest publicly available informat...

The number of linked references of publications in Microsoft Academic in comparison with the Web of Science

In the context of a comprehensive Microsoft Academic (MA) study, we expl...

AMORE-UPF at SemEval-2018 Task 4: BiLSTM with Entity Library

This paper describes our winning contribution to SemEval 2018 Task 4: Ch...

Creating a Linked Data-Friendly Metadata Application Profile for Archival Description

We provide an overview of efforts to apply and extend for arc...

Communicating Semantics: Reference by Description

Messages often refer to entities such as people, places and events. Corr...

Reliable Granular References to Changing Linked Data

Nanopublications are a concept to represent Linked Data in a granular an...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.