Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists

09/18/2022
by   Himarsha R. Jayanetti, et al.
0

As web archives' holdings grow, archivists subdivide them into collections so they are easier to understand and manage. In this work, we review the collection structures of eight web archive platforms: : Archive-It, Conifer, the Croatian Web Archive (HAW), the Internet Archive's user account web archives, Library of Congress (LC), PANDORA, Trove, and the UK Web Archive (UKWA). We note a plethora of different approaches to web archive collection structures. Some web archive collections support sub-collections and some permit embargoes. Curatorial decisions may be attributed to a single organization or many. Archived web pages are known by many names: mementos, copies, captures, or snapshots. Some platforms restrict a memento to a single collection and others allow mementos to cross collections. Knowledge of collection structures has implications for many different applications and users. Visitors will need to understand how to navigate collections. Future archivists will need to understand what options are available for designing collections. Platform designers need it to know what possibilities exist. The developers of tools that consume collections need to understand collection structures so they can meet the needs of their users.

READ FULL TEXT

page 2

page 3

page 4

research
08/01/2020

MementoEmbed and Raintale for Web Archive Storytelling

For traditional library collections, archivists can select a representat...
research
05/17/2017

Stories From the Past Web

Archiving Web pages into themed collections is a method for ensuring the...
research
06/18/2018

The Many Shapes of Archive-It

Web archives, a key area of digital preservation, meet the needs of jour...
research
10/26/2020

Multi-Objective Frequent Termset Clustering

Large media collections rapidly evolve in the World Wide Web. In additio...
research
05/01/2013

MATAWS: A Multimodal Approach for Automatic WS Semantic Annotation

Many recent works aim at developing methods and tools for the processing...
research
06/18/2018

The Off-Topic Memento Toolkit

Web archive collections are created with a particular purpose in mind. A...
research
11/16/2016

How to do lexical quality estimation of a large OCRed historical Finnish newspaper collection with scarce resources

The National Library of Finland has digitized the historical newspapers ...

Please sign up or login with your details

Forgot password? Click here to reset