The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives

01/15/2020
by Nick Ruest et al.

The Archives Unleashed project aims to improve scholarly access to web archives through a multi-pronged strategy involving tool creation, process modeling, and community building, all proceeding concurrently in mutually reinforcing efforts. As we near the end of our initially conceived three-year project, we report on our progress and share lessons learned along the way. The main contribution articulated in this paper is a process model that decomposes scholarly inquiries into four main activities: filter, extract, aggregate, and visualize. Because these activities can be disaggregated across time, space, and tools, it is possible to use our Archives Unleashed Toolkit to generate "derivative products" that serve as useful starting points for scholarly inquiry. Scholars can download these products from the Archives Unleashed Cloud and manipulate them just like any other dataset, which provides access to web archives without requiring any specialized knowledge. Over the past few years, our platform has processed over a thousand different collections from about two hundred users, totaling over 280 terabytes of web archives.
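The four activities of the process model can be illustrated with a minimal sketch in plain Python. This is not the Archives Unleashed Toolkit API: the record layout, the sample data, and every function name below are hypothetical, chosen only to show how filter, extract, and aggregate can run once to yield a small derivative product, while visualize runs later and elsewhere on that product alone.

```python
from collections import Counter
from urllib.parse import urlparse

# Hypothetical records standing in for pages in a web archive collection.
RECORDS = [
    {"url": "http://example.org/a", "crawl_date": "20190101", "mime": "text/html"},
    {"url": "http://example.org/b", "crawl_date": "20190102", "mime": "text/html"},
    {"url": "http://example.com/logo.png", "crawl_date": "20190102", "mime": "image/png"},
    {"url": "http://example.net/", "crawl_date": "20180505", "mime": "text/html"},
]


def filter_records(records, year):
    """Filter: keep only HTML pages crawled in the given year."""
    return [
        r for r in records
        if r["mime"] == "text/html" and r["crawl_date"].startswith(year)
    ]


def extract_domains(records):
    """Extract: pull the domain out of each record's URL."""
    return [urlparse(r["url"]).netloc for r in records]


def aggregate(domains):
    """Aggregate: count how many pages each domain contributes."""
    return Counter(domains)


def visualize(counts):
    """Visualize: render the counts as a plain-text bar chart."""
    return "\n".join(
        f"{domain:<15}{'#' * n} {n}" for domain, n in counts.most_common()
    )


def derivative_product(records, year):
    """Run filter, extract, and aggregate; the result is a small dataset."""
    return aggregate(extract_domains(filter_records(records, year)))


if __name__ == "__main__":
    counts = derivative_product(RECORDS, "2019")
    print(visualize(counts))
```

In this sketch the output of `derivative_product` plays the role of a downloadable derivative: a scholar could load it into any tool and chart it without touching the original archive records.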
