Social Cards Probably Provide For Better Understanding Of Web Archive Collections

05/27/2019
by   Shawn M. Jones, et al.
0

Used by a variety of researchers, web archive collections have become invaluable sources of evidence. If a researcher is presented with a web archive collection that they did not create, how do they know what is inside so that they can use it for their own research? Search engine results and social media links are represented as surrogates, small easily digestible summaries of the underlying page. Search engines and social media have a different focus, and hence produce different surrogates than web archives. Search engine surrogates help a user answer the question "Will this link meet my information need?" Social media surrogates help a user decide "Should I click on this?" Our use case is subtly different. We hypothesize that groups of surrogates together are useful for summarizing a collection. We want to help users answer the question of "What does the underlying collection contain?" But which surrogate should we use? With Mechanical Turk participants, we evaluate six different surrogate types against each other. We find that the type of surrogate does not influence the time to complete the task we presented the participants. Of particular interest are social cards, surrogates typically found on social media, and browser thumbnails, screen captures of web pages rendered in a browser. At p=0.0569, and p=0.0770, respectively, we find that social cards and social cards paired side-by-side with browser thumbnails probably provide better collection understanding than the surrogates currently used by the popular Archive-It web archiving platform. We measure user interactions with each surrogate and find that users interact with social cards less than other types. The results of this study have implications for our web archive summarization work, live web curation platforms, social media, and more.

READ FULL TEXT
research
08/01/2020

MementoEmbed and Raintale for Web Archive Storytelling

For traditional library collections, archivists can select a representat...
research
12/19/2016

iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling

Researchers in the Digital Humanities and journalists need to monitor, c...
research
05/17/2017

Stories From the Past Web

Archiving Web pages into themed collections is a method for ensuring the...
research
07/06/2021

Garbage, Glitter, or Gold: Assigning Multi-dimensional Quality Scores to Social Media Seeds for Web Archive Collections

From popular uprisings to pandemics, the Web is an essential source cons...
research
12/19/2016

The iCrawl Wizard -- Supporting Interactive Focused Crawl Specification

Collections of Web documents about specific topics are needed for many a...
research
04/08/2018

A Structure-Oriented Unsupervised Crawling Strategy for Social Media Sites

Existing techniques for efficiently crawling social media sites rely on ...
research
11/11/2016

Show me the material evidence: Initial experiments on evaluating hypotheses from user-generated multimedia data

Subjective questions such as `does neymar dive', or `is clinton lying', ...

Please sign up or login with your details

Forgot password? Click here to reset