Navigating the Mise-en-Page: Interpretive Machine Learning Approaches to the Visual Layouts of Multi-Ethnic Periodicals

This paper presents a computational method of analysis that draws on machine learning, library science, and literary studies to map the visual layouts of multi-ethnic newspapers from the late-19th- and early-20th-century United States. This work departs from prior approaches to newspapers, which focus on individual pieces of textual and visual content. Our method combines Chronicling America's MARC data and the Newspaper Navigator machine learning dataset to identify the visual patterns of newspaper page layouts. By analyzing high-dimensional visual similarity, we aim to better understand how editors spoke and protested through the layout of their papers.
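
As a rough illustration of what comparing pages by high-dimensional visual similarity can look like, the sketch below ranks pages by cosine similarity between feature vectors. The abstract does not say which similarity measure or page representation is used, so cosine similarity, the function names, and the toy three-dimensional vectors are all assumptions made for illustration only.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def nearest_pages(embeddings, query_index, k=3):
    """Indices of the k pages most similar to the query page (hypothetical helper)."""
    query = embeddings[query_index]
    scored = [
        (cosine_similarity(query, vector), index)
        for index, vector in enumerate(embeddings)
        if index != query_index  # exclude the query page itself
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [index for _, index in scored[:k]]


# Toy layout vectors (assumed): real page representations would have
# hundreds or thousands of dimensions.
pages = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

closest = nearest_pages(pages, query_index=0, k=1)
```

Here page 1 is the closest neighbor of page 0 because their vectors point in nearly the same direction. Cosine similarity ignores vector magnitude, which is one common reason to prefer it for comparing learned embeddings.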


