Navigating the Mise-en-Page: Interpretive Machine Learning Approaches to the Visual Layouts of Multi-Ethnic Periodicals

09/03/2021 ∙ by Benjamin Charles Germain Lee, et al. ∙ Texas A&M University Corpus Christi The University of Texas at Austin Penn State University University of Washington 0

This paper presents a computational method of analysis that draws from machine learning, library science, and literary studies to map the visual layouts of multi-ethnic newspapers from the late 19th and early 20th century United States. This work departs from prior approaches to newspapers that focus on individual pieces of textual and visual content. Our method combines Chronicling America's MARC data and the Newspaper Navigator machine learning dataset to identify the visual patterns of newspaper page layouts. By analyzing high-dimensional visual similarity, we aim to better understand how editors spoke and protested through the layout of their papers.

READ FULL TEXT VIEW PDF
POST COMMENT

Comments

There are no comments yet.

Authors

page 3

page 9

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

1 Introduction

This paper presents a set of developing methods for analyzing Black, Latinx, and other ethnic newspapers in the late nineteenth and early twentieth-century United States. This historical period was marked by rising tides of racism and xenophobia. Editing a newspaper dedicated to the progress of Black, Latinx, or immigrant communities often entailed facing down forces of hate and violence.

Editors in the historical multi-ethnic press often chose to proceed circumspectly. Rather than publish bold condemnations in headlines or editorials, it might have been safer to rely on other affordances of newspapers. It was possible to convey ideas using the paper’s very mise-en-page, in between the lines (or columns) through editorial strategies such as juxtaposing different content types, riffing on a given format, or deliberately reprinting unusual or out-of-context material. In the multi-ethnic press, editors often employed these strategies to render layout as its own field of meaning-making: relations between items on the page could recast or resignify seemingly distinct components of the newspaper. This feature of multi-ethnic editorial communication makes it untenable to extract and analyze isolated textual or visual items from these newspapers.

Computational methods make it possible to analyze the visual patterns of multi-ethnic newspapers at scale. While prior research has approached these questions through textual analysis, a pivot to computational visual analysis compensates for the frequently noisy digital surrogates of surviving multi-ethnic newspapers, too often poorly preserved.

Figure 1: A visualization of the front page of the May 22, 1917, issue of La Prensa [lp_22_may_1917], annotated with bounding boxes of visual content derived from the Newspaper Navigator dataset [nn_dataset]. These bounding boxes are predictions made by a machine learning model showing, in this case, the locations of headlines, photographs, and maps (each bounding box includes the predicted class, as well as a confidence score, in the top left corner). In this paper, we utilize these bounding boxes to compare newspaper titles according to visual similarity.

Our working method uses machine learning to map the layouts of hundreds of thousands of newspaper pages, quantified by visual similarity, using the Newspaper Navigator dataset (as pictured in Figure 1) [nn_dataset]. Visualizing and comparing layouts at scale provides a wayfinding function, helping to direct archival researchers to distinctive and meaningful places in large collections of digitized newspapers and cultural heritage collections writ large.

In this paper, we describe work in progress that brings machine learning, library science, and literary studies into conversation. This work includes: (1) a dataset of visual features from 16.3 million pages of historical newspapers; (2) remediation in 207 catalog records for 309 ethnic editors; and (3) modes for reading the practical languages of editing in specific historical contexts. Prior research efforts blend together in our exploratory machine learning prototypes for quantitatively and qualitatively measuring the visual similarities of the formats of a newspaper page. These visual similarities can help illuminate the ideas and activism that flowed through the editorial craft and conduct of historical multi-ethnic newspapers. Moreover, our work formulating a “similarity score” affirms the promise of multi-disciplinary collaborations for improving access and analysis of larger-scale digitized newspaper collections.

2 Related Prior Work on Developing Computational Approaches to Multi-Ethnic Periodicals

Multiple fields are eager to explore the new possibilities for analyzing digitized collections of historical newspapers. Complementary conversations in computational, archival, and literary studies have begun to pursue those possibilities in rich but largely disparate ways. Silos persist even as many scholars make parallel use of the Library of Congress’s Chronicling America and other national newspaper digital initiatives[chronam, trove, pekarek_europeana_2012, willems_europeana_2015, kreitz_american_2018, delpher, british, ourdigitalworld, paperspast].

This paper builds on multi-ethnic press scholarship and archival research in Chronicling America and other repositories. The prevailing view of political activism in historical multi-ethnic newspapers has tended to focus on the textual and visual content that drove demands for social justice [nerone_us_2003, cole_how_2020, williams_cultivating_2015, stein_2007]. A complementary approach has focused on social histories, tracing the networks of a paper’s contributors and readers [foster_narrative_2005, gardner_black_2015, rhodes_mary_1998, spires_practice_2019, fagan_black_2016]. What can be easy to miss, however, is that editors can use the format of a paper itself to voice critiques and protests. The formal nature of editors’ craft is difficult to grasp in the meticulous approaches to archival research on a case-by-case basis. Our work aims to develop experimental machine learning applications to map the larger patterns of editorial practices and editing conventions that span languages, communities, and eras.

Conversations in library and archival studies are exploring how best to account for histories of race and ethnicity in metadata. Rather than being reduced to a theme or topic within a collection, specialists have rethought how race and ethnicity might work as organizing principles for collecting and describing historical materials. Race and ethnicity as categories of identity are historically contingent and socially constructed. That fluidity challenges the rigors of knowledge organization systems that impose stable categories and schemas. For some archivists, however, categories of race and identity productively serve as a form of provenance to guide collection development and access policies [wurl_ethnicity_2005, daniel_archival_2014, daniel_documenting_2014]. Many of these changes at the collection level are being addressed in the metadata and, more recently, through linked open data. Because the development of controlled vocabularies, such as the Library of Congress Subject Headings, and Machine Readable Cataloging (MARC) largely predate sensitivity to these histories in libraries and archives, metadata has become a vital space for rediscovering the historical editors of multi-ethnic newspapers [ventura_recovering_2019].

At the same time, an overabundance of work with more “computationally robust,” institutionally privileged white print collections has prompted preliminary conversations regarding the directions of computational humanities research. Indeed, from OCR engines that do not support indigenous languages or different dialects, to entire subfields of natural language processing and textual analysis that assume English as the default language of study, algorithmically-mediated erasure continues to be uncovered and detailed by scholars

[fagan_chronicling_2016, alpert_abrams_machine_2016, joshi_20, yakel_archival_2003, hardy, williams_cas, gustafson_ethnic_2015, risam_new_2018, noble_2018]. Many have advocated for the development of computational approaches for those collections that remain marginal. This paper concerns one such corrective for digitized periodicals: utilizing machine learning to study visual content and page layout rather than textual content, circumventing the use of OCR engines entirely.

Computational scholars have isolated visual content within periodicals in order to produce datasets of isolated content. Humanistic scholars have utilized these atomized items for interpretation. We are instead interested in the visual layouts themselves and the relations they imply on the page. By treating relations and patterns within layouts as forms of interpretive content – reading them as a “visual grammar” of photographs, advertisements, headlines, and beyond – we are in effect exploring a publication’s creative history, local contexts, and circulation networks.

3 Methods

The work described in this paper began in part with the Newspaper Navigator project, created as part of Library of Congress’s Innovator in Residence program in 2019 with the goal of excavating visual content in Chronicling America. Developed in partnership with LC Labs, the National Digital Newspaper Program, and IT Design & Development at the Library of Congress, Newspaper Navigator utilizes a machine learning model to extract seven visual features from 16 million pages in Chronicling America: advertisements, photographs, illustrations, maps, comics, editorial cartoons, and headlines [nn_dataset]. This machine learning model consists of an object detection model that had been finetuned on thousands of bounding box annotations of visual content created by volunteers as part of the Beyond Words crowdsourcing initiative launched by LC Labs in 2017 [beyond_words]. The resulting extracted visual content was publicly released as the Newspaper Navigator dataset, available in the public domain for all to use [nn_dataset].

Newspaper Navigator coincides with current trends in periodical studies. As Sean Latham and Robert Scholes argue in an influential essay, “We have often been too quick to see magazines [and other periodicals] as containers of discrete bits of information rather than autonomous objects of study” [latham_scholes_2006]. In practice, this means that periodical scholars tend to examine not only the individual pieces of content in a given publication, but also its overall construction, taking into account such architectural elements as a publication’s formats, columns, and visual layout. The typical methods to apply this approach require intensive reading of individual newspaper pages. Researchers gradually develop an intuition about the general style and flavor of a certain publication. A publication’s implicit patterns can be relatively intangible because periodical formats are necessarily fixed—to preserve continuity—and flexible—to accommodate new and changing scenarios. A paper looks like itself almost always, but not always. Even beyond academic inquiry, dedicated readers of any periodical often develop this kind of intuitive mental map of a publication’s page designs and formats.

We saw the possibility of combining these two types of reading. On one hand, Newspaper Navigator makes it possible to chart visual features across the entirety of Chronicling America. On the other hand, the methods in periodical studies encourage attention to the juxtaposition of visual features on a page. Starting with these two methods, we developed a set of preliminary working questions. How do the visual features on the page of a newspaper interlock and mutually inform each other? If we look at all of those features holistically, what kinds of styles and ideas can we detect over the lifespan of a single newspaper? More broadly, what added perspective can we gain by using the Newspaper Navigator to trace the sum of newspaper layouts over hundreds of publications? Finally, can we establish a “baseline” for newspaper production in the later 19th and early 20th century United States that would allow us to see the critical departures made by multi-ethnic newspapers in the service of social justice during moments of elevated racism or xenophobia?

3.1 Methods I: Adapting MARC 21 Metadata for Research

Adapting the Newspaper Navigator dataset to answer these questions required us to collect additional data about the subsets of multi-ethnic newspapers within Chronicling America. This information was partially extracted from the bibliographic data of newspaper titles in the machine-readable cataloging (MARC) format. All records for newspapers in Chronicling America follow the cataloging standards of the Cooperative ONline SERials Program (CONSER) [conser_about_loc], providing a somewhat normalized set of features for sorting newspapers. Additionally, Chronicling America’s partnering institutions provide a “title essay” that describes and contextualizes each newspaper. These unstructured data has recently been the focus of efforts for metadata enhancement in remediation projects geared to identifying ethnic editors [ortiz_baco_title_2019].

The last step in collecting and organizing our data was combining MARC fields and structured data generated from the text of title essays to find relevant publications. Metadata from the title essays allowed us to select only ethnic newspapers with identifiable editors. From this subset, we selected MARC fields used by cataloguers to record title changes, race, ethnicity, and languages associated with each publication. We initially hoped to find an indicator of visual change within the metadata for newspapers. We considered that serials with more than one language edition, language notes, or different titles over their publication runs would feature changes to visual and layout elements of newspapers to mark these relationships or changes.111The 780 and 785 fields indicate all preceding and succeeding titles for a publication, respectively. The 546 field describes the language of a resource while the 765 indicates the original language of a translated publication and the 775 captures other language editions of a publication Title changes, for example, are a cataloguing practice for linking preceding and succeeding titles of a publication that has had a substantial name change, which we assumed would also produce modifications to the graphic elements on the page. We were able to parse these metadata fields where the change inherent in the seriality of newspapers is recorded, which in turn provided parameters for capturing difference, similarity, and linking in an otherwise static knowledge organization system.

Figure 2: A heatmap of advertisements appearing on the front page of the Opelousas Courier, 1870-1909. Darker regions on the heatmaps correspond to a higher concentration of advertisement pages appearing in that region in aggregate over a given year.

3.2 Methods 2: Quantifying and Visualizing the Mise-en-Page

The first visual output we experimented with were heat maps, created with Newspaper Navigator data that encodes the positions of visual features on a newspaper page. Inspired by PageOneX [pageonex], these heat maps concisely summarize visual content patterns in an interpretable fashion. Figure 2 shows the locations and relationships of advertisements across four decades of The Opelousas Courier,222The Opelousas Courier is described in Chronicling America as a Louisiana multilingual newspaper, with content in French and English. https://chroniclingamerica.loc.gov/lccn/sn83026389/marc/. As an initial experiment in visual layout representation, the heat maps enabled us to trace the development of discrete visual features, otherwise impossible to quantify by examining front pages by eye.

This approach facilitates several comparative opportunities. We rendered multiple visual categories into heat maps to understand the interplay between visual components in a single publication. We also utilized the heat map graphics to compare different newspapers using the same visual categories. The heat maps offered a useful preliminary system for studying newspaper layout as historical information.

Heat maps added greater nuance to our historical questions about newspaper layout. For example, what are the relationships between visual content, editorial practices, and print technology? What are the roles of regional and local context in predicting layout? How do we account for journalistic trends, such as the rise of syndicated content or nationally-circulating advertisements?

The heat maps showed the need for further exploration. In analyzing one publication at a time, the heat maps reproduced the methods of archival periodical studies, even in visually abstracted ways. While the MARC records helped us to identify the titles we were most interested in rendering into heat maps, this piecemeal approach to newspaper metadata and layouts proved only a first step.

3.3 Methods 3: Toward a Visual Constellation of Multi-Ethnic Newspapers

Our next experiment attempted to scale up possible representations to refocus on illustrating change and similarity across tens, hundreds, and even thousands of publications. Our resulting exploratory model utilized the quantified heat maps to define a metric over visual similarity. In particular, the distance between two newspaper titles can be captured by the residuals from subtracting two heatmaps from one another. As an initial approach, we segmented each newspaper title’s run into individual years. For each title and year, we then treated the seven resulting heat maps (one for advertisements, one for headlines, etc.) collectively as a single, high-dimensional vector. In order to interpret the clusters in this high-dimensional space, we utilized T-SNE for dimensionality reduction

[tsne].

Figure 3: A 2-dimensional map of visual similarity across all ethnic titles in Chronicling America, filtered using [ortiz_baco_title_2019]. Each point on the similarity map represents the composite front page of a given newspaper title for a given year, from 1890 to 1909 (individual years are labeled on the visualization). The magnified cluster reveals that The Washington Bee, The Appeal, and The Colored American are largely grouped together across years (individual front pages from each title are shown in Figure 4). Notably, all three titles are from the Black press.
Figure 4: Sample front pages from The Appeal, The Washington Bee, and The Colored American from 1891, 1896, and 1900, respectively – all of which are title-year pairs appearing in the same cluster in Figure 3 [appeal_14_march_1891, appeal_9_may_1891, bee_29_feb_1896, bee_29_aug_1896, ca_29_feb_1900, ca_8_sep_1900]. All six front pages feature visual content – namely, illustrations and photographs of individuals – prominently in the center of the page.

This process resulted in exploratory constellations of newspaper titles, as shown in Figure 3. In this figure, we also show a close-up of one such “similarity cluster,” revealing that three Black newspapers (The Washington Bee, The Appeal, and The Colored American) are largely grouped together across years. In Figure 4, we show example front pages for these three titles from years included within the cluster. All six pictured front pages feature visual content on the center of the page. Notably, all six feature photographs and illustrations of individuals, a trend that is even more apparent when browsing these front pages across different issues. These images demonstrate that the similarity clusters, generated using Newspaper Navigator’s rubric of seven content classes, accurately reflect common visual patterns of layout in distinct newspapers across the Black newspapers held in Chronicling America. The shared qualities across each of the three newspapers pictured in Figure 4 are suggestive. They are roughly contemporaneous African American newspapers that locate specific types of images near the center of the front page. These results indicate the capacity of the method to produce a multifaceted research question: what can we learn from the use of visual culture and portraiture in the material pages of late nineteenth-century African American newspapers?

These constellations of similarity provide a means to uncover small networks to focus on and interpret at a much more granular level. This is not a fully quantitative representation, but rather an exploratory framework for identifying and interpreting networks of related publications and the editorial practices that shape them. Given our overarching interest in the historical trajectories and meanings of newspaper formats, these visualizations show great promise. The constellation map is a model for how to convert raw bounding box information into interpretable provocations about historical similarity. Similarity clusters introduce visual layouts as clues to a shared context, inviting new research into the different historical traditions, technological moments, or political causes that comprise the histories of editorship.

4 Findings & Future Work

Our ongoing collaboration offers three central findings to date. First, this work finds that full level CONSER MARC records provide useful data features in machine-learning applications. Specifically, the most value comes from fields that register change over time, such as preceding and succeeding titles, and describe relationships between different editions of the same publication. Further metadata enhancement focused on identifying ethnic editors will significantly improve the interpretative value of visualizations and pattern recognition from large-scale collections like Chronicling America.

Second, this collaboration reveals how computational notions of layout similarity can be leveraged for humanistic inquiry. Using the machine learning-constructed Newspaper Navigator dataset of visual content, we have generated heat maps and constellation maps for exploring trends of visual layout within and across newspaper titles, respectively. These visualizations provide us with new visual grammars and affordances for excavating editorial practices.

Third, this work required us to find a collaborative working process for creating multidisciplinary methods of analysis that engage with machine learning, library science, and the humanities. Rather than trying to blend or transplant our respective research methods, we focused on creating a shared vocabulary that could speak to current research agendas in each of the relevant fields. This set of theoretical exchanges required iterative conversations, frequently pedagogical in nature, to find a method that is not merely extractive and static but generative and dynamic. This process will help guide the future growth of conversations in computational periodicals research.

Our roadmap for future work is guided by the provocations offered throughout this paper. First, we plan to further utilize MARC data and the constellation maps in concert with one another in order to inform our understanding of both. The MARC data has the capacity to guide our navigation and understanding of the constellation maps; conversely, the constellation maps provide mechanisms for record enhancement by allowing scholars, catalogers, and librarians alike to sift through incomplete legacy records of ethnic publications. We offer this direction of future work in pursuit of uncovering editorship in ethnic presses. We plan to refine our similarity metrics and visualizations from a computational perspective in pursuit of this goal as well. The results of this research will drive new inquiry into the longer and varied histories of radical dissent and veiled protest in the multi-ethnic press.

Acknowledgements.
The authors would like to thank and acknowledge Dr. Molly Hardy, Senior Program Officer at the National Endowment for the Humanities, along with Georgia Higley and Vanessa Mitchell from the Serial and Government Publications Division at the Library of Congress, for sharing their expertise. This work would not be possible without Dr. Hardy’s input and advice.

References