Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library

02/03/2017
by   Jaimie Murdock, et al.
0

We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philosophy of scientific work on animal mind and cognition. The methods can be generalized to other research areas and ultimately support a system for semi-automatic identification of argument structures. We provide a case study for the application of the methods to the problem of identifying and extracting arguments about anthropomorphism during a critical period in the development of comparative psychology. We show how a combination of classification systems and mixed-membership models trained over large digital libraries can inform resource discovery in this domain. Through a novel approach of "drill-down" topic modeling---simultaneously reducing both the size of the corpus and the unit of analysis---we are able to reduce a large collection of fulltext volumes to a much smaller set of pages within six focal volumes containing arguments of interest to historians and philosophers of comparative psychology. The volumes identified in this way did not appear among the first ten results of the keyword search in the HathiTrust digital library and the pages bear the kind of "close reading" needed to generate original interpretations that is the heart of scholarly work in the humanities. Zooming back out, we provide a way to place the books onto a map of science originally constructed from very different data and for different purposes. The multilevel approach advances understanding of the intellectual and societal contexts in which writings are interpreted.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2021

Towards Full-Fledged Argument Search: A Framework for Extracting and Clustering Arguments from Unstructured Text

Argument search aims at identifying arguments in natural language texts....
research
04/24/2023

Generating Topic Pages for Scientific Concepts Using Scientific Publications

In this paper, we describe Topic Pages, an inventory of scientific conce...
research
09/24/2019

Recognizing Topic Change in Search Sessions of Digital Libraries based on Thesaurus and Classification System

Log analysis in Web search showed that user sessions often contain sever...
research
06/21/2018

Metadata Enrichment of Multi-Disciplinary Digital Library: A Semantic-based Approach

In the scientific digital libraries, some papers from different research...
research
09/17/2018

Categorization of Comparative Sentences for Argument Mining

We present the first work on domain-independent comparative argument min...
research
12/02/2015

Klasifikasi Komponen Argumen Secara Otomatis pada Dokumen Teks berbentuk Esai Argumentatif

By automatically recognize argument component, essay writers can do some...
research
09/30/2021

Library of Congress Subject Heading (LCSH) Browsing and Natural Language Searching

Controlled topical vocabularies (CVs) are built into information systems...

Please sign up or login with your details

Forgot password? Click here to reset