History Playground: A Tool for Discovering Temporal Trends in Massive Textual Corpora

06/04/2018
by Thomas Lansdall-Welfare, et al.

Recent studies have shown that macroscopic patterns of continuity and change over the course of centuries can be detected through the analysis of time series extracted from massive textual corpora. Similar data-driven approaches have already revolutionised the natural sciences, and are widely believed to hold similar potential for the humanities and social sciences, driven by the mass-digitisation projects that are currently under way, and coupled with the ever-increasing number of documents which are "born digital". As such, new interactive tools are required to discover and extract macroscopic patterns from these vast quantities of textual data. Here we present History Playground, an interactive web-based tool for discovering trends in massive textual corpora. The tool makes use of scalable algorithms to first extract trends from textual corpora, before making them available for real-time search and discovery, presenting users with an interface to explore the data. Included in the tool are algorithms for standardisation, regression, change-point detection in the relative frequencies of n-grams, multi-term indices and comparison of trends across different corpora.
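As a rough illustration of two of the operations named in the abstract (relative frequencies of n-grams and standardisation), the following minimal sketch computes a yearly relative-frequency series for a single word and converts it to z-scores. It is not the History Playground implementation: the function names `relative_frequencies` and `standardise`, the whitespace tokenisation, the restriction to 1-grams, and the toy corpus are all assumptions made for this example.

```python
from collections import Counter
from statistics import mean, pstdev


def relative_frequencies(docs_by_year, term):
    """Return {year: occurrences of term / total tokens in that year}.

    Assumes a single-word term (1-gram) and naive whitespace tokenisation.
    """
    series = {}
    for year, docs in sorted(docs_by_year.items()):
        counts = Counter(tok for doc in docs for tok in doc.lower().split())
        total = sum(counts.values())
        series[year] = counts[term] / total if total else 0.0
    return series


def standardise(series):
    """Z-score a {year: value} series; a constant series maps to zeros."""
    values = list(series.values())
    mu, sigma = mean(values), pstdev(values)
    return {year: (v - mu) / sigma if sigma else 0.0
            for year, v in series.items()}


# Hypothetical toy corpus, keyed by publication year.
corpus = {
    1900: ["the train arrived late", "a horse and a train"],
    1901: ["the train and the tram", "train train train"],
    1902: ["the motor car passed the horse"],
}

freq = relative_frequencies(corpus, "train")
z = standardise(freq)
```

Standardising each series in this way puts terms with very different absolute frequencies on a common scale, which is one plausible reason a trend-comparison tool would offer it.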

research
10/31/2017

Doris: A tool for interactive exploration of historic corpora (Extended Version)

Insights into social phenomenon can be gleaned from trends and patterns ...
research
08/19/2021

A Framework for Neural Topic Modeling of Text Corpora

Topic Modeling refers to the problem of discovering the main topics that...
research
12/31/2022

Logic Mill – A Knowledge Navigation System

Logic Mill is a scalable and openly accessible software system that iden...
research
03/23/2016

The Anatomy of a Search and Mining System for Digital Archives

Samtla (Search And Mining Tools with Linguistic Analysis) is a digital h...
research
03/13/2017

MetaPAD: Meta Pattern Discovery from Massive Text Corpora

Mining textual patterns in news, tweets, papers, and many other kinds of...
research
04/27/2020

Automatic Textual Evidence Mining in COVID-19 Literature

We created this EVIDENCEMINER system for automatic textual evidence mini...
research
12/20/2019

"The Squawk Bot": Joint Learning of Time Series and Text Data Modalities for Automated Financial Information Filtering

Multimodal analysis that uses numerical time series and textual corpora ...
