Mining Events with Declassified Diplomatic Documents

12/20/2017
by   Yuanjun Gao, et al.
0

Since 1973 the State Department has been using electronic records systems to preserve classified communications. Recently, approximately 1.9 million of these records from 1973-77 have been made available by the U.S. National Archives. While some of these communication streams have periods witnessing an acceleration in the rate of transmission; others do not show any notable patterns in communication intensity. Given the sheer volume of these communications -- far greater than what had been available until now -- scholars need automated statistical techniques to identify the communications that warrant closer study. We develop a statistical framework that can semi-automatically identify from a large corpus of documents a handful that historians would consider more interesting electronic records. Our approach brings together related but distinct statistical concepts from nonparametric signal estimation and statistical hypothesis testing -- which when put together help us identify and analyze various geometrical aspects of the communication streams. Dominant periods of heightened and sustained activities aka bursts, as identified through these methods, correspond well with historical events recognized by standard reference works on the 1970s.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2016

Using Artificial Intelligence to Identify State Secrets

Whether officials can be trusted to protect national security informatio...
research
04/13/2021

Restoring and Mining the Records of the Joseon Dynasty via Neural Language Modeling and Machine Translation

Understanding voluminous historical records provides clues on the past i...
research
02/21/2023

Electronic Laboratory Notebook on Web2py Framework

Proper experimental record-keeping is an important cornerstone in resear...
research
02/12/2018

The Complex Event Recognition Group

The Complex Event Recognition (CER) group is a research team, affiliated...
research
04/28/2022

The Paper Pile at Home: Adopting Personal Electronic Records

Research has found that if respondents do not manage their personal reco...
research
09/04/2020

Externalizing Transformations of Historical Documents: Opportunities for Provenance-Driven Visualization

Transcription, annotation, digitization and/or visualization are common ...
research
06/15/2019

Modeling Consonance and its Relationships with Temperament, Harmony, and Electronic Amplification

After briefly revising the concepts of consonance/dissonance, a respecti...

Please sign up or login with your details

Forgot password? Click here to reset