Visualizing textual models with in-text and word-as-pixel highlighting

06/20/2016
by Abram Handler, et al.

We explore two techniques that use color to make sense of statistical text models. The first uses in-text annotations to illustrate a model's view of particular tokens in particular documents. The second uses a high-level "words-as-pixels" graphic to display an entire corpus. Together, these methods offer both zoomed-in and zoomed-out perspectives on a model's understanding of text. We show how these interconnected methods help diagnose a classifier's poor performance on Twitter slang and make sense of a topic model of historical political texts.
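
The two views described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the color scheme, the score range, or the rendering, so everything here is an assumption made for illustration: the function names (`score_to_rgb`, `highlight_in_text`, `words_as_pixels`), the red-white-blue scale, and per-token scores in [-1, 1] are hypothetical and are not the authors' implementation.

```python
from html import escape


def score_to_rgb(score):
    """Map a score in [-1, 1] to an RGB triple (assumed red-white-blue scale)."""
    s = max(-1.0, min(1.0, score))  # clamp out-of-range scores
    fade = round(255 * (1 - abs(s)))
    # Positive scores shade toward blue, negative toward red, zero is white.
    return (fade, fade, 255) if s >= 0 else (255, fade, fade)


def highlight_in_text(tokens, scores):
    """Zoomed-in view: wrap each token in an HTML span colored by its score."""
    spans = []
    for token, score in zip(tokens, scores):
        r, g, b = score_to_rgb(score)
        spans.append(
            f'<span style="background-color: rgb({r},{g},{b})">'
            f"{escape(token)}</span>"
        )
    return " ".join(spans)


def words_as_pixels(corpus_scores):
    """Zoomed-out view: one row per document, one colored pixel per word."""
    return [[score_to_rgb(s) for s in doc] for doc in corpus_scores]
```

Here `highlight_in_text` yields markup for reading a single document closely, while `words_as_pixels` returns a ragged grid of colors that an image library could draw as one row per document. Both share a single color mapping, so a pixel in the corpus view and a highlighted token in the document view mean the same thing.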

research
05/21/2021

Word-level Text Highlighting of Medical Texts for Telehealth Services

The medical domain is often subject to information overload. The digitiz...
research
10/22/2017

How big is big enough? Unsupervised word sense disambiguation using a very large corpus

In this paper, the problem of disambiguating a target word for Polish is...
research
04/29/2020

Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense Disambiguation

State-of-the-art methods for Word Sense Disambiguation (WSD) combine two...
research
02/24/2020

Emosaic: Visualizing Affective Content of Text at Varying Granularity

This paper presents Emosaic, a tool for visualizing the emotional tone o...
research
04/10/2022

ClioQuery: Interactive Query-Oriented Text Analytics for Comprehensive Investigation of Historical News Archives

Historians and archivists often find and analyze the occurrences of quer...
research
11/07/2020

Naturalization of Text by the Insertion of Pauses and Filler Words

In this article, we introduce a set of methods to naturalize text based ...
research
01/19/2017

Profiling of OCR'ed Historical Texts Revisited

In the absence of ground truth it is not possible to automatically deter...
