Contextualization of topics: Browsing through the universe of bibliographic information

02/27/2017
by   Rob Koopman, et al.
0

This paper describes how semantic indexing can help to generate a contextual overview of topics and visually compare clusters of articles. The method was originally developed for an innovative information exploration tool, called Ariadne, which operates on bibliographic databases with tens of millions of records. In this paper, the method behind Ariadne is further developed and applied to the research question of the special issue "Same data, different results" - the better understanding of topic (re-)construction by different bibliometric approaches. For the case of the Astro dataset of 111,616 articles in astronomy and astrophysics, a new instantiation of the interactive exploring tool, LittleAriadne, has been created. This paper contributes to the overall challenge to delineate and define topics in two different ways. First, we produce two clustering solutions based on vector representations of articles in a lexical space. These vectors are built on semantic indexing of entities associated with those articles. Second, we discuss how LittleAriadne can be used to browse through the network of topical terms, authors, journals, citations and various cluster solutions of the Astro dataset. More specifically, we treat the assignment of an article to the different clustering solutions as an additional element of its bibliographic record. Keeping the principle of semantic indexing on the level of such an extended list of entities of the bibliographic record, LittleAriadne in turn provides a visualization of the context of a specific clustering solution. It also conveys the similarity of article clusters produced by different algorithms, hence representing a complementary approach to other possible means of comparison.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/16/2017

Clustering articles based on semantic similarity

Document clustering is generally the first step for topic identification...
research
02/27/2017

Mutual Information based labelling and comparing clusters

After a clustering solution is generated automatically, labelling these ...
research
05/03/2020

Extracting Entities and Topics from News and Connecting Criminal Records

The goal of this paper is to summarize methodologies used in extracting ...
research
12/22/2016

ScienceWISE: Topic Modeling over Scientific Literature Networks

We provide an up-to-date view on the knowledge management system Science...
research
10/31/2018

On the Persistence of Clustering Solutions and True Number of Clusters in a Dataset

Typically clustering algorithms provide clustering solutions with prespe...
research
04/11/2022

Rank One Approximation as a Strategy for Wordle

This paper presents a mathematical method of playing the puzzle game Wor...
research
10/03/2012

Logical segmentation for article extraction in digitized old newspapers

Newspapers are documents made of news item and informative articles. The...

Please sign up or login with your details

Forgot password? Click here to reset