Domain-topic models with chained dimensions: modeling the evolution of a major oncology conference (1995-2017)

12/31/2019
by Alexandre Hannud Abdo, et al.

In this paper we introduce a novel approach for the computational analysis of research activities and their dynamics. Named SASHIMI (Symmetrical And Sequential analysis from Hierarchical Inference of Multidimensional Information), our approach provides a multi-level description of the structure of scientific activities and offers several advantages over traditional methods such as topic models or network analyses. It generates a dual description of a corpus in terms of research domains (collections of documents) and topics (collections of words), and extends this description to clusters of associated dimensions, such as time. SASHIMI requires access only to the textual content of individual documents, whereas other science-mapping approaches depend on specific metadata such as citations, authors, or keywords. We illustrate the analytical power of the method by applying it to an original dataset: the 1995-2017 collection of abstracts presented at ASCO, the largest annual oncology research conference. We show that SASHIMI detects significant temporal patterns and identifies the major thematic transformations of oncology that underlie them.
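
To make the dual description concrete, here is a minimal sketch in Python. It does not perform the hierarchical inference that SASHIMI relies on. It assumes the partitions are already known and only shows how a corpus can then be summarized by domains (groups of documents), topics (groups of words), and a chained dimension such as time. The toy corpus, the block labels, and the function names are all hypothetical and hand-assigned for illustration.

```python
from collections import Counter

# Toy corpus (hypothetical): document id -> tokens.
docs = {
    "d1": ["tumor", "chemotherapy", "trial"],
    "d2": ["chemotherapy", "trial", "survival"],
    "d3": ["gene", "mutation", "tumor"],
    "d4": ["gene", "mutation", "sequencing"],
}

# Assumed partitions, hand-assigned here. In the paper they would be
# inferred from the text; this sketch skips the inference entirely.
domain_of = {"d1": "D_clinical", "d2": "D_clinical",
             "d3": "D_genomic", "d4": "D_genomic"}
topic_of = {
    "tumor": "T_disease", "survival": "T_disease",
    "chemotherapy": "T_treatment", "trial": "T_treatment",
    "gene": "T_molecular", "mutation": "T_molecular",
    "sequencing": "T_molecular",
}
# Assumed chained dimension: the year each document was presented.
year_of = {"d1": 1995, "d2": 1996, "d3": 2016, "d4": 2017}


def domain_topic_counts(docs, domain_of, topic_of):
    """Count tokens falling in each (domain, topic) pair."""
    counts = Counter()
    for doc_id, tokens in docs.items():
        for token in tokens:
            counts[(domain_of[doc_id], topic_of[token])] += 1
    return counts


def domain_year_counts(domain_of, year_of):
    """Count documents falling in each (domain, year) pair."""
    counts = Counter()
    for doc_id, domain in domain_of.items():
        counts[(domain, year_of[doc_id])] += 1
    return counts


dt = domain_topic_counts(docs, domain_of, topic_of)
dy = domain_year_counts(domain_of, year_of)

for (domain, topic), n in sorted(dt.items()):
    print(domain, topic, n)
for (domain, year), n in sorted(dy.items()):
    print(domain, year, n)
```

The first table reads both ways: by row it describes a domain through the topics it uses, and by column it describes a topic through the domains that use it. The second table relates the same domains to time using only document-level information.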

