Cross-referencing using Fine-grained Topic Modeling

05/18/2019
by   Jeffrey Lund, et al.
0

Cross-referencing, which links passages of text to other related passages, can be a valuable study aid for facilitating comprehension of a text. However, cross-referencing requires first, a comprehensive thematic knowledge of the entire corpus, and second, a focused search through the corpus specifically to find such useful connections. Due to this, cross-reference resources are prohibitively expensive and exist only for the most well-studied texts (e.g. religious texts). We develop a topic-based system for automatically producing candidate cross-references which can be easily verified by human annotators. Our system utilizes fine-grained topic modeling with thousands of highly nuanced and specific topics to identify verse pairs which are topically related. We demonstrate that our system can be cost effective compared to having annotators acquire the expertise necessary to produce cross-reference resources unaided.

READ FULL TEXT

page 3

page 5

page 6

research
10/16/2022

Coordinated Topic Modeling

We propose a new problem called coordinated topic modeling that imitates...
research
03/16/2020

HELFI: a Hebrew-Greek-Finnish Parallel Bible Corpus with Cross-Lingual Morpheme Alignment

Twenty-five years ago, morphologically aligned Hebrew-Finnish and Greek-...
research
08/30/2022

Image-Specific Information Suppression and Implicit Local Alignment for Text-based Person Search

Text-based person search is a challenging task that aims to search pedes...
research
08/30/2017

Learning Fine-Grained Knowledge about Contingent Relations between Everyday Events

Much of the user-generated content on social media is provided by ordina...
research
07/27/2022

CompText: Visualizing, Comparing Understanding Text Corpus

A common practice in Natural Language Processing (NLP) is to visualize t...
research
06/10/2019

Detecting Everyday Scenarios in Narrative Texts

Script knowledge consists of detailed information on everyday activities...
research
02/08/2022

Police Text Analysis: Topic Modeling and Spatial Relative Density Estimation

We analyze a large corpus of police incident narrative documents in unde...

Please sign up or login with your details

Forgot password? Click here to reset