Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information

05/19/2023
by   Yingqiang Gao, et al.
0

The abstracts of scientific papers consist of premises and conclusions. Structured abstracts explicitly highlight the conclusion sentences, whereas non-structured abstracts may have conclusion sentences at uncertain positions. This implicit nature of conclusion positions makes the automatic segmentation of scientific abstracts into premises and conclusions a challenging task. In this work, we empirically explore using Normalized Mutual Information (NMI) for abstract segmentation. We consider each abstract as a recurrent cycle of sentences and place segmentation boundaries by greedily optimizing the NMI score between premises and conclusions. On non-structured abstracts, our proposed unsupervised approach GreedyCAS achieves the best performance across all evaluation metrics; on structured abstracts, GreedyCAS outperforms all baseline methods measured by P_k. The strong correlation of NMI to our evaluation metrics reveals the effectiveness of NMI for abstract segmentation.

READ FULL TEXT

page 12

page 13

research
07/03/2023

Normalized mutual information is a biased measure for classification and community detection

Normalized mutual information is widely used as a similarity measure for...
research
04/09/2016

On the Composition of Scientific Abstracts

Scientific abstracts contain what is considered by the author(s) as info...
research
05/11/2020

Segmenting Scientific Abstracts into Discourse Categories: A Deep Learning-Based Approach for Sparse Labeled Data

The abstract of a scientific paper distills the contents of the paper in...
research
09/08/2021

A Bayesian Framework for Information-Theoretic Probing

Pimentel et al. (2020) recently analysed probing from an information-the...
research
11/17/2020

Mutual Information Based Method for Unsupervised Disentanglement of Video Representation

Video Prediction is an interesting and challenging task of predicting fu...
research
10/26/2021

Assessing the Sufficiency of Arguments through Conclusion Generation

The premises of an argument give evidence or other reasons to support a ...
research
10/25/2022

Topical Segmentation of Spoken Narratives: A Test Case on Holocaust Survivor Testimonies

The task of topical segmentation is well studied, but previous work has ...

Please sign up or login with your details

Forgot password? Click here to reset