Topic Segmentation Model Focusing on Local Context

01/05/2023
by   Jeonghwan Lee, et al.
0

Topic segmentation is important in understanding scientific documents since it can not only provide better readability but also facilitate downstream tasks such as information retrieval and question answering by creating appropriate sections or paragraphs. In the topic segmentation task, topic coherence is critical in predicting segmentation boundaries. Most of the existing models have tried to exploit as many contexts as possible to extract useful topic-related information. However, additional context does not always bring promising results, because the local context between sentences becomes incoherent despite more sentences being supplemented. To alleviate this issue, we propose siamese sentence embedding layers which process two input sentences independently to get appropriate amount of information without being hampered by excessive information. Also, we adopt multi-task learning techniques including Same Topic Prediction (STP), Topic Classification (TC) and Next Sentence Prediction (NSP). When these three classification layers are combined in a multi-task manner, they can make up for each other's limitations, improving performance in all three tasks. We experiment different combinations of the three layers and report how each layer affects other layers in the same combination as well as the overall segmentation performance. The model we proposed achieves the state-of-the-art result in the WikiSection dataset.

READ FULL TEXT
research
08/18/2019

TDAM: a Topic-Dependent Attention Model for Sentiment Analysis

We propose a topic-dependent attention model for sentiment classificatio...
research
07/20/2016

An Adaptation of Topic Modeling to Sentences

Advances in topic modeling have yielded effective methods for characteri...
research
05/21/2019

Answering while Summarizing: Multi-task Learning for Multi-hop QA with Evidence Extraction

Question answering (QA) using textual sources such as reading comprehens...
research
11/18/2019

Multi-task Sentence Encoding Model for Semantic Retrieval in Question Answering Systems

Question Answering (QA) systems are used to provide proper responses to ...
research
04/28/2021

Exploring Relational Context for Multi-Task Dense Prediction

The timeline of computer vision research is marked with advances in lear...
research
12/30/2022

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

Keyphrase identification and classification is a Natural Language Proces...
research
10/07/2020

Improving Context Modeling in Neural Topic Segmentation

Topic segmentation is critical in key NLP tasks and recent works favor h...

Please sign up or login with your details

Forgot password? Click here to reset