Attention-based Neural Text Segmentation

08/29/2018
by   Pinkesh Badjatiya, et al.

Text segmentation plays an important role in various Natural Language Processing (NLP) tasks such as summarization, context understanding, document indexing, and document noise removal. Previous methods for this task require manual feature engineering, large amounts of memory, and long execution times. To the best of our knowledge, this paper is the first to present a supervised neural approach for text segmentation. Specifically, we propose an attention-based bidirectional LSTM model in which sentence embeddings are learned using CNNs and segments are predicted based on contextual information. The model can automatically handle variable-sized context information. Compared to the existing competitive baselines, the proposed model shows a performance improvement of 7
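
To make the idea of attending over variable-sized context concrete, here is a minimal sketch in plain Python. It is not the authors' model: the abstract does not give the scoring function, so dot-product scoring is an assumption, and the names `softmax` and `attend` are hypothetical. The sentence vectors are stand-ins for the CNN-learned sentence embeddings, and the BiLSTM is omitted entirely.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, context):
    """Pool a variable-length list of context vectors into one vector.

    Assumption: dot-product scoring between the query sentence vector and
    each context sentence vector (the abstract does not specify this).
    Returns the pooled vector and the attention weights.
    """
    scores = [sum(q * c for q, c in zip(query, vec)) for vec in context]
    weights = softmax(scores)
    dim = len(query)
    pooled = [
        sum(w * vec[i] for w, vec in zip(weights, context))
        for i in range(dim)
    ]
    return pooled, weights


# Toy 2-dimensional "sentence embeddings"; real ones would come from a CNN.
query = [1.0, 0.0]
context = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
pooled, weights = attend(query, context)
```

Because the weights are normalized over however many context sentences are supplied, the same function works for a context of one sentence or of twenty, which is the property the abstract refers to as handling variable-sized context.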


research
05/20/2021

Bidirectional LSTM-CRF Attention-based Model for Chinese Word Segmentation

Chinese word segmentation (CWS) is the basic of Chinese natural language...
research
10/06/2020

An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks

Typically, tokenization is the very first step in most text processing w...
research
03/25/2018

Text Segmentation as a Supervised Learning Task

Text segmentation, the task of dividing a document into contiguous segme...
research
08/16/2019

Learning Conceptual-Contextual Embeddings for Medical Text

External knowledge is often useful for natural language understanding ta...
research
05/11/2020

CrisisBERT: a Robust Transformer for Crisis Classification and Contextual Crisis Embedding

Classification of crisis events, such as natural disasters, terrorist at...
research
05/11/2020

CrisisBERT: Robust Transformer for Crisis Classification and Contextual Crisis Embedding

Classification of crisis events, such as natural disasters, terrorist at...
research
05/13/2018

An attention-based Bi-GRU-CapsNet model for hypernymy detection between compound entities

Named entities which composed of multiple continuous words frequently oc...
