Structuring an unordered text document

01/29/2019
by   Shashank Yadav, et al.
0

Segmenting an unordered text document into different sections is a very useful task in many text processing applications like multiple document summarization, question answering, etc. This paper proposes structuring of an unordered text document based on the keywords in the document. We test our approach on Wikipedia documents using both statistical and predictive methods such as the TextRank algorithm and Google's USE (Universal Sentence Encoder). From our experimental results, we show that the proposed model can effectively structure an unordered document into sections.

READ FULL TEXT

page 1

page 2

page 3

research
01/20/2023

Document Summarization with Text Segmentation

In this paper, we exploit the innate document segment structure for impr...
research
01/08/2022

Coherence-Based Distributed Document Representation Learning for Scientific Documents

Distributed document representation is one of the basic problems in natu...
research
11/27/2018

Isabelle/jEdit as IDE for Domain-specific Formal Languages and Informal Text Documents

Isabelle/jEdit is the main application of the Prover IDE (PIDE) framewor...
research
09/16/2019

Document classification methods

Information on different fields which are collected by users requires ap...
research
03/03/2013

Genetic Programming for Document Segmentation and Region Classification Using Discipulus

Document segmentation is a method of rending the document into distinct ...
research
11/30/2018

Document Structure Measure for Hypernym discovery

Hypernym discovery is the problem of finding terms that have is-a relati...
research
10/31/2017

Replace or Retrieve Keywords In Documents at Scale

In this paper we introduce, the FlashText algorithm for replacing keywor...

Please sign up or login with your details

Forgot password? Click here to reset