The method of automatic summarization from different sources

05/04/2019
by Nataliya Shakhovska, et al.

This article analyzes technology for automatic text abstracting and annotation. It describes the role of annotation in the automatic search and classification of scientific articles. An algorithm for summarizing natural language documents using the concept of importance coefficients is developed. This concept makes it possible to account for the peculiarities of the subject areas and topics found in different kinds of documents. A method for generating an abstract of a single document based on frequency analysis is developed. The recognition elements for unstructured text analysis are given. A method for the pre-processing analysis of several documents is developed. This technique simultaneously considers both statistical approaches to abstracting and the importance of terms in a particular subject domain. The quality of the generated abstracts is evaluated. An expert evaluation of the developed system was conducted; it was held only for texts in Ukrainian. The abstract produced by the developed system has a higher aggregate score on all criteria. The architecture of the summarization system is built. The CASE tool AllFusion ERwin Data Modeler is used to build the information system model. The database schema for storing information was built. The system is designed to work primarily with Ukrainian texts, which gives a significant advantage, since most modern systems are still oriented toward English texts.
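
To illustrate how frequency analysis can be combined with domain-specific importance coefficients, the following minimal Python sketch scores each sentence by the weighted frequency of its terms and keeps the top-scoring sentences in their original order. It is not the authors' algorithm: the function names (`split_sentences`, `tokenize`, `summarize`), the `importance` dictionary, the length normalization, and the default weight of 1.0 are all assumptions made for illustration, and the sketch omits stop-word removal and morphological normalization, which a system for Ukrainian text would likely need.

```python
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    # \w is Unicode-aware for str patterns, so Cyrillic words are matched too.
    return re.findall(r"\w+", sentence.lower())


def summarize(text, importance=None, n_sentences=2, default_weight=1.0):
    """Extractive summary: term frequency weighted by hypothetical importance coefficients."""
    importance = importance or {}
    sentences = split_sentences(text)
    # Term frequencies over the whole document.
    freq = Counter(tok for s in sentences for tok in tokenize(s))

    def score(sentence):
        tokens = tokenize(sentence)
        if not tokens:
            return 0.0
        # Weight each term's frequency by its domain coefficient, then
        # normalize by sentence length so long sentences are not favored.
        total = sum(freq[t] * importance.get(t, default_weight) for t in tokens)
        return total / len(tokens)

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:n_sentences])  # restore document order
    return " ".join(sentences[i] for i in chosen)
```

Calling `summarize(text, importance={"term": 3.0}, n_sentences=1)` would favor sentences containing `term`; the coefficients themselves would have to come from a domain vocabulary, which this sketch simply assumes is given.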

05/31/2021

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Faceted summarization provides briefings of a document from different pe...
07/15/2020

Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation

Summarizing texts is not a straightforward task. Before even considering...
08/24/2019

Automatic Text Summarization of Legal Cases: A Hybrid Approach

Manual Summarization of large bodies of text involves a lot of human eff...
04/01/2019

Automatic text summarization: What has been done and what has to be done

Summaries are important when it comes to process huge amounts of informa...
02/10/2018

To the problem of "The Instrumental complex for ontological engineering purpose" software system design

The given work describes methodological principles of design instrumenta...
12/14/2018

Measuring Similarity: Computationally Reproducing the Scholar's Interests

Computerized document classification already orders the news articles th...
02/11/2016

Variations of the Similarity Function of TextRank for Automated Summarization

This article presents new alternatives to the similarity function for th...