Utilização de Grafos e Matriz de Similaridade na Sumarização Automática de Documentos Baseada em Extração de Frases

02/05/2016
by   Elvys Linhares Pontes, et al.
0

The internet increased the amount of information available. However, the reading and understanding of this information are costly tasks. In this scenario, the Natural Language Processing (NLP) applications enable very important solutions, highlighting the Automatic Text Summarization (ATS), which produce a summary from one or more source texts. Automatically summarizing one or more texts, however, is a complex task because of the difficulties inherent to the analysis and generation of this summary. This master's thesis describes the main techniques and methodologies (NLP and heuristics) to generate summaries. We have also addressed and proposed some heuristics based on graphs and similarity matrix to measure the relevance of judgments and to generate summaries by extracting sentences. We used the multiple languages (English, French and Spanish), CSTNews (Brazilian Portuguese), RPM (French) and DECODA (French) corpus to evaluate the developped systems. The results obtained were quite interesting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2017

A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification

Text summarization and text simplification are two major ways to simplif...
research
10/24/2018

A Multilingual Study of Compressive Cross-Language Text Summarization

Cross-Language Text Summarization (CLTS) generates summaries in a langua...
research
01/10/2021

Summaformers @ LaySumm 20, LongSumm 20

Automatic text summarization has been widely studied as an important tas...
research
03/19/2017

Métodos de Otimização Combinatória Aplicados ao Problema de Compressão MultiFrases

The Internet has led to a dramatic increase in the amount of available i...
research
03/01/2023

Uzbek text summarization based on TF-IDF

The volume of information is increasing at an incredible rate with the r...
research
04/28/2020

Human-Like Summaries from Heterogeneous and Time-Windowed Software Development Artefacts

Automatic text summarisation has drawn considerable interest in the area...
research
04/12/2021

Paragraph-level Simplification of Medical Texts

We consider the problem of learning to simplify medical texts. This is i...

Please sign up or login with your details

Forgot password? Click here to reset