The Influence of Data Pre-processing and Post-processing on Long Document Summarization

12/03/2021
by   Xinwei Du, et al.
0

Long document summarization is an important and hard task in the field of natural language processing. A good performance of the long document summarization reveals the model has a decent understanding of the human language. Currently, most researches focus on how to modify the attention mechanism of the transformer to achieve a higher ROUGE score. The study of data pre-processing and post-processing are relatively few. In this paper, we use two pre-processing methods and a post-processing method and analyze the effect of these methods on various long document summarization models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2021

Impact of Data Processing on Fairness in Supervised Learning

We study the impact of pre and post processing for reducing discriminati...
research
11/04/2018

Char2char Generation with Reranking for the E2E NLG Challenge

This paper describes our submission to the E2E NLG Challenge. Recently, ...
research
05/04/2015

Learning Document Image Binarization from Data

In this paper we present a fully trainable binarization solution for deg...
research
10/10/2021

Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization

Long text understanding is important yet challenging in natural language...
research
09/02/2023

A Post-Processing Based Bengali Document Layout Analysis with YOLOV8

This paper focuses on enhancing Bengali Document Layout Analysis (DLA) u...
research
06/22/2023

Adversarial guesswork with quantum side information

The guesswork of a classical-quantum channel quantifies the cost incurre...
research
04/03/2023

A Comparison of Document Similarity Algorithms

Document similarity is an important part of Natural Language Processing ...

Please sign up or login with your details

Forgot password? Click here to reset