Less is More for Long Document Summary Evaluation by LLMs

09/14/2023
by   Yunshu Wu, et al.
0

Large Language Models (LLMs) have shown promising performance in summary evaluation tasks, yet they face challenges such as high computational costs and the Lost-in-the-Middle problem where important information in the middle of long documents is often overlooked. To address these issues, this paper introduces a novel approach, Extract-then-Evaluate, which involves extracting key sentences from a long source document and then evaluating the summary by prompting LLMs. The results reveal that the proposed method not only significantly reduces evaluation costs but also exhibits a higher correlation with human evaluations. Furthermore, we provide practical recommendations for optimal document length and sentence extraction methods, contributing to the development of cost-effective yet more accurate methods for LLM-based text generation evaluation.

READ FULL TEXT

page 1

page 8

page 9

research
03/19/2021

Play the Shannon Game With Language Models: A Human-Free Approach to Summary Evaluation

The goal of a summary is to concisely state the most important informati...
research
08/28/2022

Podcast Summary Assessment: A Resource for Evaluating Summary Assessment Methods

Automatic summary assessment is useful for both machine-generated and hu...
research
03/01/2021

Long Document Summarization in a Low Resource Setting using Pretrained Language Models

Abstractive summarization is the task of compressing a long document int...
research
05/14/2021

Extracting Variable-Depth Logical Document Hierarchy from Long Documents: Method, Evaluation, and Application

In this paper, we study the problem of extracting variable-depth "logica...
research
06/26/2023

Vietnamese multi-document summary using subgraph selection approach – VLSP 2022 AbMuSu Shared Task

Document summarization is a task to generate afluent, condensed summary ...
research
04/11/2019

Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation

Conducting a manual evaluation is considered an essential part of summar...
research
04/06/2020

At Which Level Should We Extract? An Empirical Study on Extractive Document Summarization

Extractive methods have proven to be very effective in automatic documen...

Please sign up or login with your details

Forgot password? Click here to reset