Benchmarking Large Language Models for News Summarization

01/31/2023
by   Tianyi Zhang, et al.
0

Large language models (LLMs) have shown promise for automatic summarization but the reasons behind their successes are poorly understood. By conducting a human evaluation on ten LLMs across different pretraining methods, prompts, and model scales, we make two important observations. First, we find instruction tuning, and not model size, is the key to the LLM's zero-shot summarization capability. Second, existing studies have been limited by low-quality references, leading to underestimates of human performance and lower few-shot and finetuning performance. To better evaluate LLMs, we perform human evaluation over high-quality summaries we collect from freelance writers. Despite major stylistic differences such as the amount of paraphrasing, we find that LMM summaries are judged to be on par with human written summaries.

READ FULL TEXT

page 1

page 5

page 8

research
09/18/2023

Summarization is (Almost) Dead

How well can large language models (LLMs) generate summaries? We develop...
research
05/22/2023

Are Large Language Models Good Evaluators for Abstractive Summarization?

Human evaluations are often required for abstractive summary evaluations...
research
04/17/2021

Transductive Learning for Abstractive News Summarization

Pre-trained language models have recently advanced abstractive summariza...
research
11/29/2022

Zero-Shot Opinion Summarization with GPT-3

Very large language models such as GPT-3 have shown impressive performan...
research
09/22/2021

Recursively Summarizing Books with Human Feedback

A major challenge for scaling machine learning is training models to per...
research
05/10/2023

Summarizing, Simplifying, and Synthesizing Medical Evidence Using GPT-3 (with Varying Success)

Large language models, particularly GPT-3, are able to produce high qual...
research
07/28/2023

Summaries, Highlights, and Action items: Design, implementation and evaluation of an LLM-powered meeting recap system

Meetings play a critical infrastructural role in the coordination of wor...

Please sign up or login with your details

Forgot password? Click here to reset