From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

09/08/2023
by   Griffin Adams, et al.
0

Selecting the “right” amount of information to include in a summary is a difficult task. A good summary should be detailed and entity-centric without being overly dense and hard to follow. To better understand this tradeoff, we solicit increasingly dense GPT-4 summaries with what we refer to as a “Chain of Density” (CoD) prompt. Specifically, GPT-4 generates an initial entity-sparse summary before iteratively incorporating missing salient entities without increasing the length. Summaries generated by CoD are more abstractive, exhibit more fusion, and have less of a lead bias than GPT-4 summaries generated by a vanilla prompt. We conduct a human preference study on 100 CNN DailyMail articles and find that that humans prefer GPT-4 summaries that are more dense than those generated by a vanilla prompt and almost as dense as human written summaries. Qualitative analysis supports the notion that there exists a tradeoff between informativeness and readability. 500 annotated CoD summaries, as well as an extra 5,000 unannotated summaries, are freely available on HuggingFace (https://huggingface.co/datasets/griffin/chain_of_density).

READ FULL TEXT
research
04/05/2022

EntSUM: A Data Set for Entity-Centric Summarization

Controllable summarization aims to provide summaries that take into acco...
research
04/03/2019

Jointly Extracting and Compressing Documents with Summary State Representations

We present a new neural model for text summarization that first extracts...
research
10/13/2020

Sensitivity of BLANC to human-scored qualities of text summaries

We explore the sensitivity of a document summary quality estimator, BLAN...
research
03/19/2020

Boosting Factual Correctness of Abstractive Summarization

A commonly observed problem with abstractive summarization is the distor...
research
05/03/2023

Characterizing Political Bias in Automatic Summaries: A Case Study of Trump and Biden

Growing literature has shown that powerful NLP systems may encode social...
research
07/30/2019

Abstractive Document Summarization without Parallel Data

Abstractive summarization typically relies on large collections of paire...
research
02/05/2021

"I Don't Think So": Disagreement-Based Policy Summaries for Comparing Agents

With Artificial Intelligence on the rise, human interaction with autonom...

Please sign up or login with your details

Forgot password? Click here to reset