Controlled Text Reduction

10/24/2022
by   Aviv Slobodkin, et al.
0

Producing a reduced version of a source text, as in generic or focused summarization, inherently involves two distinct subtasks: deciding on targeted content and generating a coherent text conveying it. While some popular approaches address summarization as a single end-to-end task, prominent works support decomposed modeling for individual subtasks. Further, semi-automated text reduction is also very appealing, where users may identify targeted content while models would generate a corresponding coherent summary. In this paper, we focus on the second subtask, of generating coherent text given pre-selected content. Concretely, we formalize Controlled Text Reduction as a standalone task, whose input is a source text with marked spans of targeted content ("highlighting"). A model then needs to generate a coherent text that includes all and only the target information. We advocate the potential of such models, both for modular fully-automatic summarization, as well as for semi-automated human-in-the-loop use cases. Facilitating proper research, we crowdsource high-quality dev and test datasets for the task. Further, we automatically generate a larger "silver" training dataset from available summarization benchmarks, leveraging a pretrained summary-source alignment model. Finally, employing these datasets, we present a supervised baseline model, showing promising results and insightful analyses.

READ FULL TEXT
research
08/16/2023

SummHelper: Collaborative Human-Computer Summarization

Current approaches for text summarization are predominantly automatic, w...
research
02/26/2018

Tone Biased MMR Text Summarization

Text summarization is an interesting area for researchers to develop new...
research
03/27/2018

Deep Communicating Agents for Abstractive Summarization

We present deep communicating agents in an encoder-decoder architecture ...
research
10/08/2020

A Cascade Approach to Neural Abstractive Summarization with Content Selection and Fusion

We present an empirical study in favor of a cascade architecture to neur...
research
02/04/2022

SummaryLens – A Smartphone App for Exploring Interactive Use of Automated Text Summarization in Everyday Life

We present SummaryLens, a concept and prototype for a mobile tool that l...
research
03/15/2022

Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

The IMPRESSIONS section of a radiology report about an imaging study is ...
research
02/27/2021

PRISM: A Unified Framework of Parameterized Submodular Information Measures for Targeted Data Subset Selection and Summarization

With increasing data, techniques for finding smaller, yet effective subs...

Please sign up or login with your details

Forgot password? Click here to reset