DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization

11/06/2018
by   Jiaxin Shi, et al.
0

We propose DeepChannel, a robust, data-efficient, and interpretable neural model for extractive document summarization. Given any document-summary pair, we estimate a salience score, which is modeled using an attention-based deep neural network, to represent the salience degree of the summary for yielding the document. We devise a contrastive training strategy to learn the salience estimation network, and then use the learned salience score as a guide and iteratively extract the most salient sentences from the document as our generated summary. In experiments, our model not only achieves state-of-the-art ROUGE scores on CNN/Daily Mail dataset, but also shows strong robustness in the out-of-domain test on DUC2007 test set. Moreover, our model reaches a ROUGE-1 F-1 score of 39.41 on CNN/Daily Mail test set with merely 1 / 100 training set, demonstrating a tremendous data efficiency.

READ FULL TEXT
research
07/06/2018

Neural Document Summarization by Jointly Learning to Score and Select Sentences

Sentence scoring and sentence selection are two main steps in extractive...
research
09/19/2019

Summary Level Training of Sentence Rewriting for Abstractive Summarization

As an attempt to combine extractive and abstractive summarization, Sente...
research
07/19/2021

MemSum: Extractive Summarization of Long Documents using Multi-step Episodic Markov Decision Processes

We introduce MemSum (Multi-step Episodic Markov decision process extract...
research
09/29/2022

COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization

Traditional training paradigms for extractive and abstractive summarizat...
research
03/18/2020

Selective Attention Encoders by Syntactic Graph Convolutional Networks for Document Summarization

Abstractive text summarization is a challenging task, and one need to de...
research
09/25/2018

BanditSum: Extractive Summarization as a Contextual Bandit

In this work, we propose a novel method for training neural networks to ...
research
09/10/2020

Classification of descriptions and summary using multiple passes of statistical and natural language toolkits

This document describes a possible approach that can be used to check th...

Please sign up or login with your details

Forgot password? Click here to reset