STRASS: A Light and Effective Method for Extractive Summarization Based on Sentence Embeddings

07/16/2019
by   Léo Bouscarrat, et al.

This paper introduces STRASS: Summarization by TRAnsformation Selection and Scoring. It is an extractive text summarization method that leverages the semantic information in existing sentence embedding spaces. Our method creates an extractive summary by selecting the sentences whose embeddings are closest to the document embedding. The model learns a transformation of the document embedding to maximize the similarity between the extractive summary and the ground truth summary. As the transformation consists of only a dense layer, training can be done on CPU and is therefore inexpensive. Moreover, inference time is short and linear in the number of sentences. As a second contribution, we introduce the French CASS dataset, composed of judgments from the French Court of cassation and their corresponding summaries. On this dataset, our results show that our method performs similarly to state-of-the-art extractive methods, with low training and inference time.
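The selection step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the use of cosine similarity as the closeness measure, the fixed `threshold` selection rule, the helper names (`cosine`, `dense`, `select_sentences`), and the toy weights are all hypothetical. The abstract only states that a dense layer transforms the document embedding and that the closest sentences are selected.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def dense(x: Vector, weights: Sequence[Vector], bias: Vector) -> List[float]:
    """A single dense layer without activation: weights @ x + bias."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def select_sentences(sentence_embs: Sequence[Vector], doc_emb: Vector,
                     weights: Sequence[Vector], bias: Vector,
                     threshold: float = 0.5) -> List[int]:
    """Return indices of sentences close to the transformed document embedding.

    Assumption: "close" means cosine similarity >= threshold. Each sentence
    is scored once, so the cost grows linearly with the number of sentences.
    """
    target = dense(doc_emb, weights, bias)
    return [i for i, s in enumerate(sentence_embs)
            if cosine(s, target) >= threshold]


# Toy 2-dimensional embeddings (made-up values, for illustration only).
sentences = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
doc = [0.5, 0.5]
identity = [[1.0, 0.0], [0.0, 1.0]]   # leaves the document embedding unchanged
learned = [[1.0, 0.0], [0.0, 0.2]]    # hypothetical "learned" transformation
zero_bias = [0.0, 0.0]

selected = select_sentences(sentences, doc, learned, zero_bias, threshold=0.9)
print(selected)
```

In this toy example the hypothetical transformation shifts the document embedding toward the first axis, so the first and third sentences are selected while the second is dropped. In a trained model the weights would be fitted on pairs of documents and reference summaries rather than set by hand.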


