Abstractive Summarization of Reddit Posts with Multi-level Memory Networks

11/02/2018
by   Byeongchang Kim, et al.
0

We address the problem of abstractive summarization in two directions: proposing a novel dataset and a new model. First, we collect Reddit TIFU dataset, consisting of 120K posts from the online discussion forum Reddit. We use such informal crowd-generated posts as text source, because we empirically observe that existing datasets mostly use formal documents as source text such as news articles; thus, they could suffer from some biases that key sentences usually located at the beginning of the text and favorable summary candidates are already inside the text in nearly exact forms. Such biases can not only be structural clues of which extractive methods better take advantage, but also be obstacles that hinder abstractive methods from learning their text abstraction capability. Second, we propose a novel abstractive summarization model named multi-level memory networks (MMN), equipped with multi-level memory to store the information of text from different levels of abstraction. With quantitative evaluation and user studies via Amazon Mechanical Turk, we show the Reddit TIFU dataset is highly abstractive and the MMN outperforms the state-of-the-art summarization models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2021

Topic Modeling Based Extractive Text Summarization

Text summarization is an approach for identifying important information ...
research
04/24/2018

Data-driven Summarization of Scientific Articles

Data-driven approaches to sequence-to-sequence modelling have been succe...
research
07/15/2020

Dialect Diversity in Text Summarization on Twitter

Extractive summarization algorithms can be used on Twitter data to retur...
research
05/03/2018

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification

Text summarization and sentiment classification both aim to capture the ...
research
12/06/2022

KATSum: Knowledge-aware Abstractive Text Summarization

Text Summarization is recognised as one of the NLP downstream tasks and ...
research
05/08/2018

A Memory Network Approach for Story-based Temporal Summarization of 360° Videos

We address the problem of story-based temporal summarization of long 360...
research
01/15/2016

Detecting and Extracting Events from Text Documents

Events of various kinds are mentioned and discussed in text documents, w...

Please sign up or login with your details

Forgot password? Click here to reset