D2S: Document-to-Slide Generation Via Query-Based Text Summarization

05/08/2021
by   Edward Sun, et al.
0

Presentations are critical for communication in all areas of our lives, yet the creation of slide decks is often tedious and time-consuming. There has been limited research aiming to automate the document-to-slides generation process and all face a critical challenge: no publicly available dataset for training and benchmarking. In this work, we first contribute a new dataset, SciDuet, consisting of pairs of papers and their corresponding slides decks from recent years' NLP and ML conferences (e.g., ACL). Secondly, we present D2S, a novel system that tackles the document-to-slides task with a two-step approach: 1) Use slide titles to retrieve relevant and engaging text, figures, and tables; 2) Summarize the retrieved context into bullet points with long-form question answering. Our evaluation suggests that long-form QA outperforms state-of-the-art summarization baselines on both automated ROUGE metrics and qualitative human evaluation.

READ FULL TEXT
research
12/06/2022

Document-Level Abstractive Summarization

The task of automatic text summarization produces a concise and fluent t...
research
10/30/2022

How Far are We from Robust Long Abstractive Summarization?

Abstractive summarization has made tremendous progress in recent years. ...
research
10/06/2022

Just ClozE! A Fast and Simple Method for Evaluating the Factual Consistency in Abstractive Summarization

The issue of factual consistency in abstractive summarization has attrac...
research
05/24/2023

Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering

The integration of multi-document pre-training objectives into language ...
research
05/03/2023

AttenWalker: Unsupervised Long-Document Question Answering via Attention-based Graph Walking

Annotating long-document question answering (long-document QA) pairs is ...
research
10/10/2021

Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization

Long text understanding is important yet challenging in natural language...
research
07/01/2022

Conditional Generation with a Question-Answering Blueprint

The ability to convey relevant and faithful information is critical for ...

Please sign up or login with your details

Forgot password? Click here to reset