Context-Based Quotation Recommendation

05/17/2020
by   Ansel MacLaughlin, et al.
0

While composing a new document, anything from a news article to an email or essay, authors often utilize direct quotes from a variety of sources. Although an author may know what point they would like to make, selecting an appropriate quote for the specific context may be time-consuming and difficult. We therefore propose a novel context-aware quote recommendation system which utilizes the content an author has already written to generate a ranked list of quotable paragraphs and spans of tokens from a given source document. We approach quote recommendation as a variant of open-domain question answering and adapt the state-of-the-art BERT-based methods from open-QA to our task. We conduct experiments on a collection of speech transcripts and associated news articles, evaluating models' paragraph ranking and span prediction performances. Our experiments confirm the strong performance of BERT-based methods on this task, which outperform bag-of-words and neural ranking baselines by more than 30 Qualitative analyses show the difficulty of the paragraph and span recommendation tasks and confirm the quotability of the best BERT model's predictions, even if they are not the true selected quotes from the original news articles.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2013

Personalized News Recommendation with Context Trees

The profusion of online news articles makes it difficult to find interes...
research
05/05/2023

NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

To enhance the ability to find credible evidence in news articles, we pr...
research
01/31/2021

Extending Neural Keyword Extraction with TF-IDF tagset matching

Keyword extraction is the task of identifying words (or multi-word expre...
research
02/24/2022

BERTVision – A Parameter-Efficient Approach for Question Answering

We present a highly parameter efficient approach for Question Answering ...
research
04/10/2019

Harvey Mudd College at SemEval-2019 Task 4: The Clint Buchanan Hyperpartisan News Detector

We investigate the recently developed Bidirectional Encoder Representati...
research
04/17/2019

Headline Generation: Learning from Decomposed Document Titles

We propose a novel method for generating titles for unstructured text do...

Please sign up or login with your details

Forgot password? Click here to reset