Narrative Incoherence Detection

12/21/2020
by   Deng Cai, et al.
13

Motivated by the increasing popularity of intelligent editing assistant, we introduce and investigate the task of narrative incoherence detection: Given a (corrupted) long-form narrative, decide whether there exists some semantic discrepancy in the narrative flow. Specifically, we focus on the missing sentence and incoherent sentence detection. Despite its simple setup, this task is challenging as the model needs to understand and analyze a multi-sentence narrative text, and make decisions at the sentence level. As an initial step towards this task, we implement several baselines either directly analyzing the raw text (token-level) or analyzing learned sentence representations (sentence-level). We observe that while token-level modeling enjoys greater expressive power and hence better performance, sentence-level modeling possesses an advantage in efficiency and flexibility. With pre-training on large-scale data and cycle-consistent sentence embedding, our extended sentence-level model can achieve comparable detection accuracy to the token-level model. As a by-product, such a strategy enables simultaneous incoherence detection and infilling/modification suggestions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2023

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding

Recent studies have shown that dual encoder models trained with the sent...
research
05/24/2022

Learning for Expressive Task-Related Sentence Representations

NLP models learn sentence representations for downstream tasks by tuning...
research
06/04/2019

Toward Grammatical Error Detection from Sentence Labels: Zero-shot Sequence Labeling with CNNs and Contextualized Embeddings

Zero-shot grammatical error detection is the task of tagging token-level...
research
12/09/2021

Semantic Search as Extractive Paraphrase Span Detection

In this paper, we approach the problem of semantic search by framing the...
research
10/17/2022

Multi-granularity Argument Mining in Legal Texts

In this paper, we explore legal argument mining using multiple levels of...
research
08/02/2022

Lost in Space Marking

We look at a decision taken early in training a subword tokenizer, namel...
research
02/02/2023

The Fewer Splits are Better: Deconstructing Readability in Sentence Splitting

In this work, we focus on sentence splitting, a subfield of text simplif...

Please sign up or login with your details

Forgot password? Click here to reset