Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities

10/31/2022
by   Nianzu Ma, et al.
0

Much of the existing work on text novelty detection has been studied at the topic level, i.e., identifying whether the topic of a document or a sentence is novel or not. Little work has been done at the fine-grained semantic level (or contextual level). For example, given that we know Elon Musk is the CEO of a technology company, the sentence "Elon Musk acted in the sitcom The Big Bang Theory" is novel and surprising because normally a CEO would not be an actor. Existing topic-based novelty detection methods work poorly on this problem because they do not perform semantic reasoning involving relations between named entities in the text and their background knowledge. This paper proposes an effective model (called PAT-SND) to solve the problem, which can also characterize the novelty. An annotated dataset is also created. Evaluation shows that PAT-SND outperforms 10 baselines by large margins.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2021

NEREL: A Russian Dataset with Nested Named Entities, Relations and Events

In this paper, we present NEREL, a Russian dataset for named entity reco...
research
05/10/2021

Word-level Human Interpretable Scoring Mechanism for Novel Text Detection Using Tsetlin Machines

Recent research in novelty detection focuses mainly on document-level cl...
research
06/14/2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset

Multiple entities in a document generally exhibit complex inter-sentence...
research
04/18/2021

A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation

Large pretrained generative models like GPT-3 often suffer from hallucin...
research
10/07/2021

Contextual Sentence Classification: Detecting Sustainability Initiatives in Company Reports

We introduce the novel task of detecting sustainability initiatives in c...
research
04/16/2019

Sameness Attracts, Novelty Disturbs, but Outliers Flourish in Fanfiction Online

The nature of what people enjoy is not just a central question for the c...
research
04/30/2016

An Improved System for Sentence-level Novelty Detection in Textual Streams

Novelty detection in news events has long been a difficult problem. A nu...

Please sign up or login with your details

Forgot password? Click here to reset