How UMass-FSD Inadvertently Leverages Temporal Bias

08/02/2022
by Dominik Wurzer, et al.

First Story Detection is the task of identifying new events in a stream of documents. The UMass-FSD system is known for its strong performance in First Story Detection competitions and has recently been used frequently as a high-accuracy baseline in research publications. We are the first to discover that UMass-FSD inadvertently leverages temporal bias. Interestingly, the discovered bias contrasts with previously known biases and performs significantly better. Our analysis reveals an increased contribution of temporally distant documents, resulting from an unusual way of handling incremental term statistics. We show that this form of temporal bias is also applicable to other well-known First Story Detection systems, where it improves detection accuracy. To provide a more generalizable conclusion and demonstrate that the observed bias is not merely an artefact of a particular implementation, we present a model that intentionally leverages a bias on temporal distance. Our model significantly improves the detection effectiveness of state-of-the-art First Story Detection systems.
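
For readers unfamiliar with the task, the following is a minimal sketch of a generic nearest-neighbour First Story Detection loop with incrementally updated term statistics. It is not UMass-FSD and does not reproduce the bias described in the abstract; the class name `IncrementalFSD`, the smoothed IDF formula, and the novelty threshold of 0.5 are all assumptions chosen for illustration. Each incoming document is weighted with the document frequencies known at its arrival time, compared against all earlier documents, and flagged as a first story when its nearest neighbour is not similar enough.

```python
import math
from collections import Counter


class IncrementalFSD:
    """Hypothetical nearest-neighbour detector; not the UMass-FSD implementation."""

    def __init__(self, threshold=0.5):
        self.threshold = threshold  # assumed novelty cut-off
        self.doc_freq = Counter()   # term -> number of documents seen containing it
        self.num_docs = 0
        self.seen = []              # weighted vectors of earlier documents

    def _idf(self, term):
        # Smoothed IDF (an assumption); always positive because doc_freq <= num_docs.
        return math.log((self.num_docs + 1) / (self.doc_freq[term] + 0.5))

    def _vectorize(self, tokens):
        tf = Counter(tokens)
        return {term: count * self._idf(term) for term, count in tf.items()}

    @staticmethod
    def _cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    def process(self, tokens):
        # Update the term statistics incrementally, then weight the new document.
        self.num_docs += 1
        self.doc_freq.update(set(tokens))
        vec = self._vectorize(tokens)
        # Stored vectors keep the weights computed when they arrived.
        nearest = max((self._cosine(vec, old) for old in self.seen), default=0.0)
        novelty = 1.0 - nearest
        self.seen.append(vec)
        return novelty, novelty >= self.threshold


stream = [
    ["earthquake", "hits", "city"],
    ["earthquake", "hits", "city"],
    ["election", "results", "announced"],
]
detector = IncrementalFSD()
flags = [detector.process(doc)[1] for doc in stream]
print(flags)
```

In this sketch the first and third documents are flagged as first stories, while the repeated second document is not. A model that deliberately favours temporally distant documents, as the abstract proposes, would have to adjust the similarity or the term weights by document age; how that is done is not specified here.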


