Deep Bayes Factor Scoring for Authorship Verification

08/23/2020
by Benedikt Boenninghoff, et al.

The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts. Fanfiction is a fan-written extension of an existing storyline, and the so-called fandom topic describes the principal subject of a document. The data provided for the PAN 2020 AV task is challenging because it includes authors whose texts span several different fandom topics. In this work, we present a hierarchical fusion of two well-known approaches into a single end-to-end learning procedure. At the bottom, a deep metric learning framework learns a pseudo-metric by mapping a document of variable length onto a fixed-size feature vector. At the top, a probabilistic layer performs Bayes factor scoring in the learned metric space. We also provide text preprocessing strategies to deal with the cross-topic issue.
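
To make the idea of Bayes factor scoring in a learned metric space more concrete, the following is a minimal sketch, not the authors' implementation. It assumes the two documents have already been mapped to fixed-size vectors `y1` and `y2`, and it assumes a Gaussian two-covariance model that the abstract does not specify: each vector is an author-specific mean drawn from N(mu, S_b) plus within-author noise drawn from N(0, S_w). The names `gaussian_logpdf`, `log_bayes_factor`, `same_author_probability`, `S_b`, and `S_w` are hypothetical and chosen for illustration only.

```python
import numpy as np


def gaussian_logpdf(x, mean, cov):
    """Log density of a multivariate normal, using only NumPy."""
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (x.size * np.log(2.0 * np.pi) + logdet + maha)


def log_bayes_factor(y1, y2, mu, S_b, S_w):
    """log p(y1, y2 | same author) - log p(y1, y2 | different authors).

    Assumed model (illustrative): y = x + e with x ~ N(mu, S_b) and
    e ~ N(0, S_w). If both documents share the author variable x, their
    vectors are correlated through S_b; otherwise they are independent.
    """
    total = S_b + S_w
    zeros = np.zeros_like(S_b)
    pair = np.concatenate([y1, y2])
    pair_mean = np.concatenate([mu, mu])
    cov_same = np.block([[total, S_b], [S_b, total]])
    cov_diff = np.block([[total, zeros], [zeros, total]])
    return (gaussian_logpdf(pair, pair_mean, cov_same)
            - gaussian_logpdf(pair, pair_mean, cov_diff))


def same_author_probability(log_bf, prior_same=0.5):
    """Turn a log Bayes factor into a posterior via the prior odds."""
    log_odds = log_bf + np.log(prior_same) - np.log1p(-prior_same)
    return 1.0 / (1.0 + np.exp(-log_odds))


# Toy parameters (assumed, not estimated from any data).
mu = np.zeros(2)
S_b = np.eye(2)          # between-author covariance
S_w = 0.1 * np.eye(2)    # within-author covariance

close_pair = log_bayes_factor(np.array([1.5, -1.0]), np.array([1.5, -1.0]), mu, S_b, S_w)
far_pair = log_bayes_factor(np.array([1.5, -1.0]), np.array([-1.5, 1.0]), mu, S_b, S_w)
print(same_author_probability(close_pair), same_author_probability(far_pair))
```

In this sketch a positive log Bayes factor favours the same-author hypothesis and a negative one favours different authors. In an end-to-end setup such as the one the abstract describes, the parameters playing the role of `mu`, `S_b`, and `S_w` would be learned jointly with the encoder rather than fixed by hand as they are here.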
