Variational Autoencoder with Embedded Student-t Mixture Model for Authorship Attribution

05/28/2020
by   Benedikt Boenninghoff, et al.
0

Traditional computational authorship attribution describes a classification task in a closed-set scenario. Given a finite set of candidate authors and corresponding labeled texts, the objective is to determine which of the authors has written another set of anonymous or disputed texts. In this work, we propose a probabilistic autoencoding framework to deal with this supervised classification task. More precisely, we are extending a variational autoencoder (VAE) with embedded Gaussian mixture model to a Student-t mixture model. Autoencoders have had tremendous success in learning latent representations. However, existing VAEs are currently still bound by limitations imposed by the assumed Gaussianity of the underlying probability distributions in the latent space. In this work, we are extending the Gaussian model for the VAE to a Student-t model, which allows for an independent control of the "heaviness" of the respective tails of the implied probability densities. Experiments over an Amazon review dataset indicate superior performance of the proposed method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2020

Open-Set Recognition with Gaussian Mixture Variational Autoencoders

In inference, open-set classification is to either classify a sample int...
research
03/17/2019

Topic-Guided Variational Autoencoders for Text Generation

We propose a topic-guided variational autoencoder (TGVAE) model for text...
research
02/11/2019

Variational Autoencoder with Truncated Mixture of Gaussians for Functional Connectivity Analysis

Resting-state functional connectivity states are often identified as clu...
research
10/31/2018

Dirichlet Variational Autoencoder for Text Modeling

We introduce an improved variational autoencoder (VAE) for text modeling...
research
07/09/2021

Lifelong Mixture of Variational Autoencoders

In this paper, we propose an end-to-end lifelong learning mixture of exp...
research
10/20/2019

Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models

This paper describes a framework for the development of an integrative c...
research
04/26/2020

Similarity Learning-Based Device Attribution

Methods and systems for attributing browsing activity from two or more d...

Please sign up or login with your details

Forgot password? Click here to reset