Evidence Transfer for Improving Clustering Tasks Using External Categorical Evidence

11/09/2018
by Athanasios Davvetas, et al.

In this paper we introduce evidence transfer for clustering, a deep learning method that incrementally manipulates the latent representations of an autoencoder according to external categorical evidence in order to improve a clustering outcome. The method is deployed on top of a baseline solution and reduces the cross entropy between the external evidence and an extension of the latent space. We define evidence transfer as the process by which the categorical outcome of an external, auxiliary task is exploited to improve a primary task, in this case representation learning for clustering. Our proposed method makes no assumptions regarding the categorical evidence presented, nor the structure of the latent space. We compare our method against the baseline solution by performing k-means clustering before and after its deployment. Experiments with three different kinds of evidence show that our method effectively manipulates the latent representations when presented with real, corresponding evidence, while remaining robust when presented with low-quality evidence.
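
To make the objective described above concrete, the following is a minimal sketch, not the authors' implementation. It assumes that the "extension of the latent space" is a single dense layer followed by a softmax, and that the cross entropy term is simply added to a mean squared reconstruction error with a weight `lam`. The names `evidence_head`, `evidence_transfer_loss`, `weights`, `bias` and `lam` are hypothetical and do not come from the paper.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(evidence, predicted, eps=1e-12):
    """Cross entropy H(evidence, predicted) between two categorical distributions."""
    return -sum(e * math.log(p + eps) for e, p in zip(evidence, predicted))

def evidence_head(latent, weights, bias):
    """Assumed 'extension of the latent space': one dense layer plus softmax."""
    logits = [sum(w * z for w, z in zip(row, latent)) + b
              for row, b in zip(weights, bias)]
    return softmax(logits)

def evidence_transfer_loss(x, x_rec, latent, evidence, weights, bias, lam=1.0):
    """Assumed joint objective: reconstruction error + lam * cross entropy."""
    rec = sum((a - b) ** 2 for a, b in zip(x, x_rec)) / len(x)
    ce = cross_entropy(evidence, evidence_head(latent, weights, bias))
    return rec + lam * ce

# Toy example: a 2-d latent code, an identity evidence head, perfect reconstruction.
x = [0.5, 0.1, 0.9]
latent = [2.0, -1.0]
weights = [[1.0, 0.0], [0.0, 1.0]]
bias = [0.0, 0.0]

# Evidence that agrees with the latent code versus evidence that contradicts it.
agree = evidence_transfer_loss(x, x, latent, [1.0, 0.0], weights, bias)
disagree = evidence_transfer_loss(x, x, latent, [0.0, 1.0], weights, bias)
print(agree < disagree)
```

In this toy setting the loss is lower when the one-hot evidence matches the category that the latent code already favours, so minimising it with respect to the encoder would push latent codes towards agreement with the evidence. The before/after comparison mentioned in the abstract would then run k-means on the latent codes of the baseline autoencoder and again on the codes obtained after this additional training.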
