Semi-crowdsourced Clustering with Deep Generative Models

10/29/2018
by   Yucen Luo, et al.
0

We consider the semi-supervised clustering problem where crowdsourcing provides noisy information about the pairwise comparisons on a small subset of data, i.e., whether a sample pair is in the same cluster. We propose a new approach that includes a deep generative model (DGM) to characterize low-level features of the data, and a statistical relational model for noisy pairwise annotations on its subset. The two parts share the latent variables. To make the model automatically trade-off between its complexity and fitting data, we also develop its fully Bayesian variant. The challenge of inference is addressed by fast (natural-gradient) stochastic variational inference algorithms, where we effectively combine variational message passing for the relational part and amortized learning of the DGM under a unified framework. Empirical results on synthetic and real-world datasets show that our model outperforms previous crowdsourced clustering methods.

READ FULL TEXT
research
09/16/2018

A Deep Generative Model for Semi-Supervised Classification with Noisy Labels

Class labels are often imperfectly observed, due to mistakes and to genu...
research
05/28/2015

Automatic Relevance Determination For Deep Generative Models

A recurring problem when building probabilistic latent variable models i...
research
04/05/2021

Semi-Supervised Clustering with Inaccurate Pairwise Annotations

Pairwise relational information is a useful way of providing partial sup...
research
12/12/2020

Learning Consistent Deep Generative Models from Sparse Data via Prediction Constraints

We develop a new framework for learning variational autoencoders and oth...
research
04/03/2017

Semi-Supervised Generation with Cluster-aware Generative Models

Deep generative models trained with large amounts of unlabelled data hav...
research
12/11/2014

A Topic Modeling Approach to Ranking

We propose a topic modeling approach to the prediction of preferences in...
research
06/30/2020

Semi-supervised Sequential Generative Models

We introduce a novel objective for training deep generative time-series ...

Please sign up or login with your details

Forgot password? Click here to reset