Meta-learning representations for clustering with infinite Gaussian mixture models

03/01/2021
by   Tomoharu Iwata, et al.
0

For better clustering performance, appropriate representations are critical. Although many neural network-based metric learning methods have been proposed, they do not directly train neural networks to improve clustering performance. We propose a meta-learning method that train neural networks for obtaining representations such that clustering performance improves when the representations are clustered by the variational Bayesian (VB) inference with an infinite Gaussian mixture model. The proposed method can cluster unseen unlabeled data using knowledge meta-learned with labeled data that are different from the unlabeled data. For the objective function, we propose a continuous approximation of the adjusted Rand index (ARI), by which we can evaluate the clustering performance from soft clustering assignments. Since the approximated ARI and the VB inference procedure are differentiable, we can backpropagate the objective function through the VB inference procedure to train the neural networks. With experiments using text and image data sets, we demonstrate that our proposed method has a higher adjusted Rand index than existing methods do.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2022

Meta-learning for Out-of-Distribution Detection via Density Estimation in Latent Space

Many neural network-based out-of-distribution (OoD) detection methods ha...
research
09/11/2020

An unsupervised deep learning framework via integrated optimization of representation learning and GMM-based modeling

While supervised deep learning has achieved great success in a range of ...
research
12/05/2022

Clustering with Neural Network and Index

A new model called Clustering with Neural Network and Index (CNNI) is in...
research
02/22/2016

Semi-supervised Clustering for Short Text via Deep Representation Learning

In this work, we propose a semi-supervised method for short text cluster...
research
10/09/2020

Few-shot Learning for Spatial Regression

We propose a few-shot learning method for spatial regression. Although G...
research
08/25/2021

Clustering acoustic emission data streams with sequentially appearing clusters using mixture models

The interpretation of unlabeled acoustic emission (AE) data classically ...
research
05/19/2018

Estimation of Non-Normalized Mixture Models and Clustering Using Deep Representation

We develop a general method for estimating a finite mixture of non-norma...

Please sign up or login with your details

Forgot password? Click here to reset