Deep Distribution-preserving Incomplete Clustering with Optimal Transport

03/21/2021
by Mingjie Luo, et al.

Clustering is a fundamental task in computer vision and machine learning. Although various methods have been proposed, the performance of existing approaches drops dramatically on incomplete high-dimensional data, which is common in real-world applications. To address this problem, we propose a novel deep incomplete clustering method, named Deep Distribution-preserving Incomplete Clustering with Optimal Transport (DDIC-OT). Existing methods make insufficient use of the samples because they rely on the few fully observed ones. To avoid this, we evaluate reconstruction by measuring the distance between distributions with optimal transport instead of using a traditional pixel-wise loss function. Moreover, a clustering loss on the latent features is introduced to regularize the embedding and make it more discriminative. As a consequence, the network becomes more robust to missing features, and the unified framework, which combines clustering and sample imputation, lets the two procedures negotiate and better serve each other. Extensive experiments demonstrate that the proposed network achieves superior and stable clustering performance compared with existing state-of-the-art incomplete clustering methods across different missing ratios.
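To make the contrast with a pixel-wise loss concrete, the following is a minimal sketch of an entropy-regularized optimal transport cost (Sinkhorn iterations) between two batches of samples. It is not the DDIC-OT implementation: the function name `sinkhorn_cost`, the uniform sample weights, the squared Euclidean ground cost, and the parameters `eps` and `n_iters` are all assumptions chosen for illustration. The point it shows is that the cost compares two sets of samples as distributions, so it needs no one-to-one pairing between a reconstruction and a fully observed target.

```python
import numpy as np


def sinkhorn_cost(x, y, eps=0.1, n_iters=200):
    """Entropy-regularized OT cost between two batches (illustrative sketch).

    x: array of shape (n, d); y: array of shape (m, d).
    Assumes uniform weights on the samples and a squared Euclidean ground cost.
    """
    n, m = x.shape[0], y.shape[0]
    # Pairwise squared Euclidean distances, shape (n, m).
    cost = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)
    # Scale the regularization by the largest cost to avoid underflow in exp.
    reg = eps * (cost.max() or 1.0)
    kernel = np.exp(-cost / reg)
    a = np.full(n, 1.0 / n)  # uniform source weights (assumption)
    b = np.full(m, 1.0 / m)  # uniform target weights (assumption)
    u = np.ones(n)
    v = np.ones(m)
    # Sinkhorn fixed-point updates that match the two marginals in turn.
    for _ in range(n_iters):
        u = a / (kernel @ v)
        v = b / (kernel.T @ u)
    plan = u[:, None] * kernel * v[None, :]
    # Transport cost under the (approximately) optimal regularized plan.
    return float((plan * cost).sum())
```

Because the cost depends only on the two sample sets, shuffling the rows of either batch leaves it unchanged, whereas a pixel-wise loss would change with the pairing. In a deep learning framework the same computation would be written with differentiable tensor operations so that it can serve as a training loss.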

