Revisiting data augmentation for subspace clustering

07/20/2022
by   Maryam Abdolali, et al.
0

Subspace clustering is the classical problem of clustering a collection of data samples that approximately lie around several low-dimensional subspaces. The current state-of-the-art approaches for this problem are based on the self-expressive model which represents the samples as linear combination of other samples. However, these approaches require sufficiently well-spread samples for accurate representation which might not be necessarily accessible in many applications. In this paper, we shed light on this commonly neglected issue and argue that data distribution within each subspace plays a critical role in the success of self-expressive models. Our proposed solution to tackle this issue is motivated by the central role of data augmentation in the generalization power of deep neural networks. We propose two subspace clustering frameworks for both unsupervised and semi-supervised settings that use augmented samples as an enlarged dictionary to improve the quality of the self-expressive representation. We present an automatic augmentation strategy using a few labeled samples for the semi-supervised problem relying on the fact that the data samples lie in the union of multiple linear subspaces. Experimental results confirm the effectiveness of data augmentation, as it significantly improves the performance of general self-expressive models.

READ FULL TEXT

page 19

page 38

research
10/08/2020

A Critique of Self-Expressive Deep Subspace Clustering

Subspace clustering is an unsupervised clustering technique designed to ...
research
05/01/2019

Self-Supervised Convolutional Subspace Clustering Network

Subspace clustering methods based on data self-expression have become ve...
research
11/24/2021

PMSSC: Parallelizable Multi-Subset based Self-Expressive Model for Subspace Clustering

Subspace clustering methods embrace a self-expressive model that represe...
research
03/26/2022

Metropolis-Hastings Data Augmentation for Graph Neural Networks

Graph Neural Networks (GNNs) often suffer from weak-generalization due t...
research
06/22/2023

AugDMC: Data Augmentation Guided Deep Multiple Clustering

Clustering aims to group similar objects together while separating dissi...
research
10/19/2019

LSTM-Assisted Evolutionary Self-Expressive Subspace Clustering

Massive volumes of high-dimensional data that evolves over time is conti...
research
10/05/2021

Deep Subspace analysing for Semi-Supervised multi-label classification of Diabetic Foot Ulcer

Diabetes is a global raising pandemic. Diabetes patients are at risk of ...

Please sign up or login with your details

Forgot password? Click here to reset