Probabilistic Dimensionality Reduction via Structure Learning

10/16/2016
by Li Wang, et al.

We propose a novel probabilistic dimensionality reduction framework that naturally integrates a generative model with the locality information of the data. Based on this framework, we present a new model that learns a smooth skeleton of embedding points in a low-dimensional space from high-dimensional noisy data. The formulation of the new model can be equivalently interpreted as two coupled learning problems, i.e., structure learning and the learning of a projection matrix. This interpretation motivates learning embedding points that directly form an explicit graph structure. We develop a new method to learn embedding points that form a spanning tree, and further extend it to obtain a discriminative and compact feature representation for clustering problems. Unlike traditional clustering methods, we assume that the centers of clusters should be close to each other if they are connected in the learned graph, and that other cluster centers should be distant. This can greatly facilitate data visualization and scientific discovery in downstream analysis. Extensive experiments demonstrate that the proposed framework obtains discriminative feature representations and correctly recovers the intrinsic structures of various real-world datasets.
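To make the two ingredients named in the abstract concrete (a projection matrix and a spanning tree over embedding points), here is a minimal sketch. It is not the paper's probabilistic model. As stand-ins, it assumes plain PCA for the projection matrix and Prim's algorithm for the minimum spanning tree on Euclidean distances. The function names `pca_project` and `minimum_spanning_tree` and the random toy data are hypothetical and chosen only for illustration.

```python
import numpy as np

def pca_project(X, d):
    """Project rows of X onto the top-d principal directions.

    Assumption: PCA is only a stand-in for a learned projection matrix.
    """
    Xc = X - X.mean(axis=0)
    # Rows of Vt are principal directions, ordered by singular value.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    W = Vt[:d].T  # (D, d) projection matrix
    return Xc @ W, W

def minimum_spanning_tree(Z):
    """Prim's algorithm on the complete Euclidean graph over rows of Z.

    Returns a list of (i, j) index pairs, one per tree edge.
    """
    n = Z.shape[0]
    dist = np.linalg.norm(Z[:, None, :] - Z[None, :, :], axis=-1)
    in_tree = np.zeros(n, dtype=bool)
    in_tree[0] = True
    best = dist[0].copy()            # cheapest known link from each node to the tree
    parent = np.zeros(n, dtype=int)  # tree node that realizes that cheapest link
    edges = []
    for _ in range(n - 1):
        # Pick the closest node that is not yet in the tree.
        j = int(np.argmin(np.where(in_tree, np.inf, best)))
        edges.append((int(parent[j]), j))
        in_tree[j] = True
        # Relax: node j may now offer a cheaper link to the remaining nodes.
        closer = dist[j] < best
        best = np.where(closer, dist[j], best)
        parent = np.where(closer, j, parent)
    return edges

# Toy data (hypothetical): 30 noisy points in 10 dimensions, embedded in 2-D.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 10))
Z, W = pca_project(X, 2)
edges = minimum_spanning_tree(Z)
```

In this sketch the projection and the tree are computed one after the other. The abstract describes them as coupled learning problems, so the paper's method should not be read as this two-step pipeline.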


research
01/26/2015

IT-map: an Effective Nonlinear Dimensionality Reduction Method for Interactive Clustering

Scientists in many fields have the common and basic need of dimensionali...
research
12/09/2015

A Novel Regularized Principal Graph Learning Framework on Explicit Graph Representation

Many scientific datasets are of high dimension, and the analysis usually...
research
07/25/2022

Laplacian-based Cluster-Contractive t-SNE for High Dimensional Data Visualization

Dimensionality reduction techniques aim at representing high-dimensional...
research
07/12/2019

Improving the Projection of Global Structures in Data through Spanning Trees

The connection of edges in a graph generates a structure that is indepen...
research
09/18/2019

Laplacian Matrix for Dimensionality Reduction and Clustering

Many problems in machine learning can be expressed by means of a graph w...
research
07/05/2021

Randomized Dimensionality Reduction for Facility Location and Single-Linkage Clustering

Random dimensionality reduction is a versatile tool for speeding up algo...
research
05/21/2016

Learning From Hidden Traits: Joint Factor Analysis and Latent Clustering

Dimensionality reduction techniques play an essential role in data analy...
