SparCA: Sparse Compressed Agglomeration for Feature Extraction and Dimensionality Reduction

01/26/2023
by   Leland Barnard, et al.
0

The most effective dimensionality reduction procedures produce interpretable features from the raw input space while also providing good performance for downstream supervised learning tasks. For many methods, this requires optimizing one or more hyperparameters for a specific task, which can limit generalizability. In this study we propose sparse compressed agglomeration (SparCA), a novel dimensionality reduction procedure that involves a multistep hierarchical feature grouping, compression, and feature selection process. We demonstrate the characteristics and performance of the SparCA method across heterogenous synthetic and real-world datasets, including images, natural language, and single cell gene expression data. Our results show that SparCA is applicable to a wide range of data types, produces highly interpretable features, and shows compelling performance on downstream supervised learning tasks without the need for hyperparameter tuning.

READ FULL TEXT
research
11/30/2022

DimenFix: A novel meta-dimensionality reduction method for feature preservation

Dimensionality reduction has become an important research topic as deman...
research
06/19/2023

Nonlinear Feature Aggregation: Two Algorithms driven by Theory

Many real-world machine learning applications are characterized by a hug...
research
06/17/2022

DPDR: A novel machine learning method for the Decision Process for Dimensionality Reduction

This paper discusses the critical decision process of extracting or sele...
research
06/23/2023

Analyzing scRNA-seq data by CCP-assisted UMAP and t-SNE

Single-cell RNA sequencing (scRNA-seq) is widely used to reveal heteroge...
research
10/31/2018

Unsupervised Dimension Selection using a Blue Noise Spectrum

Unsupervised dimension selection is an important problem that seeks to r...
research
05/18/2018

Spectral feature scaling method for supervised dimensionality reduction

Spectral dimensionality reduction methods enable linear separations of c...
research
11/27/2013

Dimensionality reduction for click-through rate prediction: Dense versus sparse representation

In online advertising, display ads are increasingly being placed based o...

Please sign up or login with your details

Forgot password? Click here to reset