Multiway sparse distance weighted discrimination

10/11/2021
by Bin Guo, et al.

Modern data often take the form of a multiway array. However, most classification methods are designed for vectors, i.e., 1-way arrays. Distance weighted discrimination (DWD) is a popular high-dimensional classification method that has been extended to the multiway context, with dramatic improvements in performance when data have multiway structure. However, the previous implementation of multiway DWD was restricted to classification of matrices and did not account for sparsity. In this paper, we develop a general framework for multiway classification that is applicable to any number of dimensions and any degree of sparsity. Extensive simulation studies show that our model is robust to the degree of sparsity and improves classification accuracy when the data have multiway structure. In our motivating application, magnetic resonance spectroscopy (MRS) was used to measure the abundance of several metabolites across multiple neurological regions and multiple time points in a mouse model of Friedreich's ataxia, yielding a four-way data array. Our method reveals a robust and interpretable multi-region metabolomic signal that discriminates the groups of interest. We also successfully apply our method to gene expression time course data for multiple sclerosis treatment. An R implementation is available in the package MultiwayClassification at http://github.com/lockEF/MultiwayClassification.
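To make the idea of a linear classifier on a multiway array concrete, the following is a minimal sketch in plain Python. It assumes, purely for illustration, that the coefficient array is the outer product of one weight vector per mode (a rank-1 structure); the abstract does not state this, and the code is not the MultiwayClassification package's API. The names `multiway_score` and `classify` are hypothetical, and zeros in a weight vector stand in for sparsity.

```python
from itertools import product


def multiway_score(x, weights, intercept=0.0):
    """Linear score of a K-way array against a rank-1 coefficient array.

    x       : nested lists with one nesting level per mode
    weights : list of K weight vectors, one per mode (illustrative assumption)
    Returns intercept + sum over all index tuples of
    weights[0][i1] * ... * weights[K-1][iK] * x[i1]...[iK].
    """
    dims = [len(w) for w in weights]
    total = intercept
    for idx in product(*(range(d) for d in dims)):
        value = x
        coef = 1.0
        for mode, i in enumerate(idx):
            value = value[i]          # descend one mode into the array
            coef *= weights[mode][i]  # build the rank-1 coefficient entry
        total += coef * value
    return total


def classify(x, weights, intercept=0.0):
    """Assign class +1 or -1 from the sign of the score."""
    return 1 if multiway_score(x, weights, intercept) >= 0 else -1


# A 2 x 3 matrix (a 2-way array); the second mode's weights contain a zero,
# so the middle column does not contribute to the score.
x = [[1, 2, 3],
     [4, 5, 6]]
weights = [[1.0, -1.0], [0.5, 0.0, 2.0]]
score = multiway_score(x, weights)
label = classify(x, weights)
```

Under this assumed structure, a 2 x 3 input needs 2 + 3 = 5 weights instead of 6 entries of an unstructured coefficient matrix, and the saving grows quickly with the number of modes.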

