Integrative Sparse Partial Least Squares

06/05/2020
by   Weijuan Liang, et al.
0

Partial least squares, as a dimension reduction method, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken the performance of the model, the sparse partial least squares (SPLS) technique has been proposed to identify important variables and generate more interpretable results. However, the small sample size of a single dataset limits the performance of conventional methods. An effective solution comes from gathering information from multiple comparable studies. The integrative analysis holds an important status among multi-datasets analyses. The main idea is to improve estimation results by assembling raw datasets and analyzing them jointly. In this paper, we develop an integrative SPLS (iSPLS) method using penalization based on the SPLS technique. The proposed approach consists of two penalties. The first penalty conducts variable selection under the context of integrative analysis; The second penalty, a contrasted one, is imposed to encourage the similarity of estimates across datasets and generate more reasonable and accurate results. Computational algorithms are provided. Simulation experiments are conducted to compare iSPLS with alternative approaches. The practical utility of iSPLS is shown in the analysis of two TCGA gene expression data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2023

Biclustering Methods via Sparse Penalty

In this paper, we first reviewed several biclustering methods that are u...
research
10/19/2021

Operator-induced structural variable selection for identifying materials genes

In the emerging field of materials informatics, a fundamental task is to...
research
08/22/2023

Nonparametric Assessment of Variable Selection and Ranking Algorithms

Selecting from or ranking a set of candidates variables in terms of thei...
research
07/06/2022

Deep Partial Least Squares for IV Regression

In this paper, we propose deep partial least squares for the estimation ...
research
05/17/2019

Comparison of Machine Learning Models in Food Authentication Studies

The underlying objective of food authentication studies is to determine ...
research
06/18/2019

Learning data representation using modified autoencoder for the integrative analysis of multi-omics data

In integrative analyses of omics data, it is often of interest to extrac...

Please sign up or login with your details

Forgot password? Click here to reset