Dual-sPLS: a family of Dual Sparse Partial Least Squares regressions for feature selection and prediction with tunable sparsity; evaluation on simulated and near-infrared (NIR)

01/17/2023
by   Louna Alsouki, et al.
1

Relating a set of variables X to a response y is crucial in chemometrics. A quantitative prediction objective can be enriched by qualitative data interpretation, for instance by locating the most influential features. When high-dimensional problems arise, dimension reduction techniques can be used. Most notable are projections (e.g. Partial Least Squares or PLS ) or variable selections (e.g. lasso). Sparse partial least squares combine both strategies, by blending variable selection into PLS. The variant presented in this paper, Dual-sPLS, generalizes the classical PLS1 algorithm. It provides balance between accurate prediction and efficient interpretation. It is based on penalizations inspired by classical regression methods (lasso, group lasso, least squares, ridge) and uses the dual norm notion. The resulting sparsity is enforced by an intuitive shrinking ratio parameter. Dual-sPLS favorably compares to similar regression methods, on simulated and real chemical data. Code is provided as an open-source package in R: <https://CRAN.R-project.org/package=dual.spls>.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2022

Deep Partial Least Squares for IV Regression

In this paper, we propose deep partial least squares for the estimation ...
research
09/22/2020

The Linear Lasso: a location model resolution

We use location model methodology to guide the least squares analysis of...
research
04/06/2012

Fast projections onto mixed-norm balls with applications

Joint sparsity offers powerful structural cues for feature selection, es...
research
07/10/2023

Predicting milk traits from spectral data using Bayesian probabilistic partial least squares regression

High-dimensional spectral data – routinely generated in dairy production...
research
06/05/2020

Integrative Sparse Partial Least Squares

Partial least squares, as a dimension reduction method, has become incre...
research
06/26/2021

Deep Learning Partial Least Squares

High dimensional data reduction techniques are provided by using partial...
research
12/16/2019

Sparse Group Fused Lasso for Model Segmentation

This article introduces the sparse group fused lasso (SGFL) as a statist...

Please sign up or login with your details

Forgot password? Click here to reset