ℓ_0-Regularized High-dimensional Accelerated Failure Time Model

02/09/2020
by Xingdong Feng, et al.

We develop a constructive approach for ℓ_0-penalized estimation in the sparse accelerated failure time (AFT) model with high-dimensional covariates. The proposed method combines Stute's weighted least squares criterion with ℓ_0-penalization. It is an iterative algorithm that generates a sequence of solutions, using active sets derived from primal and dual information and root finding according to the KKT conditions. We refer to the proposed method as AFT-SDAR (for support detection and root finding). An important aspect of our theoretical results is that they directly concern the sequence of solutions generated by the AFT-SDAR algorithm. We prove that, with high probability, the estimation errors of the solution sequence decay exponentially to the optimal error bound, as long as the covariate matrix satisfies a mild regularity condition that is necessary and sufficient for model identification even in the setting of high-dimensional linear regression. We also propose an adaptive version of AFT-SDAR, called AFT-ASDAR, which determines the support size of the estimated coefficient in a data-driven fashion. We conduct simulation studies to demonstrate the superior performance of the proposed method over the lasso and MCP in terms of accuracy and speed. We also apply the proposed method to a real data set to illustrate its application.
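The iteration described above can be illustrated with a minimal sketch. The abstract does not give the update formulas, so everything below is an assumption: the weights are the standard Kaplan-Meier (Stute) weights, the update follows the generic support-detection-and-root-finding recipe known from linear regression, and the names `stute_weights`, `aft_sdar`, `T` (target support size) and `max_iter` are hypothetical. Here `y` is assumed to hold the observed log survival or censoring times and `delta` the event indicators (1 = event observed). The covariates are not standardized, which a practical implementation would likely do.

```python
import numpy as np


def stute_weights(y, delta):
    """Kaplan-Meier (Stute) weights, returned in the original observation order."""
    n = len(y)
    # Sort by y; among ties, observed events come before censored points.
    order = np.lexsort((1 - delta, y))
    d = delta[order].astype(float)
    i = np.arange(1, n + 1)
    factors = ((n - i) / (n - i + 1.0)) ** d
    # Product of the factors over all strictly earlier order statistics.
    surv_before = np.concatenate(([1.0], np.cumprod(factors)[:-1]))
    w = np.empty(n)
    w[order] = d / (n - i + 1.0) * surv_before
    return w


def aft_sdar(X, y, delta, T, max_iter=20):
    """Sketch of an SDAR-style iteration on the weighted least squares loss."""
    n, p = X.shape
    w = stute_weights(y, delta)
    # Weighted centering removes the intercept; scaling by sqrt(w) turns the
    # weighted criterion into an ordinary least squares problem.
    Xc = X - (w @ X) / w.sum()
    yc = y - (w @ y) / w.sum()
    Xw = np.sqrt(w)[:, None] * Xc
    yw = np.sqrt(w) * yc

    beta = np.zeros(p)   # primal variable
    d = Xw.T @ yw        # dual variable (residual correlations at beta = 0)
    active = None
    for _ in range(max_iter):
        # Support detection: the T largest entries of |beta + d|.
        new_active = np.sort(np.argsort(-np.abs(beta + d))[:T])
        if active is not None and np.array_equal(new_active, active):
            break        # active set has stabilized
        active = new_active
        # Root finding: least squares on the active set, zero elsewhere.
        beta = np.zeros(p)
        beta[active] = np.linalg.lstsq(Xw[:, active], yw, rcond=None)[0]
        d = Xw.T @ (yw - Xw @ beta)
        d[active] = 0.0
    return beta, active
```

A call such as `beta_hat, support = aft_sdar(X, y, delta, T=5)` returns a coefficient vector with at most `T` nonzero entries together with the selected indices. In this sketch `T` is fixed by the user; the adaptive variant mentioned in the abstract chooses the support size from the data, which is not reproduced here.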


research
01/16/2020

A Support Detection and Root Finding Approach for Learning High-dimensional Generalized Linear Models

Feature selection is important for modeling high-dimensional data, where...
research
10/09/2018

SNAP: A semismooth Newton algorithm for pathwise optimization with optimal local convergence rate and oracle properties

We propose a semismooth Newton algorithm for pathwise optimization (SNAP...
research
03/16/2023

High-Dimensional Penalized Bernstein Support Vector Machines

The support vector machines (SVM) is a powerful classifier used for bina...
research
03/19/2021

Robust penalized empirical likelihood in high dimensional longitudinal data analysis

As an effective nonparametric method, empirical likelihood (EL) is appea...
research
06/09/2018

Deterministic Stretchy Regression

An extension of the regularized least-squares in which the estimation pa...
research
04/04/2021

Scalable algorithms for semiparametric accelerated failure time models in high dimensions

Semiparametric accelerated failure time (AFT) models are a useful altern...
research
12/18/2020

Regularized Estimation in High-Dimensional Vector Auto-Regressive Models using Spatio-Temporal Information

A Vector Auto-Regressive (VAR) model is commonly used to model multivari...
