A Modern Theory for High-dimensional Cox Regression Models

04/03/2022
by   Xianyang Zhang, et al.
0

The proportional hazards model has been extensively used in many fields such as biomedicine to estimate and perform statistical significance testing on the effects of covariates influencing the survival time of patients. The classical theory of maximum partial-likelihood estimation (MPLE) is used by most software packages to produce inference, e.g., the coxph function in R and the PHREG procedure in SAS. In this paper, we investigate the asymptotic behavior of the MPLE in the regime in which the number of parameters p is of the same order as the number of samples n. The main results are (i) existence of the MPLE undergoes a sharp 'phase transition'; (ii) the classical MPLE theory leads to invalid inference in the high-dimensional regime. We show that the asymptotic behavior of the MPLE is governed by a new asymptotic theory. These findings are further corroborated through numerical studies. The main technical tool in our proofs is the Convex Gaussian Min-max Theorem (CGMT), which has not been previously used in the analysis of partial likelihood. Our results thus extend the scope of CGMT and shed new light on the use of CGMT for examining the existence of MPLE and non-separable objective functions.

READ FULL TEXT

page 5

page 6

page 9

page 12

page 13

research
08/17/2019

The Existence of Maximum Likelihood Estimate in High-Dimensional Generalized Linear Models with Binary Responses

Motivated by recent works on the high-dimensional logistic regression, w...
research
04/25/2018

The phase transition for the existence of the maximum likelihood estimate in high-dimensional logistic regression

This paper rigorously establishes that the existence of the maximum like...
research
05/28/2023

Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions

This paper investigates the asymptotic distribution of the maximum-likel...
research
10/16/2022

Dimension free ridge regression

Random matrix theory has become a widely useful tool in high-dimensional...
research
05/12/2022

The LAN property for McKean-Vlasov models in a mean-field regime

We establish the local asymptotic normality (LAN) property for estimatin...
research
02/05/2018

A useful variant of Wilks' theorem for grouped data

This paper provides a generalization of a classical result obtained by W...
research
07/27/2020

A regime switching on Covid19 analysis and prediction in Romania

In this paper we propose a regime separation for the analysis of Covid19...

Please sign up or login with your details

Forgot password? Click here to reset