Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions

05/28/2023
by   Kai Tan, et al.
0

This paper investigates the asymptotic distribution of the maximum-likelihood estimate (MLE) in multinomial logistic models in the high-dimensional regime where dimension and sample size are of the same order. While classical large-sample theory provides asymptotic normality of the MLE under certain conditions, such classical results are expected to fail in high-dimensions as documented for the binary logistic case in the seminal work of Sur and Candès [2019]. We address this issue in classification problems with 3 or more classes, by developing asymptotic normality and asymptotic chi-square results for the multinomial logistic MLE (also known as cross-entropy minimizer) on null covariates. Our theory leads to a new methodology to test the significance of a given feature. Extensive simulation studies on synthetic data corroborate these asymptotic results and confirm the validity of proposed p-values for testing the significance of a given feature.

READ FULL TEXT

page 14

page 18

research
01/26/2018

A note on "MLE in logistic regression with a diverging dimension"

This short note is to point the reader to notice that the proof of high ...
research
08/17/2019

The Existence of Maximum Likelihood Estimate in High-Dimensional Generalized Linear Models with Binary Responses

Motivated by recent works on the high-dimensional logistic regression, w...
research
01/25/2020

The Asymptotic Distribution of the MLE in High-dimensional Logistic Models: Arbitrary Covariance

We study the distribution of the maximum likelihood estimate (MLE) in hi...
research
06/27/2012

A Permutation Approach to Testing Interactions in Many Dimensions

To date, testing interactions in high dimensions has been a challenging ...
research
04/03/2022

A Modern Theory for High-dimensional Cox Regression Models

The proportional hazards model has been extensively used in many fields ...
research
06/05/2017

The Likelihood Ratio Test in High-Dimensional Logistic Regression Is Asymptotically a Rescaled Chi-Square

Logistic regression is used thousands of times a day to fit data, predic...
research
10/18/2022

Adjusting for non-confounding covariates in case-control association studies

Considerable debate has been generated in recent literature on whether n...

Please sign up or login with your details

Forgot password? Click here to reset