Selective Inference via Marginal Screening for High Dimensional Classification

06/26/2019
by Yuta Umezu, et al.

Post-selection inference is a statistical technique for determining salient variables after model or variable selection. Recently, selective inference, a kind of post-selection inference framework, has garnered attention in the statistics and machine learning communities. By conditioning on a specific variable selection procedure, selective inference can properly control the so-called selective type I error, which is the type I error conditional on the variable selection procedure, without imposing excessive additional computational costs. While selective inference can provide a valid hypothesis testing procedure, the main focus has hitherto been on Gaussian linear regression models. In this paper, we develop a selective inference framework for binary classification problems. We consider a logistic regression model after variable selection based on marginal screening, and derive the high dimensional statistical behavior of the post-selection estimator. This enables us to asymptotically control the selective type I error for the purposes of hypothesis testing after variable selection. We conduct several simulation studies to confirm the statistical power of the test, and compare our proposed method with data splitting and other methods.
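
As a rough illustration of the two-stage pipeline described above (marginal screening, then a logistic regression fit on the selected variables), the following minimal sketch may help. It rests on several assumptions that are not taken from the paper: the screening score |x_j^T (y - mean(y))|, the function names `marginal_screening` and `fit_logistic`, the absence of an intercept, and the simulated data are all hypothetical choices. It also shows only selection and refitting; it does not implement the selective inference correction, so any naive p-values computed from `beta_hat` would not control the selective type I error.

```python
import numpy as np

def marginal_screening(X, y, k):
    """Return sorted indices of the k columns with largest |x_j^T (y - mean(y))|.

    The scoring statistic is an assumption made for this sketch.
    """
    scores = np.abs(X.T @ (y - y.mean()))
    return np.sort(np.argsort(scores)[-k:])

def fit_logistic(X, y, n_iter=50, tol=1e-8):
    """Unpenalized logistic regression (no intercept) fitted by Newton's method."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))          # fitted probabilities
        grad = X.T @ (y - p)                         # score vector
        hess = (X * (p * (1 - p))[:, None]).T @ X    # observed information
        step = np.linalg.solve(hess, grad)
        beta += step
        if np.max(np.abs(step)) < tol:
            break
    return beta

# Simulated data (hypothetical): 2 active variables out of d = 50.
rng = np.random.default_rng(0)
n, d, k = 200, 50, 5
X = rng.standard_normal((n, d))
true_beta = np.zeros(d)
true_beta[:2] = 1.5
y = (rng.random(n) < 1.0 / (1.0 + np.exp(-X @ true_beta))).astype(float)

selected = marginal_screening(X, y, k)       # stage 1: variable selection
beta_hat = fit_logistic(X[:, selected], y)   # stage 2: post-selection estimator
```

Because the same data drive both stages, the distribution of `beta_hat` depends on the selection event; conditioning on that event is what the selective inference framework is meant to handle.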

research
04/09/2007

High-dimensional variable selection

This paper explores the following question: what kind of statistical gua...
research
05/25/2022

Resampling-Based Multisplit Inference for High-Dimensional Regression

We propose a novel resampling-based method to construct an asymptoticall...
research
03/29/2022

A new procedure for Selective Inference with the Generalized Linear Lasso

This article investigates the distribution of the solutions of the gene...
research
05/02/2021

Selective Inference in Propensity Score Analysis

Selective inference (post-selection inference) is a methodology that has...
research
02/23/2014

Exact Post Model Selection Inference for Marginal Screening

We develop a framework for post model selection inference, via marginal ...
research
02/28/2019

Granger Causality Testing in High-Dimensional VARs: a Post-Double-Selection Procedure

In this paper we develop an LM test for Granger causality in high-dimens...
research
01/15/2023

Selective Inference with Distributed Data

Nowadays, big datasets are spread over many machines which compute in pa...
