High-Dimensional Knockoffs Inference for Time Series Data

12/18/2021
by   Chien-Ming Chi, et al.
0

The framework of model-X knockoffs provides a flexible tool for exact finite-sample false discovery rate (FDR) control in variable selection. It also completely bypasses the use of conventional p-values, making it especially appealing in high-dimensional nonlinear models. Existing works have focused on the setting of independent and identically distributed observations. Yet time series data is prevalent in practical applications. This motivates the study of model-X knockoffs inference for time series data. In this paper, we make some initial attempt to establish the theoretical and methodological foundation for the model-X knockoffs inference for time series data. We suggest the method of time series knockoffs inference (TSKI) by exploiting the idea of subsampling to alleviate the difficulty caused by the serial dependence. We establish sufficient conditions under which the original model-X knockoffs inference combined with subsampling still achieves the asymptotic FDR control. Our technical analysis reveals the exact effect of serial dependence on the FDR control. To alleviate the practical concern on the power loss because of reduced sample size cause by subsampling, we exploit the idea of knockoffs with copies and multiple knockoffs. Under fairly general time series model settings, we show that the FDR remains to be controlled asymptotically. To theoretically justify the power of TSKI, we further suggest the new knockoff statistic, the backward elimination ranking (BE) statistic, and show that it enjoys both the sure screening property and controlled FDR in the linear time series model setting. The theoretical results and appealing finite-sample performance of the suggested TSKI method coupled with the BE are illustrated with several simulation examples and an economic inflation forecasting application.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/26/2018

Partial Distance Correlation Screening for High Dimensional Time Series

High dimensional time series datasets are becoming increasingly common i...
research
07/21/2020

Lasso Inference for High-Dimensional Time Series

The desparsified lasso is a high-dimensional estimation method which pro...
research
12/31/2022

Inference on Time Series Nonparametric Conditional Moment Restrictions Using General Sieves

General nonlinear sieve learnings are classes of nonlinear sieves that c...
research
09/06/2018

IPAD: Stable Interpretable Forecasting with Knockoffs Inference

Interpretability and stability are two important features that are desir...
research
08/01/2023

CoxKnockoff: Controlled Feature Selection for the Cox Model Using Knockoffs

Although there is a huge literature on feature selection for the Cox mod...
research
10/28/2019

Asymptotic Distributions of High-Dimensional Nonparametric Inference with Distance Correlation

Understanding the nonlinear association between a pair of potentially hi...
research
08/31/2017

RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs

Power and reproducibility are key to enabling refined scientific discove...

Please sign up or login with your details

Forgot password? Click here to reset