Robust Hypothesis Test for Nonlinear Effect with Gaussian Processes

10/03/2017
by   Jeremiah Zhe Liu, et al.
0

This work constructs a hypothesis test for detecting whether an data-generating function h: R^p → R belongs to a specific reproducing kernel Hilbert space H_0 , where the structure of H_0 is only partially known. Utilizing the theory of reproducing kernels, we reduce this hypothesis to a simple one-sided score test for a scalar parameter, develop a testing procedure that is robust against the mis-specification of kernel functions, and also propose an ensemble-based estimator for the null model to guarantee test performance in small samples. To demonstrate the utility of the proposed method, we apply our test to the problem of detecting nonlinear interaction between groups of continuous features. We evaluate the finite-sample performance of our test under different data-generating functions and estimation strategies for the null model. Our results reveal interesting connections between notions in machine learning (model underfit/overfit) and those in statistical inference (i.e. Type I error/power of hypothesis test), and also highlight unexpected consequences of common model estimating strategies (e.g. estimating kernel hyperparameters using maximum likelihood estimation) on model inference.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2018

Cross-Validated Kernel Ensemble: Robust Hypothesis Test for Nonlinear Effect with Gaussian Process

The R package CVEK introduces a robust hypothesis test for nonlinear eff...
research
11/30/2018

Kernel based method for the k-sample problem

In this paper we deal with the problem of testing for the equality of k ...
research
10/28/2021

Kernel-based Partial Permutation Test for Detecting Heterogeneous Functional Relationship

We propose a kernel-based partial permutation test for checking the equa...
research
04/07/2008

Testing for Homogeneity with Kernel Fisher Discriminant Analysis

We propose to investigate test statistics for testing homogeneity in rep...
research
04/15/2021

A robust specification test in linear panel data models

The presence of outlying observations may adversely affect statistical t...
research
10/06/2022

Post-selection Inference in Multiverse Analysis (PIMA): an inferential framework based on the sign flipping score test

When analyzing data researchers make some decisions that are either arbi...

Please sign up or login with your details

Forgot password? Click here to reset