Cross-Fitting and Fast Remainder Rates for Semiparametric Estimation

01/27/2018
by Whitney K. Newey, et al.

There are many interesting and widely used estimators of a functional with finite semiparametric variance bound that depend on nonparametric estimators of nuisance functions. We use cross-fitting (i.e., sample splitting) to construct novel estimators with fast remainder rates. We give cross-fit doubly robust estimators that use separate subsamples to estimate different nuisance functions. We obtain general, precise results for regression spline estimation of average linear functionals of conditional expectations with a finite semiparametric variance bound. We show that a cross-fit doubly robust spline regression estimator of the expected conditional covariance is semiparametrically efficient under minimal conditions. Cross-fit doubly robust estimators of other average linear functionals of a conditional expectation are shown to have the fastest known remainder rates for the Haar basis or under certain smoothness conditions. Surprisingly, the cross-fit plug-in estimator also has nearly the fastest known remainder rate, but its remainder converges to zero more slowly than that of the cross-fit doubly robust estimator. As specific examples, we consider the expected conditional covariance, the mean with randomly missing data, and a weighted average derivative.
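
To make the idea of using separate subsamples for different nuisance functions concrete, here is a minimal Python sketch for the expected conditional covariance E[Cov(Y, X | Z)], written as the average of (Y - E[Y|Z])(X - E[X|Z]). It is an illustration under stated assumptions, not the paper's implementation: the function names `fit_series` and `cross_fit_dr_cond_cov` are hypothetical, Z is taken to be scalar, the three-fold rotation is one possible splitting scheme, and a polynomial least-squares fit stands in for the regression splines discussed in the abstract.

```python
import numpy as np


def fit_series(z, y, degree=3):
    """Least-squares fit of y on a polynomial basis in scalar z.

    Assumption: a polynomial basis is used here as a simple stand-in
    for the regression splines mentioned in the abstract.
    """
    coef = np.polynomial.polynomial.polyfit(z, y, degree)
    return lambda z_new: np.polynomial.polynomial.polyval(z_new, coef)


def cross_fit_dr_cond_cov(y, x, z, degree=3, seed=0):
    """Cross-fit doubly robust estimate of E[Cov(Y, X | Z)].

    The sample is split into three folds. For each evaluation fold,
    E[Y|Z] is fitted on one of the remaining folds and E[X|Z] on the
    other, so the two nuisance estimates come from separate subsamples
    and neither uses the observations being averaged.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    folds = np.array_split(rng.permutation(n), 3)

    total = 0.0
    for k in range(3):
        eval_idx = folds[k]
        y_fit_idx = folds[(k + 1) % 3]  # subsample used for E[Y|Z]
        x_fit_idx = folds[(k + 2) % 3]  # subsample used for E[X|Z]

        g_y = fit_series(z[y_fit_idx], y[y_fit_idx], degree)
        g_x = fit_series(z[x_fit_idx], x[x_fit_idx], degree)

        # Product of residuals on the held-out evaluation fold.
        res_y = y[eval_idx] - g_y(z[eval_idx])
        res_x = x[eval_idx] - g_x(z[eval_idx])
        total += np.sum(res_y * res_x)

    # Every observation is evaluated exactly once across the rotation.
    return total / n
```

Calling `cross_fit_dr_cond_cov(y, x, z)` on one-dimensional NumPy arrays of equal length returns a single number. The rotation lets every observation contribute to the final average while keeping each nuisance fit independent of the points it is evaluated on.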


