A Free Lunch with Influence Functions? Improving Neural Network Estimates with Concepts from Semiparametric Statistics

02/18/2022
by   Matthew J. Vowels, et al.
6

Parameter estimation in the empirical fields is usually undertaken using parametric models, and such models are convenient because they readily facilitate statistical inference. Unfortunately, they are unlikely to have a sufficiently flexible functional form to be able to adequately model real-world phenomena, and their usage may therefore result in biased estimates and invalid inference. Unfortunately, whilst non-parametric machine learning models may provide the needed flexibility to adapt to the complexity of real-world phenomena, they do not readily facilitate statistical inference, and may still exhibit residual bias. We explore the potential for semiparametric theory (in particular, the Influence Function) to be used to improve neural networks and machine learning algorithms in terms of (a) improving initial estimates without needing more data (b) increasing the robustness of our models, and (c) yielding confidence intervals for statistical inference. We propose a new neural network method MultiNet, which seeks the flexibility and diversity of an ensemble using a single architecture. Results on causal inference tasks indicate that MultiNet yields better performance than other approaches, and that all considered methods are amenable to improvement from semiparametric techniques under certain conditions. In other words, with these techniques we show that we can improve existing neural networks for `free', without needing more data, and without needing to retrain them. Finally, we provide the expression for deriving influence functions for estimands from a general graph, and the code to do so automatically.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2019

Shapley regressions: A framework for statistical inference on machine learning models

Machine learning models often excel in the accuracy of their predictions...
research
06/19/2018

Simplifying Probabilistic Expressions in Causal Inference

Obtaining a non-parametric expression for an interventional distribution...
research
06/19/2019

Frequentist Inference without Repeated Sampling

Frequentist inference typically is described in terms of hypothetical re...
research
09/01/2021

RIFLE: Robust Inference from Low Order Marginals

The ubiquity of missing values in real-world datasets poses a challenge ...
research
10/01/2019

Joint Estimation of the Non-parametric Transitivity and Preferential Attachment Functions in Scientific Co-authorship Networks

We propose a statistical method to estimate simultaneously the non-param...
research
11/14/2020

Inference Functions for Semiparametric Models

The paper discusses inference techniques for semiparametric models based...

Please sign up or login with your details

Forgot password? Click here to reset