The Risk of Machine Learning

03/31/2017
by   Alberto Abadie, et al.
0

Many applied settings in empirical economics involve simultaneous estimation of a large number of parameters. In particular, applied economists are often interested in estimating the effects of many-valued treatments (like teacher effects or location effects), treatment effects for many groups, and prediction models with many regressors. In these settings, machine learning methods that combine regularized estimation and data-driven choices of regularization parameters are useful to avoid over-fitting. In this article, we analyze the performance of a class of machine learning estimators that includes ridge, lasso and pretest in contexts that require simultaneous estimation of many parameters. Our analysis aims to provide guidance to applied researchers on (i) the choice between regularized estimators in practice and (ii) data-driven selection of regularization parameters. To address (i), we characterize the risk (mean squared error) of regularized estimators and derive their relative performance as a function of simple features of the data generating process. To address (ii), we show that data-driven choices of regularization parameters, based on Stein's unbiased risk estimate or on cross-validation, yield estimators with risk uniformly close to the risk attained under the optimal (unfeasible) choice of regularization parameters. We use data from recent examples in the empirical economics literature to illustrate the practical applicability of our results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2020

Group-regularized ridge regression via empirical Bayes noise level cross-validation

Features in predictive models are not exchangeable, yet common supervise...
research
10/30/2019

Find what you are looking for: A data-driven covariance matrix estimation

The global minimum-variance portfolio is a typical choice for investors ...
research
07/05/2023

The distribution of Ridgeless least squares interpolators

The Ridgeless minimum ℓ_2-norm interpolator in overparametrized linear r...
research
10/05/2022

Spectral Regularization Allows Data-frugal Learning over Combinatorial Spaces

Data-driven machine learning models are being increasingly employed in s...
research
01/23/2022

High-dimensional model-assisted inference for treatment effects with multi-valued treatments

Consider estimation of average treatment effects with multi-valued treat...
research
12/19/2017

Some Large Sample Results for the Method of Regularized Estimators

We present a general framework for studying regularized estimators; i.e....
research
05/19/2017

Data-driven Optimal Transport Cost Selection for Distributionally Robust Optimizatio

Recently, (Blanchet, Kang, and Murhy 2016) showed that several machine l...

Please sign up or login with your details

Forgot password? Click here to reset