Bayesian Imputation with Optimal Look-Ahead-Bias and Variance Tradeoff

02/02/2022
by   Jose Blanchet, et al.

Missing time-series data is a prevalent problem in finance. Imputation methods for time-series data are usually applied to the full panel data with the purpose of training a model for a downstream out-of-sample task. For example, the imputation of missing returns may be applied prior to estimating a portfolio optimization model. However, this practice can result in a look-ahead-bias in the future performance of the downstream task. There is an inherent trade-off between the look-ahead-bias of using the full data set for imputation and the larger variance in the imputation from using only the training data. By connecting layers of information revealed in time, we propose a Bayesian consensus posterior that fuses an arbitrary number of posteriors to optimally control the variance and look-ahead-bias trade-off in the imputation. We derive tractable two-step optimization procedures for finding the optimal consensus posterior, with Kullback-Leibler divergence and Wasserstein distance as the measure of dissimilarity between posterior distributions. We demonstrate in simulations and an empirical study the benefit of our imputation mechanism for portfolio optimization with missing returns.
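To make the idea of fusing posteriors concrete, here is a minimal sketch in Python. It is not the paper's two-step optimization procedure: it assumes one-dimensional Gaussian posteriors for a mean return, a conjugate Gaussian prior with known noise variance, and fusion weights that are fixed by hand rather than optimized. All function names and parameters (gaussian_mean_posterior, wasserstein_barycenter, kl_barycenter, weight_full) are hypothetical. The sketch only shows how a training-only posterior (no look-ahead-bias, larger variance) and a full-panel posterior (smaller variance, look-ahead-bias) could be combined into a single posterior under the two dissimilarity measures named above.

```python
import numpy as np


def gaussian_mean_posterior(x, prior_mean=0.0, prior_var=1.0, noise_var=1.0):
    """Conjugate posterior N(mean, var) for the mean of N(mu, noise_var) data.

    Missing entries (NaN) are ignored. The prior and the known noise
    variance are illustrative assumptions.
    """
    x = np.asarray(x, dtype=float)
    obs = x[~np.isnan(x)]
    precision = 1.0 / prior_var + obs.size / noise_var
    mean = (prior_mean / prior_var + obs.sum() / noise_var) / precision
    return mean, 1.0 / precision


def _normalize(weights):
    w = np.asarray(weights, dtype=float)
    return w / w.sum()


def wasserstein_barycenter(means, variances, weights):
    """2-Wasserstein barycenter of 1-D Gaussians.

    In one dimension it is Gaussian, with the weighted average of the
    means and the weighted average of the standard deviations.
    """
    w = _normalize(weights)
    mean = float(np.dot(w, means))
    std = float(np.dot(w, np.sqrt(variances)))
    return mean, std ** 2


def kl_barycenter(means, variances, weights):
    """Minimizer over q of sum_i w_i * KL(q || p_i) for Gaussian p_i.

    The solution is the normalized weighted geometric mean of the p_i:
    a Gaussian whose precision is the weighted average of the precisions.
    """
    w = _normalize(weights)
    variances = np.asarray(variances, dtype=float)
    precision = float(np.dot(w, 1.0 / variances))
    mean = float(np.dot(w, np.asarray(means, dtype=float) / variances)) / precision
    return mean, 1.0 / precision


def demo(seed=0, n_train=60, n_test=40, weight_full=0.3):
    """Fuse a training-only posterior with a full-panel posterior."""
    rng = np.random.default_rng(seed)
    returns = rng.normal(0.05, 1.0, size=n_train + n_test)
    returns[rng.random(returns.size) < 0.2] = np.nan  # 20% missing at random

    train = gaussian_mean_posterior(returns[:n_train])  # no look-ahead
    full = gaussian_mean_posterior(returns)             # uses future data

    means = [train[0], full[0]]
    variances = [train[1], full[1]]
    weights = [1.0 - weight_full, weight_full]          # fixed, not optimized
    return {
        "train": train,
        "full": full,
        "wasserstein": wasserstein_barycenter(means, variances, weights),
        "kl": kl_barycenter(means, variances, weights),
    }
```

Calling demo() returns the (mean, variance) pair of each posterior. Raising weight_full moves the fused posterior toward the full-panel one, which lowers its variance at the cost of leaning more on future information; choosing that weight optimally is the problem the paper addresses, and this sketch leaves it open.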

research
02/25/2021

Time-Series Imputation with Wasserstein Interpolation for Optimal Look-Ahead-Bias and Variance Tradeoff

Missing time-series data is a prevalent practical problem. Imputation me...
research
10/20/2020

RDIS: Random Drop Imputation with Self-Training for Incomplete Time Series Data

It is common that time-series data with missing values are encountered i...
research
06/01/2023

An End-to-End Time Series Model for Simultaneous Imputation and Forecast

Time series forecasting using historical data has been an interesting an...
research
10/25/2021

Time series signal recovery methods: comparative study

Signal data often contains missing values. Effective replacement (imputa...
research
10/01/2020

When to Impute? Imputation before and during cross-validation

Cross-validation (CV) is a technique used to estimate generalization err...
research
02/10/2021

MAIN: Multihead-Attention Imputation Networks

The problem of missing data, usually absent in curated and competition-st...
research
11/17/2022

Imputation of Missing Streamflow Data at Multiple Gauging Stations in Benin Republic

Streamflow observation data is vital for flood monitoring, agricultural,...
