Understanding Model Complexity for temporal tabular and multi-variate time series, case study with Numerai data science tournament

03/14/2023
by Thomas Wong, et al.

In this paper, we explore the use of different feature engineering and dimensionality reduction methods in multi-variate time-series modelling. Using a feature-target cross-correlation time-series dataset created from the Numerai tournament, we demonstrate that, in the over-parameterised regime, both the performance and the predictions of different feature engineering methods converge to the same equilibrium, which can be characterised by the reproducing kernel Hilbert space. We propose a new ensemble method, which combines different random non-linear transforms followed by ridge regression, for modelling high-dimensional time series. Compared to some commonly used deep learning models for sequence modelling, such as LSTMs and transformers, our method is more robust (lower model variance over different random seeds and less sensitivity to the choice of architecture) and more efficient. An additional advantage of our method is its simplicity, as there is no need for sophisticated deep learning frameworks such as PyTorch. The learned feature rankings are then applied to the temporal tabular prediction problem in the Numerai tournament, where the predictive power of the feature rankings obtained from our method is better than that of the baseline prediction model based on moving averages.
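The abstract does not spell out which random non-linear transforms are used, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes random Fourier features as the non-linear transform (one common choice whose limit is an RBF kernel, which fits the reproducing kernel Hilbert space framing), a closed-form ridge fit per ensemble member, and a plain average of member predictions. The function names `make_transform`, `fit_ensemble` and `predict_ensemble`, and the parameters `gamma`, `alpha` and `n_members`, are hypothetical and chosen for illustration.

```python
import numpy as np


def make_transform(n_inputs, n_features, gamma, rng):
    """Return a fixed random Fourier feature map (assumed transform, not from the paper)."""
    # Random projection and phase; together they approximate an RBF kernel
    # exp(-gamma * ||x - x'||^2) as n_features grows.
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(n_inputs, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    scale = np.sqrt(2.0 / n_features)
    return lambda X: scale * np.cos(X @ W + b)


def fit_ensemble(X, y, n_members=5, n_features=200, gamma=0.5, alpha=1.0, seed=0):
    """Fit one ridge regression per random transform; each member uses its own seed."""
    members = []
    for k in range(n_members):
        rng = np.random.default_rng(seed + k)
        phi = make_transform(X.shape[1], n_features, gamma, rng)
        Z = phi(X)
        # Closed-form ridge solution: (Z^T Z + alpha I)^{-1} Z^T y
        coef = np.linalg.solve(Z.T @ Z + alpha * np.eye(n_features), Z.T @ y)
        members.append((phi, coef))
    return members


def predict_ensemble(members, X):
    """Average the predictions of all ensemble members."""
    return np.mean([phi(X) @ coef for phi, coef in members], axis=0)


# Toy usage on synthetic data (not Numerai data).
rng = np.random.default_rng(42)
X_train = rng.normal(size=(300, 5))
y_train = np.sin(X_train[:, 0]) + 0.1 * rng.normal(size=300)
model = fit_ensemble(X_train, y_train)
train_mse = np.mean((predict_ensemble(model, X_train) - y_train) ** 2)
```

Only NumPy is needed here, in keeping with the abstract's point that no deep learning framework is required. Averaging over several independently seeded transforms is the step intended to reduce variance across random seeds; the number of random features per member controls how far each member sits in the over-parameterised regime.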

research
03/26/2023

Feature Engineering Methods on Multivariate Time-Series Data for Financial Data Science Competitions

We apply different feature engineering methods for time-series to US mar...
research
01/04/2020

Temporal Tensor Transformation Network for Multivariate Time Series Prediction

Multivariate time series prediction has applications in a wide variety o...
research
03/18/2022

Performance of Deep Learning models with transfer learning for multiple-step-ahead forecasts in monthly time series

Deep Learning and transfer learning models are being used to generate ti...
research
08/02/2023

Automatic Feature Engineering for Time Series Classification: Evaluation and Discussion

Time Series Classification (TSC) has received much attention in the past...
research
05/23/2023

DF2M: An Explainable Deep Bayesian Nonparametric Model for High-Dimensional Functional Time Series

In this paper, we present Deep Functional Factor Model (DF2M), a Bayesia...
research
12/30/2022

Dynamic Feature Engineering and model selection methods for temporal tabular datasets with regime changes

The application of deep learning algorithms to temporal panel datasets i...
research
01/27/2023

Learning the Dynamics of Sparsely Observed Interacting Systems

We address the problem of learning the dynamics of an unknown non-parame...
