Extracting Predictive Information from Heterogeneous Data Streams using Gaussian Processes

by Sid Ghoshal, et al.
University of Oxford

Financial markets are notoriously complex environments, presenting vast amounts of noisy yet potentially informative data. We consider the problem of forecasting financial time series from a wide range of information sources using online Gaussian Processes with Automatic Relevance Determination (ARD) kernels. We measure the performance gain, quantified in terms of Normalised Root Mean Square Error (NRMSE), Median Absolute Deviation (MAD) and Pearson correlation, from fusing the heterogeneous data streams of four separate domains into a single probabilistic model: time series technicals, sentiment analysis, options market data and broker recommendations. We show evidence that ARD kernels produce meaningful feature rankings that help retain salient inputs and reduce input dimensionality, providing a framework for sifting through financial complexity. In particular, our findings highlight the critical value of options data in mapping out the curvature of price space and inspire an intuitive, novel direction for research in financial prediction.
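
To make the abstract's ingredients concrete, the following is a minimal sketch, not the authors' implementation. It assumes a squared-exponential ARD kernel (one length-scale per input), ranks inputs by length-scale (shorter meaning more relevant), and computes the three reported metrics. The function names, the feature labels `f0`..`f2`, the choice to normalise RMSE by the standard deviation of the targets, and the reading of MAD as the median of absolute forecast errors are all assumptions made for illustration; the paper may define them differently.

```python
import math
from statistics import mean, median, pstdev


def ard_se_kernel(x, y, length_scales, signal_var=1.0):
    """Squared-exponential kernel with a separate length-scale per dimension.

    Assumed form: k(x, y) = s^2 * exp(-0.5 * sum_d ((x_d - y_d) / l_d)^2).
    """
    sq = sum(((a - b) / l) ** 2 for a, b, l in zip(x, y, length_scales))
    return signal_var * math.exp(-0.5 * sq)


def rank_features(names, length_scales):
    """Order feature names from most to least relevant.

    Under ARD, a short learned length-scale means the output varies
    quickly along that input, so the input is treated as more relevant.
    """
    return [name for _, name in sorted(zip(length_scales, names))]


def nrmse(y_true, y_pred):
    """RMSE divided by the population std of the targets (assumed normaliser)."""
    rmse = math.sqrt(mean((t - p) ** 2 for t, p in zip(y_true, y_pred)))
    return rmse / pstdev(y_true)


def mad(y_true, y_pred):
    """Median of absolute forecast errors (assumed reading of MAD)."""
    return median(abs(t - p) for t, p in zip(y_true, y_pred))


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


if __name__ == "__main__":
    # Hypothetical length-scales, chosen only to illustrate the ranking.
    names = ["f0", "f1", "f2"]
    scales = [2.0, 0.5, 50.0]
    print(rank_features(names, scales))

    y_true = [1.0, 2.0, 3.0]
    y_pred = [2.0, 3.0, 4.0]
    print(nrmse(y_true, y_pred), mad(y_true, y_pred), pearson(y_true, y_pred))
```

In a full pipeline the length-scales would be learned by maximising the Gaussian Process marginal likelihood; here they are supplied by hand so that the ranking step can be read in isolation. An input with a very large length-scale contributes almost nothing to the kernel, which is the sense in which ARD supports pruning inputs.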

