Real World Time Series Benchmark Datasets with Distribution Shifts: Global Crude Oil Price and Volatility

08/21/2023
by   Pranay Pasula, et al.
0

The scarcity of task-labeled time-series benchmarks in the financial domain hinders progress in continual learning. Addressing this deficit would foster innovation in this area. Therefore, we present COB, Crude Oil Benchmark datasets. COB includes 30 years of asset prices that exhibit significant distribution shifts and optimally generates corresponding task (i.e., regime) labels based on these distribution shifts for the three most important crude oils in the world. Our contributions include creating real-world benchmark datasets by transforming asset price data into volatility proxies, fitting models using expectation-maximization (EM), generating contextual task labels that align with real-world events, and providing these labels as well as the general algorithm to the public. We show that the inclusion of these task labels universally improves performance on four continual learning algorithms, some state-of-the-art, over multiple forecasting horizons. We hope these benchmarks accelerate research in handling distribution shifts in real-world data, especially due to the global importance of the assets considered. We've made the (1) raw price data, (2) task labels generated by our approach, (3) and code for our algorithm available at https://oilpricebenchmarks.github.io.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2022

Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time

Distribution shift occurs when the test distribution differs from the tr...
research
03/29/2022

Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries

Learning under a continuously changing data distribution with incorrect ...
research
12/03/2021

Contrastive Continual Learning with Feature Propagation

Classical machine learners are designed only to tackle one task without ...
research
01/05/2022

Mixture of basis for interpretable continual learning with distribution shifts

Continual learning in environments with shifting data distributions is a...
research
06/29/2022

Continual Learning for Human State Monitoring

Continual Learning (CL) on time series data represents a promising but u...
research
03/18/2022

WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series Tasks

Machine learning models often fail to generalize well under distribution...
research
11/11/2019

Making Good on LSTMs Unfulfilled Promise

LSTMs promise much to financial time-series analysis, temporal and cross...

Please sign up or login with your details

Forgot password? Click here to reset