The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction

11/17/2019
by   Martin Gauch, et al.
0

Accurate streamflow prediction largely relies on historical records of both meteorological data and streamflow measurements. For many regions around the world, however, such data are only scarcely or not at all available. To select an appropriate model for a region with a given amount of historical data, it is therefore indispensable to know a model's sensitivity to limited training data, both in terms of geographic diversity and different spans of time. In this study, we provide decision support for tree- and LSTM-based models. We feed the models meteorological measurements from the CAMELS dataset, and individually restrict the training period length and the number of basins used in training. Our findings show that tree-based models provide more accurate predictions on small datasets, while LSTMs are superior given sufficient training data. This is perhaps not surprising, as neural networks are known to be data-hungry; however, we are able to characterize each model's strengths under different conditions, including the "breakeven point" when LSTMs begin to overtake tree-based models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2019

A Comparative Analysis of Forecasting Financial Time Series Using ARIMA, LSTM, and BiLSTM

Machine and deep learning-based algorithms are the emerging approaches i...
research
02/05/2020

Forecasting Industrial Aging Processes with Machine Learning Methods

By accurately predicting industrial aging processes (IAPs), it is possib...
research
12/24/2022

Improving Uncertainty Quantification of Variance Networks by Tree-Structured Learning

To improve uncertainty quantification of variance networks, we propose a...
research
12/06/2019

A limited-size ensemble of homogeneous CNN/LSTMs for high-performance word classification

In recent years, long short-term memory neural networks (LSTMs) have bee...
research
06/13/2018

An Evaluation of Neural Machine Translation Models on Historical Spelling Normalization

In this paper, we apply different NMT models to the problem of historica...
research
07/18/2022

Why do tree-based models still outperform deep learning on tabular data?

While deep learning has enabled tremendous progress on text and image da...
research
12/15/2022

Converting College Football Point Spread Differentials to Probabilities

For NCAA football, we provide a method for sports bettors to determine i...

Please sign up or login with your details

Forgot password? Click here to reset