The data synergy effects of time-series deep learning models in hydrology

01/06/2021
by   Kuai Fang, et al.
6

When fitting statistical models to variables in geoscientific disciplines such as hydrology, it is a customary practice to regionalize - to divide a large spatial domain into multiple regions and study each region separately - instead of fitting a single model on the entire data (also known as unification). Traditional wisdom in these fields suggests that models built for each region separately will have higher performance because of homogeneity within each region. However, by partitioning the training data, each model has access to fewer data points and cannot learn from commonalities between regions. Here, through two hydrologic examples (soil moisture and streamflow), we argue that unification can often significantly outperform regionalization in the era of big data and deep learning (DL). Common DL architectures, even without bespoke customization, can automatically build models that benefit from regional commonality while accurately learning region-specific differences. We highlight an effect we call data synergy, where the results of the DL models improved when data were pooled together from characteristically different regions. In fact, the performance of the DL models benefited from more diverse rather than more homogeneous training data. We hypothesize that DL models automatically adjust their internal representations to identify commonalities while also providing sufficient discriminatory information to the model. The results here advocate for pooling together larger datasets, and suggest the academic community should place greater emphasis on data sharing and compilation.

READ FULL TEXT
research
12/15/2022

An Empirical Study of Deep Learning Models for Vulnerability Detection

Deep learning (DL) models of code have recently reported great progress ...
research
06/22/2021

Revisiting Deep Learning Models for Tabular Data

The necessity of deep learning for tabular data is still an unanswered q...
research
01/23/2023

Toward Foundation Models for Earth Monitoring: Generalizable Deep Learning Models for Natural Hazard Segmentation

Climate change results in an increased probability of extreme weather ev...
research
11/08/2018

When Mobile Apps Going Deep: An Empirical Study of Mobile Deep Learning

Deep learning (DL) is a game-changing technique in mobile scenarios, as ...
research
12/21/2020

Natural vs Balanced Distribution in Deep Learning on Whole Slide Images for Cancer Detection

The class distribution of data is one of the factors that regulates the ...
research
02/03/2023

Using Explainability to Inform Statistical Downscaling Based on Deep Learning Beyond Standard Validation Approaches

Deep learning (DL) has emerged as a promising tool to downscale climate ...
research
03/18/2019

Advanced Capsule Networks via Context Awareness

Capsule Networks (CN) offer new architectures for Deep Learning (DL) com...

Please sign up or login with your details

Forgot password? Click here to reset