DisCo: Physics-Based Unsupervised Discovery of Coherent Structures in Spatiotemporal Systems

09/25/2019
by   Adam Rupe, et al.
0

Extracting actionable insight from complex unlabeled scientific data is an open challenge and key to unlocking data-driven discovery in science. Complementary and alternative to supervised machine learning approaches, unsupervised physics-based methods based on behavior-driven theories hold great promise. Due to computational limitations, practical application on real-world domain science problems has lagged far behind theoretical development. We present our first step towards bridging this divide - DisCo - a high-performance distributed workflow for the behavior-driven local causal state theory. DisCo provides a scalable unsupervised physics-based representation learning method that decomposes spatiotemporal systems into their structurally relevant components, which are captured by the latent local causal state variables. Complex spatiotemporal systems are generally highly structured and organize around a lower-dimensional skeleton of coherent structures, and in several firsts we demonstrate the efficacy of DisCo in capturing such structures from observational and simulated scientific data. To the best of our knowledge, DisCo is also the first application software developed entirely in Python to scale to over 1000 machine nodes, providing good performance along with ensuring domain scientists' productivity. We developed scalable, performant methods optimized for Intel many-core processors that will be upstreamed to open-source Python library packages. Our capstone experiment, using newly developed DisCo workflow and libraries, performs unsupervised spacetime segmentation analysis of CAM5.1 climate simulation data, processing an unprecedented 89.5 TB in 6.6 minutes end-to-end using 1024 Intel Haswell nodes on the Cori supercomputer obtaining 91 strong-scaling efficiency.

READ FULL TEXT

page 1

page 8

page 10

page 12

research
09/16/2019

Towards Unsupervised Segmentation of Extreme Weather Events

Extreme weather is one of the main mechanisms through which climate chan...
research
01/01/2018

Local Causal States and Discrete Coherent Structures

Coherent structures form spontaneously in nonlinear spatiotemporal syste...
research
04/25/2023

Physics-Informed Representation Learning for Emergent Organization in Complex Dynamical Systems

Nonlinearly interacting system components often introduce instabilities ...
research
10/17/2019

Speeding simulation analysis up with yt and Intel Distribution for Python

As modern scientific simulations grow ever more in size and complexity, ...
research
07/12/2023

CLAIMED – the open source framework for building coarse-grained operators for accelerated discovery in science

In modern data-driven science, reproducibility and reusability are key c...
research
10/15/2021

Robust physics discovery via supervised and unsupervised pattern recognition using the Euler characteristic

Machine learning approaches have been widely used for discovering the un...
research
01/08/2018

Unsupervised Discovery of Toxoplasma gondii Motility Phenotypes

Toxoplasma gondii is a parasitic protozoan that causes dis- seminated to...

Please sign up or login with your details

Forgot password? Click here to reset