Improved architectures and training algorithms for deep operator networks

10/04/2021
by Sifan Wang, et al.

Operator learning techniques have recently emerged as a powerful tool for learning maps between infinite-dimensional Banach spaces. Trained under appropriate constraints, they can also be effective in learning the solution operator of partial differential equations (PDEs) in an entirely self-supervised manner. In this work we analyze the training dynamics of deep operator networks (DeepONets) through the lens of Neural Tangent Kernel (NTK) theory, and reveal a bias that favors the approximation of functions with larger magnitudes. To correct this bias we propose to adaptively re-weight the importance of each training example, and demonstrate how this procedure can effectively balance the magnitude of back-propagated gradients during training via gradient descent. We also propose a novel network architecture that is more resilient to vanishing gradient pathologies. Taken together, our developments provide new insights into the training of DeepONets and consistently improve their predictive accuracy by a factor of 10-50x, demonstrated in the challenging setting of learning PDE solution operators in the absence of paired input-output observations. All code and data accompanying this manuscript are publicly available at <https://github.com/PredictiveIntelligenceLab/ImprovedDeepONets>.
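To make the re-weighting idea concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes a toy model that is linear in its parameters, so the NTK diagonal entry for example k is simply the squared norm of that example's feature vector, and it assumes a hypothetical weighting rule, lambda_k = (max_j K_jj / K_kk) ** alpha. The names `ntk_diagonal`, `adaptive_weights`, and `alpha` are illustrative and do not come from the abstract.

```python
import math

def ntk_diagonal(features):
    """Diagonal NTK entries K_kk = ||d f(x_k) / d theta||^2.

    Assumes a model linear in theta, f(x) = theta . phi(x), so the
    parameter gradient for example k is just its feature vector phi(x_k).
    """
    return [sum(p * p for p in phi) for phi in features]

def adaptive_weights(diag, alpha=1.0):
    """Hypothetical rule: up-weight examples with small NTK diagonal.

    lambda_k = (max_j K_jj / K_kk) ** alpha  (an assumption for illustration).
    """
    k_max = max(diag)
    return [(k_max / k) ** alpha for k in diag]

# Toy features: the second example has a much larger magnitude than the others.
features = [[1.0, 0.5], [10.0, 5.0], [0.1, 0.2]]

diag = ntk_diagonal(features)      # per-example gradient magnitudes (squared)
weights = adaptive_weights(diag)   # larger weight for smaller-magnitude examples

# With alpha = 1, every weighted diagonal entry equals the largest entry,
# so no single example dominates the weighted gradient signal.
balanced = [w * k for w, k in zip(weights, diag)]
```

In this toy setting the example with the largest magnitude keeps a weight of 1 while the smaller ones are scaled up, which is one simple way to counteract a bias toward large-magnitude targets. In an actual DeepONet the diagonal entries would have to be computed from the network's parameter gradients and refreshed during training.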

research
03/19/2021

Learning the solution operator of parametric partial differential equations with physics-informed DeepOnets

Deep operator networks (DeepONets) are receiving increased attention tha...
research
05/18/2023

PPDONet: Deep Operator Networks for Fast Prediction of Steady-State Solutions in Disk-Planet Systems

We develop a tool, which we name Protoplanetary Disk Operator Network (P...
research
11/09/2021

A research framework for writing differentiable PDE discretizations in JAX

Differentiable simulators are an emerging concept with applications in s...
research
01/13/2020

Understanding and mitigating gradient pathologies in physics-informed neural networks

The widespread use of neural networks across different scientific domain...
research
07/28/2020

When and why PINNs fail to train: A neural tangent kernel perspective

Physics-informed neural networks (PINNs) have lately received great atte...
research
02/01/2023

Learning Functional Transduction

Research in Machine Learning has polarized into two general regression a...
research
09/26/2022

Variationally Mimetic Operator Networks

Operator networks have emerged as promising deep learning tools for appr...
