Multi-Grade Deep Learning for Partial Differential Equations with Applications to the Burgers Equation

09/14/2023
by   Yuesheng Xu, et al.
0

We develop in this paper a multi-grade deep learning method for solving nonlinear partial differential equations (PDEs). Deep neural networks (DNNs) have received super performance in solving PDEs in addition to their outstanding success in areas such as natural language processing, computer vision, and robotics. However, training a very deep network is often a challenging task. As the number of layers of a DNN increases, solving a large-scale non-convex optimization problem that results in the DNN solution of PDEs becomes more and more difficult, which may lead to a decrease rather than an increase in predictive accuracy. To overcome this challenge, we propose a two-stage multi-grade deep learning (TS-MGDL) method that breaks down the task of learning a DNN into several neural networks stacked on top of each other in a staircase-like manner. This approach allows us to mitigate the complexity of solving the non-convex optimization problem with large number of parameters and learn residual components left over from previous grades efficiently. We prove that each grade/stage of the proposed TS-MGDL method can reduce the value of the loss function and further validate this fact through numerical experiments. Although the proposed method is applicable to general PDEs, implementation in this paper focuses only on the 1D, 2D, and 3D viscous Burgers equations. Experimental results show that the proposed two-stage multi-grade deep learning method enables efficient learning of solutions of the equations and outperforms existing single-grade deep learning methods in predictive accuracy. Specifically, the predictive errors of the single-grade deep learning are larger than those of the TS-MGDL method in 26-60, 4-31 and 3-12 times, for the 1D, 2D, and 3D equations, respectively.

READ FULL TEXT

page 17

page 21

research
09/24/2019

D3M: A deep domain decomposition method for partial differential equations

A state-of-the-art deep domain decomposition method (D3M) based on the v...
research
12/24/2022

JDNN: Jacobi Deep Neural Network for Solving Telegraph Equation

In this article, a new deep learning architecture, named JDNN, has been ...
research
02/19/2020

Implicit bias with Ritz-Galerkin method in understanding deep learning for solving PDEs

This paper aims at studying the difference between Ritz-Galerkin (R-G) m...
research
05/13/2023

Successive Affine Learning for Deep Neural Networks

This paper introduces a successive affine learning (SAL) model for const...
research
05/16/2023

A Note on Dimensionality Reduction in Deep Neural Networks using Empirical Interpolation Method

Empirical interpolation method (EIM) is a well-known technique to effici...
research
07/26/2020

Train Like a (Var)Pro: Efficient Training of Neural Networks with Variable Projection

Deep neural networks (DNNs) have achieved state-of-the-art performance a...
research
02/01/2023

Multi-Grade Deep Learning

The current deep learning model is of a single-grade, that is, it learns...

Please sign up or login with your details

Forgot password? Click here to reset