Self-scalable Tanh (Stan): Faster Convergence and Better Generalization in Physics-informed Neural Networks

04/26/2022
by Raghav Gnanasambandam, et al.

Physics-informed Neural Networks (PINNs) are gaining attention in the engineering and scientific literature for solving a range of differential equations, with applications in weather modeling, healthcare, manufacturing, and other fields. Poor scalability is one of the barriers to using PINNs for many real-world problems. To address this, a Self-scalable tanh (Stan) activation function is proposed for PINNs. The proposed Stan function is smooth, non-saturating, and has a trainable parameter. During training, it allows gradients to flow easily when computing the required derivatives and also enables systematic scaling of the input-output mapping. It is shown theoretically that PINNs with the proposed Stan function have no spurious stationary points when trained with gradient descent algorithms. Stan is first tested in several numerical studies on general regression problems. It is then used to solve multiple forward problems, which involve second-order derivatives and multiple dimensions, and an inverse problem in which the thermal diffusivity of a rod is predicted from heat conduction data. These case studies establish empirically that the Stan activation function can achieve better training and more accurate predictions than the existing activation functions in the literature.
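The abstract does not state the functional form of Stan. The following is a minimal sketch, assuming the form Stan(x) = tanh(x) + beta * x * tanh(x) with a trainable scalar beta; this form, the default value of beta, and the function names `stan`, `stan_grad_x`, and `stan_grad_beta` are illustrative assumptions that should be checked against the full text. It shows why such a function would be non-saturating: for large |x| the derivative with respect to x approaches +beta or -beta instead of vanishing as it does for plain tanh.

```python
import math


def stan(x: float, beta: float = 1.0) -> float:
    """Assumed Stan form: tanh(x) + beta * x * tanh(x)."""
    t = math.tanh(x)
    return t + beta * x * t


def stan_grad_x(x: float, beta: float = 1.0) -> float:
    """Derivative of the assumed form with respect to the input x."""
    t = math.tanh(x)
    sech2 = 1.0 - t * t  # derivative of tanh(x)
    return sech2 + beta * (t + x * sech2)


def stan_grad_beta(x: float) -> float:
    """Derivative of the assumed form with respect to the trainable beta."""
    return x * math.tanh(x)


if __name__ == "__main__":
    # Compare the input gradient of plain tanh with the assumed Stan form.
    for x in (0.0, 2.0, 20.0, -20.0):
        tanh_grad = 1.0 - math.tanh(x) ** 2
        print(f"x={x:6.1f}  tanh'={tanh_grad:.3e}  stan'={stan_grad_x(x, beta=0.5):.3e}")
```

In a PINN, an automatic differentiation framework would compute these derivatives, and beta would be updated by the optimizer together with the weights; the hand-written gradients here only make the non-saturating behaviour explicit.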


