Analyzing the Performance of Deep Encoder-Decoder Networks as Surrogates for a Diffusion Equation

Neural networks (NNs) have proven to be a viable alternative to traditional direct numerical algorithms, with the potential to accelerate computational time by several orders of magnitude. In the present paper we study the use of encoder-decoder convolutional neural network (CNN) as surrogates for steady-state diffusion solvers. The construction of such surrogates requires the selection of an appropriate task, network architecture, training set structure and size, loss function, and training algorithm hyperparameters. It is well known that each of these factors can have a significant impact on the performance of the resultant model. Our approach employs an encoder-decoder CNN architecture, which we posit is particularly well-suited for this task due to its ability to effectively transform data, as opposed to merely compressing it. We systematically evaluate a range of loss functions, hyperparameters, and training set sizes. Our results indicate that increasing the size of the training set has a substantial effect on reducing performance fluctuations and overall error. Additionally, we observe that the performance of the model exhibits a logarithmic dependence on the training set size. Furthermore, we investigate the effect on model performance by using different subsets of data with varying features. Our results highlight the importance of sampling the configurational space in an optimal manner, as this can have a significant impact on the performance of the model and the required training time. In conclusion, our results suggest that training a model with a pre-determined error performance bound is not a viable approach, as it does not guarantee that edge cases with errors larger than the bound do not exist. Furthermore, as most surrogate tasks involve a high dimensional landscape, an ever increasing training set size is, in principle, needed, however it is not a practical solution.

READ FULL TEXT

page 3

page 5

page 8

page 9

page 17

research
01/22/2019

Understanding Geometry of Encoder-Decoder CNNs

Encoder-decoder networks using convolutional neural network (CNN) archit...
research
01/19/2022

Deep Capsule Encoder-Decoder Network for Surrogate Modeling and Uncertainty Quantification

We propose a novel capsule based deep encoder-decoder model for surrogat...
research
11/17/2020

Theory-guided Auto-Encoder for Surrogate Construction and Inverse Modeling

A Theory-guided Auto-Encoder (TgAE) framework is proposed for surrogate ...
research
12/14/2020

DSM Refinement with Deep Encoder-Decoder Networks

3D city models can be generated from aerial images. However, the calcula...
research
03/22/2016

Learning Discriminative Features using Encoder-Decoder type Deep Neural Nets

As machine learning is applied to an increasing variety of complex probl...
research
05/24/2023

On the Generalization of Diffusion Model

The diffusion probabilistic generative models are widely used to generat...
research
08/17/2019

A Sensitivity Analysis of Attention-Gated Convolutional Neural Networks for Sentence Classification

Recently, Attention-Gated Convolutional Neural Networks (AGCNNs) perform...

Please sign up or login with your details

Forgot password? Click here to reset