From partial data to out-of-sample parameter and observation estimation with Diffusion Maps and Geometric Harmonics

by   Eleni D. Koronaki, et al.

A data-driven framework is presented, that enables the prediction of quantities, either observations or parameters, given sufficient partial data. The framework is illustrated via a computational model of the deposition of Cu in a Chemical Vapor Deposition (CVD) reactor, where the reactor pressure, the deposition temperature and feed mass flow rate are important process parameters that determine the outcome of the process. The sampled observations are high-dimensional vectors containing the outputs of a detailed CFD steady-state model of the process, i.e. the values of velocity, pressure, temperature, and species mass fractions at each point in the discretization. A machine learning workflow is presented, able to predict out-of-sample (a) observations (e.g. mass fraction in the reactor) given process parameters (e.g. inlet temperature); (b) process parameters given observation data; and (c) partial observations (e.g. temperature in the reactor) given other partial observations (e.g. mass fraction in the reactor). The proposed workflow relies on the manifold learning schemes Diffusion Maps and the associated Geometric Harmonics. Diffusion Maps is used for discovering a reduced representation of the available data, and Geometric Harmonics for extending functions defined on the manifold. In our work a special use case of Geometric Harmonics is formulated and implemented, which we call Double Diffusion Maps, to map from the reduced representation back to (partial) observations and process parameters. A comparison of our manifold learning scheme to the traditional Gappy-POD approach is provided: ours can be thought of as a "Gappy DMAP" approach. The presented methodology is easily transferable to application domains beyond reactor engineering.


Landmark Diffusion Maps (L-dMaps): Accelerated manifold learning out-of-sample extension

Diffusion maps are a nonlinear manifold learning technique based on harm...

Grassmannian diffusion maps based surrogate modeling via geometric harmonics

In this paper, a novel surrogate model based on the Grassmannian diffusi...

Manifold Markov chain Monte Carlo methods for Bayesian inference in a wide class of diffusion models

Bayesian inference for partially observed, nonlinear diffusion models is...

Quantification of CO2 generation in sedimentary basins through Carbonate Clays Reactions with uncertain thermodynamic parameters

We develop a methodological framework and mathematical formulation which...

A reduced model for compressible viscous heat-conducting multicomponent flows

In the present paper we propose a reduced temperature non-equilibrium mo...

Block particle filters for state estimation of stochastic reaction-diffusion systems

In this work, we consider a differential description of the evolution of...

Latent common manifold learning with alternating diffusion: analysis and applications

The analysis of data sets arising from multiple sensors has drawn signif...

Please sign up or login with your details

Forgot password? Click here to reset