Reconstruction of three-dimensional porous media using generative adversarial neural networks
To evaluate the variability of multi-phase flow properties of porous media at the pore scale, it is necessary to acquire a number of representative samples of the void-solid structure. While modern x-ray computer tomography has made it possible to extract three-dimensional images of the pore space, assessment of the variability in the inherent material properties is often experimentally not feasible. We present a novel method to reconstruct the solid-void structure of porous media by applying a generative neural network that allows an implicit description of the probability distribution represented by three-dimensional image datasets. We show, by using an adversarial learning approach for neural networks, that this method of unsupervised learning is able to generate representative samples of porous media that honor their statistics. We successfully compare measures of pore morphology, such as the Euler characteristic, two-point statistics and directional single-phase permeability of synthetic realizations with the calculated properties of a bead pack, Berea sandstone, and Ketton limestone. Results show that GANs can be used to reconstruct high-resolution three-dimensional images of porous media at different scales that are representative of the morphology of the images used to train the neural network. The fully convolutional nature of the trained neural network allows the generation of large samples while maintaining computational efficiency. Compared to classical stochastic methods of image reconstruction, the implicit representation of the learned data distribution can be stored and reused to generate multiple realizations of the pore structure very rapidly.READ FULL TEXT VIEW PDF
Reconstruction of three-dimensional porous media using generative adversarial neural networks
The reconstruction and the evaluation of the material properties of porous media plays a key role across many engineering disciplines. Many physical processes such as the movement of multiple phases of fluids through sedimentary rocks are controlled by individual pores at the micron and sub-micron scale Blunt (2017).
In carbon capture and sequestration (CCS), the long term storage behavior is controlled by the physical and chemical interaction of super-critical with the reservoir brine, as well as the spatial distribution and connectivity of minerals in the pore-space Juanes et al. (2006); Kang et al. (2010). The variability of the controlling properties such as the permeability of the host rock is determined by repeated experiments or numerical modeling of these processes.
Using modern computer tomographic methods, it is possible to observe porous materials and evaluate their material properties at the micrometer scale (micro-CT) under static and transient conditions at high pressures and temperatures in near real time. Performing micro-CT imaging of porous media requires specialized, expensive equipment and in the case of CCS, only a single image of the investigated rock type is typically acquired.
To evaluate the variability associated with the geometrical and mineralogical morphology of the pore-space, numerous physical experiments using the same rock type would have to be performed to obtain a distribution over larger volumes. Due to time and cost limitations inherent with the experimental acquisition of high-resolution images, this is often deemed unfeasible. Material properties governing the single and multi-phase flow behavior of porous media can be estimated from numerical solution of partial differential equations at the scale of a representative elementary volume (REV) and verified by experimental resultsMostaghimi et al. (2013).
Many sedimentary rocks consist of granular siliciclastic or carbonate materials. Boolean models use this fundamental characteristic of natural granular materials to emulate the shape of the arising pore space, due to an underlying random process that controls the distribution of the individual grains Matheron (1975); Serra (1980)
. While for the classical Boolean model, the centers of the grains are uniformly distributed in space and grains can arbitrarily overlap, more complicated models with rigid hard sphere grains and more complex grain interaction functions have been developedMatheron (1971); Arns et al. (2009); Rikvold and Stell (1985); Torquato (2013). The framework of Boolean models also allows extension beyond spherical particles and enables derivation of the properties of material models as
In sedimentary rocks, the arrangement of individual grains occurs due to the transport of material from a high energy source to a low energy sink. Process models, where depositional mechanisms are simulated, have been shown to reproduce realistic granular reconstructions capturing the pore space morphology of granular sedimentary rocks Øren and Bakke (2003).
Spatial probabilistic models such as truncated Gaussian processes or sequential indicator simulation have been widely applied in the geosciences to model the spatial distribution of materials Pyrcz and Deutsch (2014). Many of these methods rely on two-point probability functions as a measure of spatial variability, whereas recent methods in geostatistics use training images as a basis for sample reconstruction Caers and Zhang (2004); Mariethoz et al. (2010); Meerschman et al. (2013). These images are usually assumed to exhibit stationarity of the probability distribution of the properties of interest and rely on higher order multiple point statistics (MPS) to reconstruct stochastic random media.
With MPS, the probability distributions are represented by training images and are sampled using a limited multi-scale neighborhood that captures the variation on a large scale, as well as fine structural details on smaller scales Tahmasebi et al. (2014). MPS based methods have been used in two and three-dimensional conditional simulation of spatial properties in reservoir-scale earth modeling applications Comunian et al. (2011). The computational complexity of these methods is highly dependent on individual algorithms as well as the size of the domains used to sample from the training images Mariethoz and Caers (2014). Parallelized versions have been developed, reducing the computational time required to perform reconstruction using multiple point statistics Straubhaar et al. (2011); Huang et al. (2013).
Stochastic methods based on simulated annealing allow the incorporation of arbitrary cost functions of statistical and morphological properties used in unconditional three-dimensional image reconstruction Smith et al. (1983); Svergun (1999). Recent advances have reduced the computational runtime of simulated annealing based methods for reconstruction of porous media, to the order of tens of hours per realization at the scale of voxels Pant (2016).
In the following section, we introduce a recently developed class of unsupervised machine learning methods called generative adversarial networks (GAN) that allow simulation of probability distributions given a set of training dataGoodfellow et al. (2014). Volumetric generative adversarial networks have previously been applied to low-resolution three-dimensional CAD model synthesis, and practical applications of 3D-GANs are few compared to their two-dimensional counterpart Wu et al. (2016). Integration of multi-resolution datasets incorporating image data across a number of length scales is possible in the GAN framework by using a Laplacian pyramid approach such as StackGAN Zhang et al. (2016).
We investigate the applicability of GANs to model three-dimensional textures of rocks based on three-dimensional binary representations of porous media acquired at the micrometer scale. We compare statistical, morphological and transport properties of the simulated images with those of the training images. We evaluate the single-phase directional permeability to show that the synthetic realizations sampled from the learned representation of the input data can capture single-phase flow properties of sedimentary rocks.
Training of these neural networks involves finding a set of hyperparameters that lead to stable trainingGoodfellow (2016)
. While this training can take on the order of tens of hours, the sampling of large volumetric domains occurs on the order of seconds on the current generation of graphical processing units (GPU). We show that in favorable cases convolutional neural networks incorporated in the GAN framework allow the generation of synthetic reconstructions of porous media that exceed the dimensions of their training images. Contrary to most existing simulation techniques the set of parameters used to generate synthetic realizations can be stored once trained allowing rapid generation of new samples to assess the variability of material properties.
While we apply GANs to a set of micro-CT images of porous media, the method can readily be applied to volumetric images of porous media obtained from other three-dimensional microscopy instruments such as nano or medical-CT instruments.
We discuss the challenges involved in training GANs for stochastic image reconstruction of porous media, as compared to other stochastic image reconstruction methods and evaluate the computational efficiency of GAN based image reconstruction. Finally, we provide empirical guidelines on the requirements of the input dataset to allow successful training of GANs on large three-dimensional voxel representations of natural porous media.
All data used in this study is available in the public domain and we have made the code used for training, as well as example pre-trained models, available as additional supporting material 111https://github.com/LukasMosser/PorousMediaGan. A public dataset of high-resolution micro-CT images made available by the Imperial College Pore-Scale Modelling Group 222http://www.imperial.ac.uk/earth-science/research/research-groups/perm/research/pore-scale-modelling/micro-ct-images-and-networks/, of a spherical beadpack, Berea sandstone, and oolitic Ketton limestone will serve as benchmark cases to study the application of GANs to three-dimensional stochastic image reconstruction.
In the following section, we present generative adversarial networks (GAN) in the context of three-dimensional image generation. Generative neural networks have been developed in the context of deep learning by Goodfellow et al. as a methodology to learn a representation of a high-dimensional probability distribution from a given datasetGoodfellow et al. (2014). In the context of image reconstruction, we refer to this dataset as a set of training images that present representative samples of the probability distribution underlying the image space.
GANs learn an implicit representation of the probability density as opposed to explicit density models. The main drawback of explicit density models is their computational cost which grows with the dimensionality of the samples and requires sequential simulation of each voxel. For high-dimensional samples such as volumetric image data, the computational cost is where represents the number of voxels in the domain of interest and can easily exceed voxels for modern high-resolution micro-CT image data. Using any of these methods would make it intractable to generate a large number of very large samples. GANs have been designed to perform fast sampling from the learned density representation and allow full parallel generation, making them an ideal candidate to generate large volumetric images Goodfellow (2016).
GANs consist of two differentiable functions: a discriminator and a generator . The discriminator receives samples of the ”real” dataset (Label 1) and ”fake” samples (Label 0) created by the generator from the hidden latent space (see Fig. 1 above). The latent space
is composed of independent real random variables, typically normally or uniformly distributed, that represent the random input to the generator. The generator maps random variables from the latent space into the space of images. The discriminator’s role is to assign a probability that a random sample is from the ”real” data distribution . The discriminator tries to label each sample correctly, while the generator tries to ”fool” the discriminator into labeling the fake images as part of the true data distribution and therefore achieving close to one.
More formally we can define the loss i.e. the cost function for GANs as a minimization-maximization problem
Solutions to this optimization problem have been shown to be Nash equilibria, where each player achieves a local minimum of their loss function with respect to their parametersGoodfellow (2016).
In practice we represent and by convolutional neural networks that are trained by a gradient descent based optimization method. Training is performed in two steps: First the discriminator is trained to maximize
while the parameters of the generator are fixed. This improves the ability of the discriminator to distinguish between real and fake images.
In a subsequent step we generate synthetic samples by drawing samples from an N-dimensional normal distributed latent space and train the generator to minimize
while keeping the discriminator fixed.
By minimizing Eq. (3) the generator tries to ”fool” the discriminator into believing that the samples are real data samples. In this way the generator learns to represent a distribution that is as close as possible to the real data distribution . When convergence is reached and the value of the discriminator becomes as it cannot distinguish between the two anymore.
Initially, the discriminator outperforms the generator significantly making the gradient used to train the generator close to zero. Therefore, instead of minimizing for the generator, it is helpful to maximize Goodfellow (2016).
GANs show highly unstable behavior during training and a large number of trial and error runs are required to find an optimal set of hyperparameters that allow stable training. A number of heuristics have been published which have been shown to stabilize GAN training, such as one-sided label smoothing and adding white noise to the input layer of the discriminatorSalimans et al. (2016); Kaae Sønderby et al. (2016).
In the following section we outline the criteria used to evaluate the quality of simulations based on the training image datasets. We treat all images under the assumption of stationarity and the existence of a representative elementary volume.
We characterize the second order structure of the porous media by calculating the two-point probability function of the pore phase. By assuming stationarity, this function is equivalent to the non-centered covariance Matheron (1971):
which is the probability that two points and , separated by the lag vector , are located in the pore phase . At the origin, is equal to the porosity . stabilizes around as (Fig. 2). Due to the anisotropic nature of many porous media, we compute along the three Cartesian directions, as well as the radial average of .
It is a well known result that the specific surface area of a porous medium can be expressed as a function of Debye et al. (1957). In the case of an isotropic porous medium and in three-dimensions is related to by
where is the derivative of at the origin.
Furthermore, the average chord length within the pore and the grain phases are Torquato (2013)
which for the pore phase can be readily found from the intersection of the slope of with the x-axis.
In favorable cases, it is possible to find analytical expressions of from the spatial distribution and geometry of the grains. A Boolean model of overlapping spherical grains of uniform spatial distribution exhibits an exponential decay of the covariance until the lag distance is equal to the diameter of the grains where it becomes zero Matheron (1971). For porous media that can be well described by a Boolean model, we can estimate the size of the elementary Boolean grain from the decay of .
Semi-analytical expressions for more complex models such as for a packing of hard spheres have been developed Torquato and Lado (1985). Models of for spherical packings exhibit a dampened oscillation. The shape of the estimated covariance, therefore, allows us to obtain information on the structure of the porous medium (see Fig. 2 above).
The covariance was estimated for the training images and the stochastic reconstructions generated by the trained GAN model. For each GAN model, we evaluate the non-centered covariance as well as the specific surface area [Eq. (5)] and compare these to the values obtained from the original training images.
In our discussion on the required training image sizes (Sec. VI), we will use the average chord length and the specific surface area as possible indicators of the necessary training image size.
It has been shown that flow properties at the pore-scale can be related to morphological characteristics of the void-solid interface of a porous medium Scholz et al. (2015). Hadwiger’s theorem states that the size of a body in a -dimensional space can be described by a linear combination of independent parameters characterizing the body. In three dimensions we can, therefore, define four so-called Minkowski functionals that fully characterize the size of a three-dimensional object. We compute estimates of three Minkowski functionals; the porosity , the specific surface area and the Euler characteristic corresponding to the zero, first and 3rd order functionals. We compute the densities of the Minkowski functionals by dividing by the volume .
The Minkowski functional of order zero is the porosity, defined as the ratio of volume of the void space to the bulk volume of the sample
and is, therefore, a measure of the ability of a porous medium to store fluids.
The Minkowski functional of rank one is the specific surface area .
where integration occurs over the void-solid interface S. The specific surface area has dimensions of and its inverse allows us to define a characteristic pore size.
The specific Euler characteristic is closely related to the order three Minkowski functional and represents a dimensionless quantity defined as
where and are the principal radii of curvature of the void-solid interface. To compute we do not directly evaluate the integral in Eq. (9) but instead make use of a relationship for the Euler characteristic of arbitrary polyhedra,
where is the number of vertices, the number of edges, the number of faces and the number of objects Blunt (2017). This expression is the basis for efficient algorithms to compute Minkowski functionals of arbitrary geometric bodies represented as volumetric voxelized domains Lang et al. (2001). To compute these three Minkowski functionals we have used the open-source image morphological software library MorphoLibJ Legland et al. (2016).
While the porosity expresses the ability to store fluids in a porous medium, adsorption and dissolution processes are controlled by the specific surface area. The Euler characteristic allows the connectivity of the porous medium to be characterized, which is a critical component in the ability of fluids to flow. Reconstructions of porous media should therefore closely match the observed Minkowski functionals to represent the behavior of relevant physical processes at the pore-scale.
The direct computation of the specific surface area and porosity from images allows us to perform a comparison with the values obtained from estimates obtained by computing the empirical non-centered covariance [see Eq. (5)].
|Training Image Dataset|
|Training Image Size||voxels||voxels||voxels|
|Latent Space Dimension||100||512||100|
|Optimizer||Generator + Discriminator: Adam|
|Learning Rate / Momentum||/ 0.5||/ 0.5||/ 0.5|
|Stabilization||White Noise ()||Label Smoothing ()||White Noise ()|
To evaluate the single-phase permeability of the porous media and their generated synthetic reconstructions we solve the Stokes equations for slow, incompressible flow assuming small inertial forces.
The Stokes equations are solved on the domain that is connected to the fluid inlet and outlet. This allows us to define an effective porosity where only the fraction of the pore space that also contributes to flow is considered
The neural network architecture used for the three-dimensional image reconstruction corresponds to a volumetric version of the DCGAN network Radford et al. (2015). The network consists of two independent fully convolutional neural networks, the generator and the discriminator . Upsampling from the input latent vector2015); Nair and Hinton (2010).
The discriminator receives images sampled from the latent space by the generator and images from the set of training images representing . Therefore, the size of the input layer of the discriminator corresponds to the dimensions of the input training images. The discriminator consists of volumetric convolution layers combined with LeakyReLU activation functions Maas et al. (2013). The final convolutional layer of the discriminator is followed by a Tanh activation function.
This combination of generator and discriminator neural network architectures has previously been applied to subsets of the Imagenet and CIFAR-10 datasetsRadford et al. (2015). The hyperparameters for the generator to be used in the optimization of the neural network architecture are the number of trainable convolutional filters in each layer of the neural network , and the size of the latent vector .
The generator and discriminator are optimized using a gradient descent based method where the parameters are changed by taking steps in the gradient
where is the learning rate. We have used the gradient descent based optimiser ADAM for optimization of both the generator and discriminator Kingma and Ba (2014).
GANs have been shown to exhibit unstable behavior during training. The addition of Gaussian noise to the input of the discriminator provides an effective measure to prevent mode collapse and stabilize the training process Kaae Sønderby et al. (2016). An additional stabilization measure called one-sided label smoothing, wherein the class label of 1 for real images is replaced by a new value of has been empirically shown to improve training of GANs Salimans et al. (2016).
Both label smoothing and white noise addition to the input of the discriminator have been used in this study to stabilize the training based on the volumetric image datasets. Table 1 gives an overview of the neural network hyperparameters used for each evaluated sample, the hyperparameters and the stabilization measure used during training.
Images generated by the GAN were post-processed using a median filter to remove single-pixel noise. The resulting images are grayscale images with all voxel values close to zero or one. To compare the resulting images to the binary training images, we segment the generated images using Otsu’s method Otsu (1975).
To evaluate the applicability of GANs for reconstruction of natural porous media we use three previously acquired datasets. All images have been segmented into a three-dimensional binary voxel representation of the pore space (white) and grain structure (black) (Fig. 3). We create a training database of images by extracting sub-volumes from the voxelized binary images. Ideally, these training images should represent independent domains, but due to the limited size of these images, we extract subsets that overlap.
Training image sizes were chosen based on an estimate of the average grain size for each sample. To be able to match the covariance [Eq. (4)] and image morphological characteristics, training images larger than the structuring element were necessary. We discuss this requirement in more detail in the discussion of our results (see Section VI). Due to computational limitations, training image sizes exceeding voxels were not considered.
The beadpack is based on a real packing of equally sized ceramic grains in a disordered close packing Finney (1970). The image consists of voxels with a size of 3 . The size of an individual sphere is 50 voxels. 1727 training images were extracted of size voxels corresponding to a spacing of 32 voxels between them in the original image.
Berea sandstone is a fluvial sandstone of medium to fine grain size (Wentworth classification) Pepper (1954). The individual grains are bonded by clays. The sample analyzed in this study was acquired from an outcropping of the Berea sandstone in a quarry near Berea, Ohio. De Witt showed that the Berea sandstone was deposited in the early Carboniferous (354-323 Mya) de Witt Jr (1970).
The image of Berea sandstone consists of angular grains with no clay presence in the intergranular pore-space. The image has dimensions of voxels with a voxel size of 3 .
To capture the local interaction of grains we have extracted training images at voxels which allows a number of grains to be present in one training image (see Sec. VI). Due to the small image size of voxels, subvolumes were extracted at a spacing of 16 voxels. In all, 10647 training images were used for the image reconstruction.
The Ketton sample is an oolitic limestone of Jurassic age (201.3-145 Mya). The sample was acquired from a quarry of Lincolnshire limestone in the North-East of England. The oolites contained in the Lincolnshire formation are mainly non-ferroan calcite grains. The oolitic limestones of the Lincolnshire show a wide variety of cementation, ranging from uncemented oolite sands with no intergranular cement to heavily ferroan spar-cemented oolites with infilled microporosity Emery (1988). Microstructures in the pore space can be observed that lead to a reduction in porosity (Fig. 3).
The Ketton sample chosen for this study consists of large grains compared to the overall image size. The image used for the following evaluation has been downsampled from a voxel representation to an image size of voxels. This allows more grains to be resolved per training image extracted from the full volume. The downsampled voxel size is 15.2 .
Training images were extracted at a sub-volume size of with a spacing of 8 voxels leading to a total of 15624 training images. The small spacing of the training images results from the small CT image size of voxels.
|Training Image||Synthetic||Training Image||Synthetic||Training Image||Synthetic|
Three GANs were trained based on the network architectures highlighted in Sec. III.2. The training time for each dataset was 24 hours. Manual inspection of synthetic realizations was performed during training to ensure convergence and intermediate evaluation of the covariance and Minkowski functionals.
Figure 4 shows the training curve for the Berea sandstone dataset. Initially the generator loss function [see Eq. (3)] is very high and no structural components can be observed in the samples. After a large reduction in the loss function of the generator, initial structures are observed. Image reconstruction quality significantly improves with the number of generator iterations, but cannot be linked to the loss function of the generator. This can be observed from the increase in generator loss at the end of training while image quality improves significantly.
The final GAN models were subsequently evaluated in terms of their directional and radial averaged non-centered covariance , Minkowski functionals and the single-phase permeability.
For all datasets, 20 realizations were generated using the trained GAN model. In the following section, we present the results of the evaluation of the properties outlined in Sec. III.1 and compare these to the properties of the original input training image.
The evaluation of the non-centered covariance for the beadpack (Fig. 6) shows a strong hole effect reflecting the spherical nature of the grains.
A GAN model was trained for 24 hours on the beadpack training image dataset. The GAN model achieves a small error in the porosity of the generated images with a tendency towards higher porosities (Fig. 8).
A bias can be observed for the specific surface area and the Euler characteristic of the microstructure (Table 2).
This bias can be explained by the deviation of the grains from a perfect spherical shape in the synthetic realizations. Due to the smooth nature of the spherical particles in the training image, any deviation from this geometry will lead to an increase in the surface area. This is reflected by a higher specific surface area for the synthetic realizations. In addition we observe a reduction in connectivity, represented by a less negative Euler characteristic.
The directional covariance measured on the generated samples show excellent agreement up to the training image size of voxels and stabilizes at (see Fig. 8). As expected no directional variation of the covariance is observed and the sample is therefore assumed to be isotropic.
Single-phase permeability shows a close agreement in both magnitude and variance between the measured training image and the synthetic realizations (Fig.8). Figure 6 shows a crossplot of the effective porosity i.e. the porosity open to flow [Eq. (12)], and the single-phase permeability exhibiting a similar trend in the distribution of values computed on training images and synthetic realizations.
We provide a comparison of all twenty realizations generated by the GAN model in cross-sections through the x-y plane of the original model and a synthetic realization in Fig. 9.
Many of the grains show a circular to ellipsoidal shape, which considering the fact that a priori the GAN model does not have any knowledge of the geometry of the grains, learning a representation of a perfect sphere can be considered challenging (see Sec. VI). The complex grain-grain interface where individual beads contact at single points can be observed for numerous grain arrangements in the generated realizations.
The radial averaged covariance in Fig. 11, shows a near exponential decay and stabilisation occurs at a lag distance of 30 voxels for both covariance functions obtained from the Berea training image and synthetic realizations generated by the GAN model.
Additionally, Fig. 13 shows that the directional two-point statistics characterized by the directional covariances is captured in the generated images. This is shown by comparing the small hole effect observed in the z-direction of the Berea sample with the x-direction where a near exponential decay can be observed. In both cases, the GAN model shows excellent agreement and closely follows the trend of the empirical estimates of .
The results of the direct computation of the Minkowski functionals is presented in Fig. 13 and show comparable distributions for the porosity , specific surface area and the Euler characteristic of the training images and the synthetic realizations.
A comparison of the specific surface area obtained from the covariance and the direct computation of the Minkowski functional, show nearly equal values (Table 2).
The obtained estimates of the single-phase permeability show a similar distribution covering the range of effective permeability measured on the training images. Figure 11 shows the computed values of permeability and the corresponding effective porosity. The permeability of the synthetic realizations capture the values, variability and trend obtained from the Berea training image dataset.
Figure 14 shows a comparison of twenty realizations of the GAN model trained on the Berea dataset. A smaller training image size of voxels was used, as compared to the beadpack ( voxels). This is due to the smaller size of the structuring elements observed in the training image. A smaller training image size was therefore sufficient to capture the long and short range correlation found in the Berea sample.
The covariance of the Ketton limestone shown in Fig. 16, shows a pronounced hole effect due to the ellipsoidal oolitic grains. Due to the hole effect observed in the radial averaged covariance (Fig. 16), we relate the Ketton sample to a hard-sphere model. Figure 18 indicates that the images generated by the GAN model trained on the Ketton image, capture the oscillatory and anisotropic behavior of the covariance observed in Ketton. The specific surface area derived from the generated images is in close agreement with the training data. An error of approximately was achieved in the porosity of the GAN generated images compared to the original Ketton dataset (Fig. 18).
The measured specific surface area of the synthetic images shows a higher variance compared to the original training images. Nevertheless, the average values of the porosity and specific surface area derived from the non-centered covariance [see Eq. (5)] are in good agreement with values obtained from direct image morphological estimation (see Table 2).
The distribution of single-phase permeability estimates of the synthetic GAN realizations overlies the permeability values of the Ketton training images.
The Euler characteristic and the permeability of the Ketton training dataset are closely matched by the synthetic images and therefore capture the connectivity observed in the oolitic Ketton limestone.
We present an overview of the 20 realizations generated by the GAN model trained on the Ketton dataset in Fig. 19.
This paper presents a novel method for three-dimensional stochastic image reconstruction based on generative adversarial neural networks (GAN) trained on three-dimensional segmented images. To summarize, the objectives of this contribution are threefold. Firstly, the generation of stochastic reconstructions of porous media such as sedimentary rocks exceeding the size of the acquired image datasets. Secondly, to evaluate the ability of GAN models to capture the image morphological and physical properties of micro-scale porous media. Thirdly, to establish a method of stochastic image reconstruction that allows a probabilistic treatment of pore-scale properties such as permeability without the need to acquire numerous images of a single rock type.
The first objective stems from technical limitations of micro-CT data acquisition. Images are acquired as a trade-off between sample size i.e. how many representative structures can be captured in one image versus the resolution at which these pore-scale structures are resolved. The generation of large porous domains based on high-resolution images enables this gap in scales to be bridged and micro-scale features to be incorporated in macro-scale models.
Our findings show that GANs can learn an implicit representation of the image space given a limited number of training images subsampled from larger images. These subdomains were extracted based on characteristic length scales (see Sec. III.1.1) and serve as a training set for the GAN model. For the Ketton limestone, a small spacing of the extracted subdomains was required to increase the size of the training image dataset. While we did not find any evidence of an introduced bias by using correlated subdomains, we believe that these extracted training images should represent independent regions.
We have evaluated the ability to train GANs for a number of training image sizes less than and up to twice the size of the structuring elements. We have found that models trained on images smaller than the average grain size results in artifacts and distorted shapes occurring in the generated micro-structures. For the beadpack, the size of an individual sphere is 50 voxels. A training image of voxels would typically only contain parts of an individual grain and only capture the interaction of the particles, but not the geometry of the structuring element. For the beadpack, models trained on voxels were successful in learning a representation of the short scale micro-structure but failed to reproduce the long distance correlation. A larger training image of voxels, as was used to model the beadpack has a much higher chance to represent the full geometry of the particles and therefore not only learn interactions, but also the shapes of grains.
We, therefore, suggest that training images extracted from large datasets must be larger than the average grain size. For models that are well described by a Boolean model, the size of the structuring element can be readily estimated from stabilization of the covariance . For more complex samples a different measure must be used to estimate the size of the required training image.
The chord length is one additional measure that can be obtained to characterize the grain space of porous media. While we have found that the mean chord length of the grain space is always less than or equal to our chosen training image size, increases with decreasing porosity. This contradicts the need to have the largest training domain for the beadpack sample which also has the highest porosity. A better estimate may be related to the representative elementary volume of the specific surface area which by definition is the same for the grain and pore space and is, therefore, more representative of the morphology of the porous medium Bear (2013). Based on the properties we have evaluated we could not find a measure derived from two-point statistical or image morphological properties that is closely related to the required training image size and we see a theoretical discussion of this as possible future work.
Conceptually the simplest model considered in this study, the spherical beadpack, has proven to be the most challenging as a training image for the GAN model (Sec. V.1). While we observe spherical and ellipsoidal shapes in the resulting realizations (see Fig. 9), the shape is exactly defined by the spherical nature of the grains. Any deviation from this shape, which for GANs, is learned implicitly from the data itself, will lead to a misrepresentation of the effective properties. Random hard-sphere models with spherical grains will efficiently capture the nature of this dataset. Therefore we suggest a fit-for-purpose application of GANs, for training images that exhibit variability of grain sizes and shapes, which are not readily captured by a simpler model.
While for many sedimentary granular rocks representative volumetric images can be obtained, this may be more challenging for carbonate samples with complex pore-grain structures. The three training images considered in this study were all treated under the assumption of stationarity i.e. we do not expect a variation in the mean and variance of the averaged properties as a function of location. In theory, GANs are not limited to learning representations of stationary datasets. This is shown by the many successful applications for two-dimensional image and texture synthesis of non-stationary domains, such as learned image representations of human faces Gauthier (2014) or galaxies Schawinski et al. (2017); Ravanbakhsh et al. (2016). Therefore a model that incorporates non-stationarity for a single rock-type would technically be possible in the GAN framework but would require the acquisition of many images of the same porous medium.
A valid representation of the microscale variability and connectivity of the pore space is critical to assess the single and multi-phase flow behavior of porous media. Therefore any stochastic reconstruction method used in the process of deriving or evaluating the variability of micro-scale properties must capture the statistical and image morphological characteristics of the reconstructed porous medium. While we have shown that for the evaluated datasets, the GAN based image reconstructions capture the variation and characteristics of these porous media, a number of challenges arise in this task that are fundamentally different to classical stochastic methods of image reconstruction.
For porous media, many flow related properties can be related to the porosity. Classical stochastic methods are able to capture the porosity efficiently by defining a specific proportion of the grain and pore domain. The GAN based model presented in this study initially has no knowledge of the porosity. The porosity, therefore, arises as a feature of the training image data. Matching the porosity distribution of the training image dataset was found to be the main challenge in training a GAN model. An error of three percent in porosity would lead to a mismatch in the permeability of the synthetic images. It is, therefore, necessary to continuously monitor the derived properties such as the Minkowski functionals or estimates of the permeability, in the course of training the neural networks to ensure that synthetic realizations created by the GAN model are able to capture the effective properties of the micro-scale domains.
While this can be considered one of the main challenges in the application of GANs for synthetic image reconstruction, learning an implicit representation of the training data itself can be seen as a strength. Many classic stochastic methods rely on the formulation of an objective function that ensures that statistical properties are captured in the generated realizations e.g. matching and the specific surface area of the stochastic reconstructions to a desired precision. The GAN approach does not require an explicit objective function a priori. The objective function is encoded in the discriminator and adapted in the course of training.
During adversarial training both the generator and discriminator are continuously improved. The discriminator’s sole purpose is to be able to distinguish real training data from generated synthetic data. On the other hand, the generator tries to generate synthetic data that the discriminator is not able to distinguish from the training data. Due to the multi-scale representation of the convolutional neural networks, these features must be learned across the full range of length scales present in the training data, leading to a high-resolution image that captures small and large scale features of the image dataset. A number of stacked GAN models can be trained on e.g. low-resolution medical-CT data and high-resolution micro-CT allowing incorporation of spatial information across multiple length scales Zhang et al. (2016).
Once the GAN model has successfully learned to create physically representative samples of the porous medium, one possible application is to evaluate the variability in the flow properties by evaluating the properties of a large number of samples. This not only requires a physically valid representation of the porous medium but also requires a method that allows fast image reconstruction. In Sec. V we have shown that training was performed for approximately 24 hours and may vary due to the need for manual inspection of the generated samples in the training process. Figure 20 shows the CPU time required for generation of images at increasing image size. The fully convolutional nature of the GAN architecture allows very large images, exceeding the size of the original sample to be generated very efficiently and at low computational cost and runtime.
While training requires considerable time and computational resources in the form of modern graphics processors as well as optimized neural network frameworks, image reconstruction requires little computational effort and scales linearly in the total number of voxels of the generated images. This, therefore, enables the generation of ensembles of large domains based on volumetric images acquired from 3D microscopy, that capture the physical behavior of the porous medium. The learned representation of the generator consists of the weights of the convolutional filters learned in the training process and can, therefore, be stored for future use once training has finished.
We have evaluated the application of generative adversarial neural networks (GAN) for stochastic image reconstruction of porous media based on previously acquired images of sedimentary rocks. Three image datasets were used as training images: a beadpack, a Berea sandstone, and an oolitic Ketton limestone.
By evaluating two-point statistical measures, image morphological features and computing the single-phase effective permeability we have shown that the synthetic images generated by the GAN model are able to capture the characteristic statistical and physical behavior of these porous media.
While a large computational effort is required to train the GAN model, the generation of samples from the learned representation is highly efficient and learned models are easily stored for future use.
Future work in the application of GANs to stochastic image reconstruction of porous media will include improving the quality of the image reconstruction by evaluating various generator-discriminator architectures, the use of grayscale and multi-channel training images, as well as the application of large multi-scale domains of porous media to evaluate the ensemble behavior of single and multi-phase flow properties in porous media. Recent advances in the understanding of GANs should lead to a more stable and consistent training process Mao et al. (2016); Arjovsky et al. (2017).