I Introduction
Technological advances, such as highbrightness Xray photon sources [1, 2, 3] and highrate highsensitivity detectors [4, 5], enable new discoveries by means of Xray Photon Correlation Spectroscopy (XPCS) [6]. Nonequilibrium dynamics are signature of many systems studied with XPCS, including superconductors [7], metallic glasses [8], proteins [9] and polymeric nanocomposites [10]. Such dynamics are commonly represented via twotime intensityintensity correlation functions (s) [11], defined by the expression:
(1) 
where
is the intensity of a detector pixel corresponding to the wave vector
at time . The average is taken over pixels with equivalent values. The quantitative analysis of a nonequilibrium (Fig. 1(A)) often starts with its binning along the sample’s age axis into quasiequilibrium regions [12, 13] prior to making cuts along the time delay axis and averaging them, i.e. obtaining ‘aged’ onetime intensityintensity correlation functions (s)[14] of delay time:(2) 
The reason for binning and averaging is increasing the accuracy of further analysis. The resulting s are fit to a functional form that is characteristic for the dynamics under investigation.
The traditional analysis of experimental data with nonequilibrium dynamics presents several challenges. Selection of the quasiequilibrium regions is often a nontrivial and timeconsuming task, which requires interventions from a human researcher for a visual inspection of the calculated and intermediate results. Moreover, the temporal resolution of the evolution of dynamics’ parameters deteriorates due the binning procedure. Besides, with increasing data collection rates, it often becomes unfeasible to properly do onthefly data analysis during XPCS experiments when such analysis requires inputs from a researcher. This can lead to inefficient use of beam facilities and the efforts of scientific stuff as well as to unfortunate omission of important parameter space regions during measurements. The sheer volume of collected data can reach hundreds of thousands of s a day, making even the offline data processing a challenging task. Automating the analytical routine is an inevitable step towards highthroughput autonomous XPCS experiments. Ability to perform the analysis with experimental framerate resolution is the ultimate target.
Since the motivation behind binning and averaging the data is to curtail the impact of noise, preprocessing a with a noisereducing algorithm would increase the chance of achieving a framerate resolution along during quantitative analysis. Previously, we demonstrated an approach for noise reduction in s with equilibrium dynamics, which is based on Convolutional Neutral Network–EncoderDecoder model (CNN–ED) [15]. The method is shown to work well for equilibrium data and has a promising potential for use with slowly ageing dynamics when applied as a sliding window along . However, there are two major concerns regarding the use of the model for general cases of nonequilibrium dynamics. Firstly, if the rate of the dynamics changes fast, an equilibrium approximation may not be valid, even in narrow age regions. Thus, the application of the model would lead to loss of experimental resolution and extraction of less accurate results. Secondly, for a containing a much larger number of frames than the input size of the model, a considerable portion of it would still contain the original level of noise after the model is applied along .
Here, we build upon the previous model to address the reduction of noise in arbitrarysized s with nonequilibrium dynamics such as ageing. We further demonstrate how the new model, integrated into a workflow for quantitative analysis of XPCS data, eliminates the need of age binning and allows wider parameter bounds for the fit of
s. The results of such analysis have temporal resolution, which is limited only by the experimental acquisition rates. The methods for estimating the creditability of the results – uncertainty scoring for denoised
s and trust regions for dynamics’ parameters – are discussed. Various options for analysis workflows are demonstrated for several XPCS experiments.Ii Methods
ii.1 Data
For a denoising model to perform well for nonequilibrium s, data from experiments with these types of dynamics are used for training. The data are collected from 65 XPCS experiments at the CHX beamline of NSLSII from three different samples, recorded at different conditions and with different detector collection rates. Some of the measurements are repeated multiple times at the same conditions. The samples exhibit dynamics, common for many materials, that at various rates monotonically accelerate with age, monotonically decelerate with age or stay at quasiequilibrium. For all considered dynamics, individual s can be approximated with the KohlrauschWilliamsWatts (KWW) form:
(3) 
where is the rate of the dynamics, is the contrast factor, is the compression constant and is the baseline. All parameters are functions of and . Several regions of interest in the reciprocal space are considered for each experiment when calculating s. There is a total of 492 s, ranging from 134 to 2950 frames. The original fullsized s were split between training (454) and validation (38) sets prior to generating the inputs to the model and augmenting the data as described below.
Inputs are obtained from raw experimental s by cropping the 100100 frames pieces along the in a similar way as was done previously [15]. Prior to cropping, the diagonal values, containing high errors, are replaced by the average of their nearest offdiagonal neighbors. That is, the value at [j, j] is replaced by the mean of the values at [j1, j] and [j+1, j]. This approach preserves the information in the experiments with slowly varying contrast. For data augmentation, additional inputs are constructed by considering every 2^{nd}, 3^{rd}, 4^{th} and 5^{th} frame of the original data when enough frames are available to form at least one 100
100 example. The final size of the training and validation sets are 25222 and 1557 cropped examples respectively. Each input is scaled to have zero mean and unit variance prior to passage through the model. A reverse transformation is performed for the model output.
ii.2 Denoising Autoecoder Model
The CNNED model architecture, used for equilibrium data, demonstrated several advantages, such as simplicity, control of overfitting as well as fast training and application. Trying the same model architecture with certain adjustments for nonequilibrium data is a natural choice. The model presented here consists of an encoder with two 10channel convolutional layers, 8dimensional latent vector and the decoder with two 10channel transposeconvolutional layers. Two modifications are implemented during the model training to meet the peculiarities of nonequilibrium dynamics. Firstly, since a ‘noisefree’ version of an input cannot be obtained by averaging multiple cropped inputs collected from the same fullsized , the model is trained in an autoencoder (AE) mode with an input and its target being the same. Secondly, a nonequilibrium cannot be described by a single
and thus only the mean square loss between the output and the target are chosen as the training loss function.
Despite the increased complexity of dynamics in the training data, the architecture still appears appropriate for the denoising task based on the model’s good performance for the validation set [see Appendix D, E]. The effectiveness of models with convolutional kernel sizes from 1 to 17 is tested and only minor improvements in the smoothness of the output and the level of finite details preserved for larger kernels are discovered. The models with larger kernels take longer to train and the computational time during their application also becomes significant. Further in the text, unless stated otherwise, a model with the kernel size 1 is considered. The fact that the same model architecture works well for both equilibrium and nonequilibrium ageing s confirms it robustness for different types of dynamics.
ii.3 Adapting Model to Arbitrary Input Size
Since the model has a fully connected layer (the latent vector), it can only be applied to a fixedsize input, i.e. 100100 frames. However, the dimensions of an experimental can vary considerably. The symmetry of a prevents the model to be applied as a raster scan over the entire image. Application of the model in a sliding window fashion along the age axis can help with the noise reduction in certain cases like in out original quasiequilibrium analysis [15]. Nevertheless, this approach is not suitable for general cases of s, where dynamics may happen outside of the first 100 delay frames along the axis.
Figure 2 shows a possible solution, which is inspired by the procedure of data augmentation when the original is mapped down (downmapped) to a smaller size by considering every Nth frame horizontally and vertically, where N = 2, 3, 4, 5. For model application, N is a whole part of division of total number of frames in by 100. This way, the size of a newly constructed matches the required model’s input size. By its definition (Eq. 1), a is symmetric with respect to . However, when the starting point for downmapping is away from the axis (0) of , the reduced version is not symmetrical. To ensure the symmetry of the input, only the bottom part of the is considered and a reflection across the =0 line is applied to it. For all starting points, the values at =0 are extrapolated by the neighboring points in the same way it is done for model’s training. After application of the model, its output is mapped back (upmapped) to the size of the by placing [i, j]th point from the to [N*i, N*j]th point in the resulting denoised . Again, only the bottom half is filled and then reflected across the =0 line. The process is repeated for all possible starting points, covering fully the .
For regions of very fast dynamics and for largesized s, this downandup mapping approach may impair the experimental temporal resolution. In such situations, the values near the diagonal of can instead be processed in a slidingwindow fashion, when the model is successively applied to parts consisting of cropped 100100 frames equally distanced along ta, and the outputs are averaged. The results of the model’s application in the downandup mapping and sliding window fashion are combined to obtain the final denoised version of the original . This method reduces noise in the entire and preserves the experimental temporal resolution for both and .
ii.4 Uncertainty Quantification for Denoising
When applying a model to unseen data, there are three potential sources of uncertainty: error (noise) of the input, model’s bias and model’s variance. The bias of the model primarily originates from selection of the training dataset and constricting the model’s architecture. Variance appears due to random initializations of model’s weights and ordering of data butches during model’s training. Approaches for quantifying uncertainty for a deep learning model often involve random perturbations of the model during training and/or application and obtaining the distribution of respective outputs
[16].We estimate the variance of the denoising AE by considering deviations of outputs for models trained with different random initializations (see Appendix E). It appears that the model’s variance is generally much smaller than the corresponding values of . Moreover, the fluctuations of values in the neighboring points of a , caused by downandup mapping during model application, scale linearly with the inherent variance of the model. Thus, it is not necessary to separately estimate the model’s variance for experiments with more than 100 frames, as the pointtopoint variations in a already influence the quantitative analysis of its s.
From the practical point of view, the main potential source of uncertainty of the model is its bias. Obviously, a denoising AE must exhibit some bias to remove noise. Hence, to drive the decision about applicability of the model in each case, the quantification of the bias ought to answer the question: how certain can one be that the model’s output is a valid representation of the underlying sample’s dynamics in the input? Naturally, a model returns the most trustworthy results for the inputs that are very similar to the examples in the training set. We suggest two quantitative measures – uncertainty scores – for estimating the bias by comparing a new input to the training examples.
The first measure is based on the Euclidean distance between the latent vector representation of an input and the center of the latent representations of all training examples. The analysis of the pairwise distributions of the 8 latent coordinates of the training set reveals that they form a single compact cluster [see Appendix F]. The shape of the cluster is determined by the distribution of dynamics and the noise level in the training set. However, the values of each of the eight latent coordinates are mostly concentrated around a single point. The distance from this central point is thus indicative of how different a given input is from most of the training data and thus can be used as an uncertainty score. To ensure an equal contribution of each latent dimension to the uncertainty score, all coordinates are scaled to have a zero mean and unit variance across the training dataset. To improve the interpretability of the bias, the distance to the center is normalized by its median value among the training set examples. The normalization of the distance helps to set a general threshold for an acceptable level of the bias without reference to a particular model or the training set.
The second measure of bias of the denoising is less abstract and does not depend on the model architecture. It involves evaluation of trends in residuals, i.e., the differences between the model’s targets and corresponding outputs. When the output is a valid representation of the sample’s dynamics, the residual should resemble random and sometimes correlated noise in form of vertical and horizontal stripes, without trends along neither nor
. Since the training examples mostly represent monotonically ageing dynamics, the model tends to perform well for similar cases. A trend in a residual can indicate a heterogeneity on top of the ageing dynamics or a completely different type of dynamics. Such situations require additional attention during analysis. To identify the trends, it is convenient to look at the projection of a twodimensional residual to the vertical (or horizontal) direction. Without trends, the autocorrelation coefficients (ACCs) of this projection are close to zero. In contrast, when a trend is present, the absolute value of the ACCs grows. We calculate the first ACC for all the examples of the training set and calculate probability density distribution using Gaussian kernel density estimator [see Appendix
G]. The probability density for the first ACC of the residual for a new test example is the second type of uncertainty scoring of the model prediction.Since both bias measures are based on the distribution of the training examples, they are suitable for detecting anomalies – inputs that are different from the training set. The analysis workflow can flag a new as an anomaly if its uncertainty score is above the userdefined threshold.
ii.5 Extracting Dynamics Parameters
Quantitative analysis of XPCS experimental results with nonequilibrium dynamics, such as ageing, involves taking timeslices along the delay axis of a at different sample’s ages and fitting the resulting s with an appropriate model that describes the dynamics, which in many cases is the KWW form Eq. 3. Typically, several adjacent slices within age range
need to be averaged to target a signaltonoise ratio that is sufficient for extracting dynamics parameters with reasonable certainty. It is necessary that the dynamics do not change much within
. An example of selecting a quasiequilibrium bin within a is shown in Fig. 1. This approach leads to a loss of temporal resolution for the parameters. Moreover, the procedure of binning s often requires an repetitive evaluation of the fitting results and adjustment of bins’ boundaries by an expert researcher. For a large volume of collected data, it becomes challenging to properly split s for each experimental region of interest (ROI), leading to growing uncertainty of the results. Ability to perform an adequate quantitative analysis for a single value of across the entire experiment would facilitate the automation of the analysis process. The highest temporal resolution and maximum usage of experimental data are achieved when considering cuts with = 1 (or, ).We compare the possibility of conducting a quantitative analysis of a and a corresponding while considering cuts with = 1 without a prior knowledge of the dynamic’s parameter ranges (Fig. 3). An example of a noisy (350 frames) from the validation set (Fig. 3A) is passed through the model, resulting in the with a significantly reduced level of noise (Fig. 3BC). The latent space representations of the 100100 frames are close to the center of the training set (Fig. 3D), indicating that the model output is likely a valid representation of the dynamics in the input. To extract the parameters, each of the cuts is fitted to Eq. 3. Cuts with less than 5 points are not considered. The results of the fit are shown in Fig. 3(EH). The fits for the cuts from the with = 35 are provided as ground truth values. Apparently, high noise in the does not allow extracting meaningful information about temporal evolution of dynamics parameters without restricting the fit parameters within narrow regions or increasing the bins’ width. The , on the other hand, produces clear, slowly evolving trends, matching the ground truth values.
While binning and averaging s reduces the noise, it is not a universal solution because material’s dynamics can vary considerably within an age bin, and therefore, a single set of parameters cannot describe them. For the example shown in Fig. 3, this becomes apparent when considering averaging all available s and fitting the result to Eq. 3. Since and are not changing in this experiment, the resulting values for these parameters are close to the ground truth values. However, the values for and are not close to their corresponding average values.
For an automatic analysis, it is important to flag the results that cannot be fully trusted, even when the goodnessoffit to Eq. 3 is acceptable. If dynamics are not fully captured by the experiment, parameters of the Eq. 3
become mutually dependent and the outputs of nonlinear regression can be misleading because several different sets of the dynamics’ parameters can result in almost equally good fits. Therefore, we introduce the concept of
trust regions for the parameters. A trust region of a parameter is a binary vector with the length equal to the number of bins along , which indicates whether the parameter is likely to be reliably identified within each bin. The binary values are determined by several criteria including the rate of the dynamics, correlations between parameters and parameters’ relative errors.When a material’s dynamics are slow with respect to the duration of the experiment, the baseline cannot be reliably identified. Likewise, the true value of cannot be extracted if the dynamics are not fully captured by the experiment due to them being either too fast or too slow. The quantitative measure for establishing respective thresholds for dynamics’ rate is a halftime, i.e., the time it takes for the contrast to drop by half:
(4) 
For slow dynamics, when the halftime is larger than a userdefined portion of ’s length, the trust region values for and are set to zero: not trustable. In case of fast dynamics, when the halftime is less than a certain number of frames, the trust region for is set to zero. However, the can be reliably extracted for fast dynamics and hence its trust region values are set to one. Similarly, thresholdbased conditions are defined for the correlation coefficient between parameters, relative errors of the parameters and measure of the fit.
Iii Results
iii.1 Analysis Workflow
Depending on the application, there are various ways to design an analysis workflow that includes s and s as well as any prior knowledge about the material’s dynamics. An example flowchart for extracting dynamical parameters from a is shown in Fig. 4. In this workflow, a is used for flagging unusual observations and narrowing parameter boundaries based on the initial fit. The new boundaries are then used to fit the and the . The last step of the analysis results in two sets of parameters with corresponding errors and trust regions: one for the and one for the . Stepbystep dynamics extraction algorithms are provided in Appendix H.
Naturally, when reporting experimental outcomes, the fitting results of unprocessed have a higher priority with respect to the fitting results of because signal processing, such as noise removal, introduces an uncertainty. According to the presented analysis workflow, can be used for evaluating the parameters’ boundaries for fitting the raw data during the second pass with the same or greater bin width. However, it is also possible to supplement the fitting results for with fitting results for for regions in parameter space where the low signaltonoise ratio in raw data prevents from obtaining meaningful parameter values. Note that removing the noise is just an additional step in extracting parameters from the raw experimental data. Therefore, the fitting results for can be tested against with goodnessoffit measures such as . Alternatively, one can use the fitting results for to select quasiequilibrium regions for the analysis of .
Upon calculating the uncertainty score for the , one may wish to extend the workflow to investigate possible dynamic heterogeneities of the sample by relying on the denoised output and the results of the s’ fit. The bias of the denoising model provides the opportunity to separate the average ‘envelope’ dynamics and stochastic heterogeneities by subtracting either the or the , computed based on the fit parameters for the , from the original .
iii.2 Application to New Data: Standard Analysis
We test a workflow for XPCS analysis with the use of denoising AE on previously reported data for 3D printing with solution of Lithium Titanate particles [17]. The experiment is not a part of the training or the validation datasets used in the current work. However, the farfromequilibrium dynamics of the ink, exhibited during its deposition and recovery, are similar to the types of dynamics used during the models’ training. Thus, the experiment presents an intended use case for the denoising AE.
There are several factors that limit the signaltonoise ratio in this experiment: operando character of the measurements, beamsensitivity of the ink and anisotropy of the dynamics, which requires selection of small ROIs in the reciprocal space for analysis. In the original work, unevenly spaced age bins of various width have been selected to obtain good quality ‘aged’ slices [17]. Here, we test the advantages of applying the denoising AE for conducting the same analysis with bin width = 1 .
Figure 5 provides an example of the denoising AE being applied to the for one of the ROIs. Since the dynamics is relatively fast, the points near the axis are denoised when the model is applied in a sliding window fashion and the points away from are denoised via downandup mapping approach. The difference in level of noise for and can be seen from corresponding s taken at different ages. Bin width of 1 frame has been considered for and resulting are fitted to Eq. 3 to obtain . for bin widths 1 and 100 are compared against the same . As expected, the fit to denoised version of correlation function describes the raw data well. As the width of the bins increases, the values of lie closer to the corresponding lines. The model captures well not only the characteristic time of the signal decorrelation, but also the change in the contrast factor , caused by initial fast nonergodic dynamics with timescales outside of the experimental time window.
The sample’s dynamics for multiple ROIs is quantified according to the scheme in Fig. 4. We calculate the drift velocity from relaxation times at different wave vectors along deposition direction using only denoised s and compare them to the values obtained in Ref. [17], as shown in Fig. 5G. The values obtained from are very close to results of the original analysis, confirming the denoising AE does not distort the data. Instead, it eliminates the need to carefully select the age bins and ensures the ultimate temporal resolution of a single acquisition period.
iii.3 Application to New Data: Outlier Detection
By using latent space representation of a denoising AE, it is possible to quantitatively compare s from the same group of measurements even if the model was not exposed to this type of dynamics during training. This is suitable for anomaly detection in series of consecutive XPCS measurements for the same material. We test the application of the model for instabilities detection during data collection for static dynamics of a LSAT sample (MTI Corporation) at the CSX beamline of NSLSII. Six data series, 7200 frames (exposure 1.0875 seconds) each are purposefully collected during synchrotron accelerator testing to introduce a disturbance of experimental conditions. Three of the series are collected in a stable beam regime, one series is recorded with small disturbance of the beam energy and two series are recorded during strong beam energy disturbance. From each series, 72 model’s inputs (size 100100 frames) are generated by considering every 72^{nd} frame with different starting points. Each of the downmapped inputs from a single series contain the same information, but different noise. To make the comparison sensitive to the mean value and variance of the contrast, the inputs are not scaled, but are clipped within values of 1 and 2. The values at = 0 are extrapolated with the neighboring values as it is done for a routine model application.
Figure 6 shows the representations of stable and unstable series in the latent space. Inputs formed from the same fullsized are located in close proximity of each other, forming a tight cluster. This confirms the contractive property of the denoising AE  the fact that similar inputs are close in the latent space. Having multiple s representing the same dynamics through downmapping procedure allows estimating the characteristic size of the cluster, to which all other points in the latent space should be compared to. The means and variances of contrasts for the inputs collected from experiment with a small disturbance are similar to the undisturbed series, resulting in a small distance in the latent space, comparable to the size of the clusters. However, when the disturbance is so strong, that the beam is lost for part of the series, the latent space representations of a are much further away from the undisturbed series. In fact, the strongly disturbed series form a separate cluster, where points are much more spread out than for stable series. Selecting a threshold distance between points helps to filter out the measurements taken at very unstable experimental conditions. Thus, it is possible to identify unusual observations in a set of XPCS series by looking into the denoising AE’s latent representation of s and comparing the distance between series.
Iv Discussion
Analysis of XCPS data is a multistep, often iterative, process that requires continuous evaluation of intermediate results by a domain expert. Large volumes of observations and subjectivity in alleviating low signaltonoise ratio complicate proper extraction of valuable information from experiments. Computationally reducing noise in s as a data processing step helps to achieve several goals: automating the analysis workflow, improving temporal resolution of parameters that quantify the system’s dynamics and increasing quantitative usage of data with high cost of collection.
Here, we demonstrate how a denoising AE model can be included into analysis workflow of XPCS data for experiments with ageing dynamics. Quantification of the model’s bias helps driving the decision about the use of its outputs as well as flagging unusual observations, such as heterogeneities or dynamics that are very different from ageing. The denoised correlation function can be used for complementing the fit of the raw data. The concept of trust regions combines the assessment of the fit quality as well as domain expertise, which helps to not only report the most reliable results, but to automate sequential narrowing of the fit parameters boundaries.
The model’s performance for unseen 3D printing data (not included in the training/validation datasets) demonstrates the advantages of its application to actual XPCS experiments. For measurements where the material’s dynamics are similar enough to the model’s training dataset, the analysis does not require a humanintheloop after all the thresholds are selected prior to analysis. Moreover, the analysis is suitable for autonomous data acquisition. It provides values of dynamics parameters, which are important for making decisions about adapting data acquisition parameters such as acquisition rate, exposure time, duration of data acquisition or about changing the sample and/or the processing parameters.
We further demonstrate how encoded representations of a
can be used for quantitative comparison of two or more scattering time series, which can be useful for identifying anomalies such experimental instabilities or phase transitions. The comparison can even be done for the types of dynamics, not present in the training set.
In conclusion, in this work we demonstrate how a CNNbased denoising AE can be used for s with nonequilibrium dynamics for experimental scattering series of arbitrary size. Addition of the denoising AE to XPCS analysis along with quantifying uncertainty helps automating the analysis and improves temporal resolution of extracted parameters. Besides, analysis of residuals and latent space representations of the inputs helps detect anomalous dynamics that go beyond monotonic ageing. This property can be employed for recognizing heterogeneities or phase transitions. Several examples of incorporating the denoising AE in analysis workflows for actual experimental data, collected at NSLSII, demonstrate its effectiveness for unseen data and diversity of its applications. Denoising and encoding properties of the model are auspicious for various online and offline data analysis tasks at XPCS facilities.
Acknowledgements.
This research used the CHX and CSX beamlines and resources of the National Synchrotron Light Source II, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Brookhaven National Laboratory(BNL) under Contract No. DESC0012704 and under a BNL Laboratory Directed Research and Development (LDRD) project 20038 ”Machine Learning for RealTime Data Fidelity, Healing, and Analysis for Coherent Xray Synchrotron Data”Appendix A Denoising Model Training
The denoising model is written in Pytorch (version 1.9.0+cuda10.2). The model is trained using CUDA accelerator Nvidia GeForce RTX 2070 Super. Early stopping is used for model overfitting. Aside a few changes, the same model architecture and training procedure is used as described in
[15].The model is trained in the autoencoder regime, i.e. each input serves as its target. The cost function is the mean square loss between the model’s output and its target(input). Adam [18] optimizer is used for training. Its starting learning rate of 1e
(batch size 8) is decreased at each epoch by a factor 0.9995.
One training epoch takes from 28 seconds (kernel size 1) to 320 seconds (kernel size 17). 2030 epochs are required for training, depending on random initialization of the model’s weights and the order of training examples supplied to the model.
Appendix B Fitting a
Each individual is fit to Eq. 3 using nonlinear regression implemented in Scipy package [19] according to the Trust Region Reflective algorithm [20]. Generally, when a is calculated for a singleframewide cut of a
, each of its points is given equal weight during the fit and when severalframeswide cuts are considered, the weights are assigned inversely proportionally to the standard errors of correlation function values at each
point.The optimization algorithm for nonlinear regression is sensitive to the initial guess of the parameters. The initial values for and can be estimated from characteristics of the experimental setup and test measurements. The algorithm seems to be robust with respect to the initial value of , which is set to 1. However, a proper initial guess of is important and values can vary significantly between experiments and ROIs in the reciprocal space. To automate selection of the initial guess for , several values are attempted and the fit with the smallest is returned. The values range between 0.01 and 1 fr. If neither of the values results in better fit than a line , then the error (‘not a number’) is returned for all parameters.
It is common to have a situation when a signal decorrelates within a small delay and most of the points along the axis in a are at the baseline level. In such case, points at the tail of the have high leverage, but carry little information regarding the dynamics’ rate, the contrast factor and the compression constant. The equidistant delay points in s obtained through cuts of a contrast with logarithmicspaced delay points of s, calculated directly from the speckled images using the traditional multitau algorithm [21]. To reduce the influence of the points at the tail of the , lower weights can be assigned to them during the fit with nonlinear regression. For implementing this, at the first pass, the data are fit to Eq. 3 with points weighted either equally or according to standard errors, as described above. Based on the results of the first fit, the points for which the fitted values are larger than are assigned the weight of 1 during the second fit and the rest of the points are assigned the weight . The same data are fit for the second time to Eq. 3 with the fixed value from the previous fit and the new points’ weights. Both weighted and unweighted results are returned by the fitting procedure.
The nonlinear regression algorithm returns the parameter values as well as their covariance matrix. The values of and for the best fit are used for calculating the halftime (Eq. 4) and establishing initial trust regions. Square roots of variances (the diagonal elements of the covariance matrix) are attributed to the parameters’ errors. The offdiagonal elements are converted to correlations. Optionally, the parameters’ errors and correlations and the values are used for narrowing the trust regions based on userdefined thresholds.
Appendix C Output Files Formats
and uncertainty scores are saved as datasets in a hdf5 [22] file, which allows accessing data (loading into memory) in parts. Parameters, parameter’s errors, parameter’s correlations and the of the fits at each age are recorded as nested dictionaries into a json [23] file. Besides, the parameter boundaries and the initial guesses are also saved in a json file.
Appendix D Different Kernel Sizes
Comparison of the denoising AEs with different convolutional kernels shows that models with larger kernel sizes have less bias and closer resemble the dynamics even if they slightly deviate from the KWW form. Examples of model outputs for the validation data are shown in Fig. 7 and Fig. 8. The figures show the model output for the s with 300 frames, rather than for single 100frame inputs (see the main text for details of applying the model to an arbitrarysized input). Generally, with increasing the kernel size of the model, the trends in the residuals become less pronounced.
However, the difference between the models’ outputs is generally not significant for quantitative analysis, which involves further fitting of the s. We compare the error of parameters extracted from the s passed through the models with different kernels for all fullsized s from the validation set. For this, we fit singleframewide cuts of a and its denoised version in an iterative process according to Fig. 4. The initial fitting of is used for establishing parameter boundaries for the contrast factor and the baseline , applied during the second fit. The ground truth values are obtained by separately fitting 20frameswide cuts of
and interpolating the results of the fit across the entire range of ages. The ground truth fits involve human intervention for establishing the parameters’ boundaries.
One is removed from consideration because its dynamics are too fast to be captured with the downandup mapping approach. Two other s are removed from consideration because the trust regions for from the first fit of contained less than two points and thus the parameter boundaries could not be established. Those s contained very slow dynamics. Overall, 8100 singleframe s have been fit for each kernel value.
The distribution of errors for the models with kernels 1, 3 and 17 is shown in Fig. 9. For the relative error (RE) is considered since dynamics’ rate can be in a broad range, including values close to zero. The absolute error is considered for (AE), (AE) and (AE). Table 1 shows the comparison of median values of the errors. The analysis procedures involving all models produce comparable errors.
kernel  RE(model)  AE(model)  AE(model)  AE(model) 
1  0.133  0.0076  0.0023  0.21 
3  0.135  0.0081  0.0026  0.23 
7  0.127  0.0075  0.0023  0.21 
11  0.121  0.0069  0.0023  0.21 
17  0.132  0.0075  0.0025  0.22 
kernel  RE(raw)  AE(raw)  AE(raw)  AE(raw) 
NA  0.195  0.0088  0.0025  0.35 
The errors produced by fitting have larger median errors for both RE and AE, even when the parameter bounds for and are narrowed. The difference in errors for and between fits for and is not significant because in both cases the parameter boundaries were established using at the previous step.
When considering denoising AEs with different kernel sizes, one needs to account for the computational time. The comparison of computational times for denoising models with different kernel sizes is shown in Fig. 10. These times are measured for a single 100frame
input (average of 50 repeated measurements). Depending on the acceleration hardware and the need to transfer data to/from a CPU/GPU Pytorch Tensor, the difference between computation times for models with kernel=1 and kernel=17 can be 60 times (CPU) or can be nonexistent (GPU, without transfer).
Appendix E Model Variance
We estimate the variance of a denoising AE model by applying an ensemble of 9 models trained with different random initializations to examples from the validation set. For each fullsized from the validation set, a reduced version is generated by considering each 3rd frame horizontally and vertically, starting at point [0,0], and then cropping the first 100x100 frames part. The resulting is passed to each of the models in the ensemble and the framewise variance () is calculated. That is, each of the 100100 frames input produces variances. We refer to these variances as generated via drawing from model space.
When a single denoising AE model is applied to a with 200 or more frames, the downandup mapping process also introduces variance between neighboring values. The difference in underlying dynamics represented in the downmapped originating from starting points [i, i] (i, j = 0,1,2) is often negligible. Therefore, the downmapped s can be viewed as an approximation of realization of the same dynamics. Thus, it is possible to estimate the variance of a model via drawing from input space by applying the model to each of such realizations and estimating the variances of the outputs. Here, we estimate the variance by considering inputs, obtained from the s from the validation set by considering each 3^{rd} frame horizontally and vertically, starting at points: [0,0], [1,1], [2,2] and then cropping the first 100100 frames part from each of the downmapped . A single model is applied to each of the 3 inputs and the framewise variance is calculated.
As a result, for each example in the validation set the framewise variance is calculated by drawing from both model space and input space. The variances, calculated by both methods, are comparable. (Fig. 11) The variance values are considerably smaller than both the noise in the original raw inputs and the typical values of contrast factor , making the model’s variance negligible for the quantitative analysis.
Moreover, there is a linear dependence between the variances calculated by the two methods (the correlation coefficient is 0.73), indicating that the model variance typically is already reflected in the downandup mapping process of the model application. Therefore, it is not necessary to separately estimate the variance of the model using an ensemble method.
Appendix F Latent Coordinates Distribution
Latent space coordinates can be used for applying distancebased similarity measures for the model inputs. The model is likely to perform well for new inputs that are similar to the training set examples. The pairwise distributions of the latent coordinates for the training set (Fig. 12) reveals that the majority of the data is approximately centered around a single point. As a result, the measure of similarity between a new input and the training set can be expressed as the Euclidean distance from the input to the central point of the training set coordinates distribution. Prior to calculation of the distance, the coordinates are standardized using the mean values and variances for each coordinate calculated for the training set.
Appendix G Quantifying Bias With Residuals
For general applications, investigating trends in residuals is a common test for a model’s bias. If a residual resembles a ‘random’ noise, the bias of the model is low. In an opposite case of trends in the residual, the model does not fully represent the process in the input, i.e., it exhibits a bias.
For a denoising AE the situation is not straightforward and depends on the perception of noise for each application. For a , in cases of low detector count or framebyframe instabilities, the residual for the denoised output should not have any trends (Fig. 13). However, in cases when an average, ‘envelope’, dynamics is under interest, the dynamics heterogeneities are treated as noise, leaving a pattern in the residual (Fig. 14), that can be investigated separately. It is convenient to measure noise using autocorrelation coefficient (ACC) at lag=1 of residuals, projected at one of the time axes .
Just like the latent space coordinates, the ACC of residuals for each new example can be compared to the analogous values for examples in the training set. This helps to understand, what values of ACC are unusual. For quantifying the comparison, the probability density function (Fig.
15) is estimated for ACC values of the training set and is calculated for each new test example.Appendix H Examples of workflows for Online and Offline Data Analysis
Algorithm 1 (online analysis):

apply denoising AE to the to obtain

calculate an uncertainty score through latent space and/or residuals

unless observation is unusual:

do the first round of fit for (optionally with wider time cuts) using wide parameter boundaries

calculate new parameter boundaries based on values within trust regions

if needed, repeat the fits for for singleframes cuts with new parameter boundaries


if the observation is unusual:

use and widebin s for the fits


report results of the fit and trust regions
Algorithm 1 (offline analysis):

apply denoising AE to the to obtain

calculate an uncertainty score through latent space and/or residuals

based on the uncertainty score, alarm about an unusual observation if encountered

unless observation is unusual:

do the first round of fit for (optionally with wider time cuts) using wide parameter boundaries

calculate new parameter boundaries based on values within trust regions

fit both and with new parameter bounds


if the observation is unusual:

use and widebin s for the fits

report results of the fit and trust regions

References
 Bergmann et al. [2009] U. Bergmann, J. Corlett, S. Dierker, R. Falcone, J. Galayda, M. Gibson, J. Hastings, B. Hettel, J. Hill, Z. Hussain, et al., Science and technology of future light sources: A white paper, Tech. Rep. (Stanford Linear Accelerator Center (SLAC), 2009).
 Couprie [2014] M. E. Couprie, New generation of light sources: Present and future, Journal of Electron Spectroscopy and Related Phenomena 196, 3 (2014).
 Vartanyants and Singer [2016] I. Vartanyants and A. Singer, Synchrotron light sources and freeelectron lasers, edited by ej jaeschke, s. khan, jr schneider & jb hastings (2016).
 Grybos et al. [2015] P. Grybos, P. Kmon, P. Maj, and R. Szczygiel, 32k Channels readout IC for single photon counting detectors with 75 m pitch, ENC of 123 e rms, 9 e rms offset spread and 2 % rms gain spread, in 2015 IEEE Biomedical Circuits and Systems Conference (BioCAS) (2015) pp. 1–4.
 Llopart et al. [2002] X. Llopart, M. Campbell, R. Dinapoli, D. San Segundo, and E. Pernigotti, Medipix2: A 64k pixel readout chip with 55/spl mu/m square elements working in single photon counting mode, IEEE transactions on nuclear science 49, 2279 (2002).
 Zhang et al. [2018] Q. Zhang, E. M. Dufresne, and A. R. Sandy, Dynamics in hard condensed matter probed by xray photon correlation spectroscopy: Present and beyond, Current Opinion in Solid State and Materials Science 22, 202 (2018).
 Ricci et al. [2020] A. Ricci, G. Campi, B. Joseph, N. Poccia, D. Innocenti, C. Gutt, M. Tanaka, H. Takeya, Y. Takano, T. Mizokawa, et al., Intermittent dynamics of antiferromagnetic phase in inhomogeneous ironbased chalcogenide superconductor, Physical Review B 101, 020508 (2020).
 Evenson et al. [2015] Z. Evenson, B. Ruta, S. Hechler, M. Stolpe, E. Pineda, I. Gallino, and R. Busch, Xray photon correlation spectroscopy reveals intermittent aging dynamics in a metallic glass, Physical review letters 115, 175701 (2015).
 Begam et al. [2021] N. Begam, A. Ragulskaya, A. Girelli, H. Rahmann, S. Chandran, F. Westermeier, M. Reiser, M. Sprung, F. Zhang, C. Gutt, et al., Kinetics of network formation and heterogeneous dynamics of an egg white gel revealed by coherent xray scattering, Physical Review Letters 126, 098001 (2021).
 Yavitt et al. [2020] B. M. Yavitt, D. Salatto, Z. Huang, Y. T. Koga, M. K. Endoh, L. Wiegart, S. Poeller, S. Petrash, and T. Koga, Revealing nanoscale dynamics during an epoxy curing reaction with xray photon correlation spectroscopy, Journal of Applied Physics 127, 114701 (2020).
 Brown et al. [1997] G. Brown, P. A. Rikvold, M. Sutton, and M. Grant, Speckle from phaseordering systems, Physical Review E 56, 6601 (1997).
 Madsen et al. [2020] A. Madsen, A. Fluerasu, and B. Ruta, Structural dynamics of materials probed by xray photon correlation spectroscopy, Synchrotron Light Sources and FreeElectron Lasers: Accelerator Physics, Instrumentation and Science Applications , 1989 (2020).
 Bikondoa [2017] O. Bikondoa, On the use of twotime correlation functions for xray photon correlation spectroscopy data analysis, Journal of applied crystallography 50, 357 (2017).
 Madsen et al. [2010] A. Madsen, R. L. Leheny, H. Guo, M. Sprung, and O. Czakkel, Beyond simple exponential correlation functions and equilibrium dynamics in xray photon correlation spectroscopy, New Journal of Physics 12, 055001 (2010).

Konstantinova et al. [2021]
T. Konstantinova, L. Wiegart, M. Rakitin, A. M. DeGennaro, and A. M. Barbour, Noise reduction in xray photon correlation spectroscopy with convolutional neural networks encoderdecoder models, Scientific Reports
11, 14756 (2021).  Abdar et al. [2021] M. Abdar, F. Pourpanah, S. Hussain, D. Rezazadegan, L. Liu, M. Ghavamzadeh, P. Fieguth, X. Cao, A. Khosravi, U. R. Acharya, et al., A review of uncertainty quantification in deep learning: Techniques, applications and challenges, Information Fusion (2021).
 Lin et al. [2021] C.H. Lin, K. Dyro, O. Chen, D. Yen, B. Zheng, M. T. Arango, S. Bhatia, K. Sun, Q. Meng, L. Wiegart, et al., Revealing mesostructure dynamics in additive manufacturing of energy storage via operando coherent xray scattering, Applied Materials Today 24, 101075 (2021).
 Kingma and Ba [2014] D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980 (2014).
 Virtanen et al. [2020] P. Virtanen, R. Gommers, T. E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, P. Peterson, W. Weckesser, J. Bright, S. J. van der Walt, M. Brett, J. Wilson, K. J. Millman, N. Mayorov, A. R. J. Nelson, E. Jones, R. Kern, E. Larson, C. J. Carey, İ. Polat, Y. Feng, E. W. Moore, J. VanderPlas, D. Laxalde, J. Perktold, R. Cimrman, I. Henriksen, E. A. Quintero, C. R. Harris, A. M. Archibald, A. H. Ribeiro, F. Pedregosa, P. van Mulbregt, and SciPy 1.0 Contributors, SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python, Nature Methods 17, 261 (2020).
 Branch et al. [1999] M. A. Branch, T. F. Coleman, and Y. Li, A subspace, interior, and conjugate gradient method for largescale boundconstrained minimization problems, SIAM Journal on Scientific Computing 21, 1 (1999).
 Wohland et al. [2001] T. Wohland, R. Rigler, and H. Vogel, The standard deviation in fluorescence correlation spectroscopy, Biophysical journal 80, 2987 (2001).
 The HDF Group [2010] The HDF Group, Hierarchical data format version 5 (20002010).
 Pezoa et al. [2016] F. Pezoa, J. L. Reutter, F. Suarez, M. Ugarte, and D. Vrgoč, Foundations of json schema (2016).
Comments
There are no comments yet.