1 Introduction
Recent advances in technology allowed collecting big data at high frequency (effectively continuous) rates that led to the ubiquity of functional data (samples of curves or surfaces) (Ramsay and Silverman (2005); Wang et al. (2016)). Handling such new complex data is essential in various applications, for example, in earth, environmental, ecological sciences, cosmology and image analysis. However, most of classical statistical models and results were developed for discretely sampled data.
Discretisation and corresponding additive models are often used as powerful dimension reduction tools in the analysis of functional data which are intrinsically infinite dimensional. The discretisation is a common strategy for approximating statistics of such data (see, for example, in Ramsay and Silverman (2005)). Also, in practice, the functional curves or surfaces are often observed only at a finite number of points.
Note, that various statistics of functional data can be expressed by integral functionals of these data or their transformations. For instance, some well-known examples of such statistics include sample moments and sample sojourn measures (Minkowski functionals) (see Leonenko and Olenko (2014) ). Another popular model in various applications (especially in engineering and signal processing) is stochastic processes that are obtained as outputs of filters, i.e. defined mathematically by a convolution integral operator.
Another example is functional linear regression models defined by weighted integral functionals. These models found numerous statistical applications in medicine, linguistics, chemometrics (see
In all above applications it is usually assumed that the discretisation error is negligible with respect to the estimation error. However, there are almost no known results that rigorously prove it. This paper addresses this problem and investigates discretisation errors for weighted functionals of long-range dependent spatial processes. Their rates of decay for the case of increasing observation windows are found. It is shown that both additive and integral functionals converge to the same limit distribution. It is proved that these distributions are non-Gaussian. These results provide a constructive method for determining the number of discretisation nodes for a given accuracy.
Various results in statistical inference of random fields were first obtained by Yadrenko (1983). Recently, considerable attention has been paid to asymptotic behaviour of non-linear statistics of random processes and fields (see Ivanov and Leonenko (2008); Bai and Taqqu (2013); Leonenko and Olenko (2014); Anh et al. (2019) and the references therein). Direct probability techniques were used to study these statistics, for example, in regression models. Asymptotic distributions of these statistics were discussed by
There are many practical situations in which non-Gaussian random processes and fields are appropriate for statistical modeling. We deal with an important class of models defined by non-linear functions of Gaussian random fields. This class is widely used in modeling non-Gaussian data. It can be analysed using Wiener chaos expansions that give good data approximations in many cases (see De Oliveira et al. (1997); Vio et al. (2001)).
This research deals with asymptotic behaviour of integral and additive non-linear functionals of random fields with long-range dependence. Long-range dependence is an empirical phenomenon which has been observed in different applied fields including cosmology, economics, geophysics, air pollution, image analysis, earth sciences, just to mention a few examples. For this reason, great effort has been devoted to studying models based on long-range dependent random fields
(see Ivanov and Leonenko (1989); Wackernagel (1998); Doukhan et al. (2002); Frías et al. (2008)). Weighted functionals of long-range dependent random fields were considered in Olenko (2013); Ivanov et al. (2013); Ivanov and Leonenko (1989).
These functionals can have non-Gaussian limits that are known as Hermite or Hermite-Rosenblatt distributions (Rosenblatt (1961); Taqqu (1975); Dobrushin and Major (1979); Taqqu (1979) ). Their asymptotic distributions can be characterised by either multiple Wiener-Itô integrals representations or characteristic functions (see
In various applications, it is natural to consider statistics of random fields and to study their limit behaviour over increasing spatial windows. In these cases, integrals of non-linear functionals of spatial functional data and additive non-linear functionals for discrete observations on a bounded region were studied in numerous papers (see, for example, Major (1981); Leonenko and Olenko (2014); Anh et al. (2015, 2019)). When we deal with the asymptotic behaviour of discretised functionals of functional data, it is important to know how asymptotics of these integrals are related to additive functionals. To the best of our knowledge only particular cases of this correspondence have been addressed in Leonenko and Taufer (2006) and Alodat and Olenko (2017) for rectangular observation windows. However, in many applications spatial data is not necessarily available over rectangles, but rather over irregularly-shaped regions (Cressie (1993); Lahiri et al. (1999)). Therefore, it is important to obtain theoretical results about asymptotics for general types of observation windows. In this paper we extend results of Leonenko and Taufer (2006) and Alodat and Olenko (2017) under more general conditions. More precisely, we consider weighted functionals of random fields of the form
where , is a long-range dependent random field, and are non-random functions, is an observation window and is a normalising factor. We show that these integrals and the corresponding discretised versions have same non-Gaussian limit distributions.
The article is organised as follows. In Section 2 we introduce main notations and outline necessary background from the theory of random fields. In Section 3 we recall some assumptions and auxiliary results from the spectral and correlation theory of random fields. In Section 4.1 we study the case of two-dimensional functionals. Section 4.2 gives a general multidimensional version of the results. Proofs are provided in Section 5. In Section 6 some simulations studies are presented to confirm theoretical results. Conclusions and directions for future research are presented in Section 7.
2 Definitions and Notations
This section provides basic definitions and notations that are used in this article.
In what follows , , and are used for the Lebesgue measure, the Euclidean distance in , the floor and ceiling functions, respectively. The symbols , and (with subscripts) will be used to denote constants that are not important for our discussion. Note, that the same symbol may be used for different constants appearing in the same proof. For a set , we denote by and the interior and the exterior of the set respectively. Moreover, it is assumed that all random variables are defined on a fixed probability space
We consider a measurable mean-square continuous zero-mean homogeneous isotropic real-valued random field , with the covariance function
It is well known that there exists a bounded non-decreasing function , (see Yadrenko (1983); Ivanov and Leonenko (1989)) such that
where is the Bessel function of the first kind of order .
The function is called the isotropic spectral measure of the random field . If there exists a function , such that
then the function is called the isotropic spectral density of the field .
The field with an absolutely continuous spectrum has the following isonormal spectral representation
(2.1) |
where is the complex Gaussian white noise random measure on
The Hermite polynomials , are defined by
The first few Hermite polynomials are
The Hermite polynomials , form a complete orthogonal system in the Hilbert space , where
is the probability density function of the standard normal distribution. An arbitrary function
(2.2) |
By Parseval’s identity it holds
Definition 2.1.
(Taqqu (1975)) Let . Assume that there exists an integer , such that for all , but . Then is called the Hermite rank of and is denoted by
Note, that by (2.1.8) in Ivanov and Leonenko (1989) we get and
(2.3) |
where is the Kronecker delta function.
Definition 2.2.
(Bingham et al. (1989)) A measurable function is slowly varying at infinity if for all ,
3 Assumptions and Auxiliary Results
This section gives some assumptions and results from the spectral and correlation theory of random fields that will be used in the following sections.
Assumption 3.1.
Let , be a homogeneous isotropic Gaussian random field with and the covariance function , such that and
where is a function slowly varying at infinity.
If , then the covariance function satisfying Assumption 3.1 is not integrable, which corresponds to the long-range dependence case (Anh et al. (2015)).
The notation will be used to denote a Jordan-measurable compact bounded set, such that , and contains the origin in its interior. Let , be the homothetic image of the set , with the centre of homothety at the origin and the coefficient , that is and . For and , we define and .
Theorem 3.1.
By Theorem 3.1 it is enough to study to get asymptotic distributions of . Therefore, we restrict our attention only to .
Assumption 3.2.
The random field , has the isotropic spectral density
where and is a locally bounded function which is slowly varying at infinity.
The function will be used to denote the Fourier transform of the indicator function of the set
Theorem 3.2.
(Leonenko and Olenko (2014)) Let , be a homogeneous isotropic Gaussian random field. If Assumptions 3.1 and 3.2 hold, , then for the random variables
converge weakly to
Here denotes the multiple Wiener-Itô integral with respect to a Gaussian white noise measure, where the diagonal hyperplanes
Below we present a limit theorem and the corresponding assumptions on the weight function in the integral functionals from Ivanov and Leonenko (1989). These results will be generalised in the subsequent sections. The obtained results on asymptotic equivalence of additive and integral functionals of random fields will be used to obtain limit theorems for the case of discrete observations.
Assumption 3.3.
(Ivanov and Leonenko (1989)) Let be a radial continuous function that is positive for and such that for
Let , where is from Assumption 3.2. In Section 2.10 in Ivanov and Leonenko (1989) the case when the function is continuous in a neighborhood of zero, bounded on and , was studied. It was assumed that there is a function such that
and
Under these assumptions the following result was obtained.
Theorem 3.3.
Note, that the above result is a specification of the results on convergence to stochastic processes in Ivanov and Leonenko (1989) where functionals over the observation windows , were studied. For simplicity, this paper deals with a particular case when . But the results of this paper can be easily extended to the case of general and convergence to stochastic processes on .
4 Main Results
4.1 Two-dimensional Case
In this section we consider integrals of two-dimensional random fields over a bounded increasing observation window . We show that the limit distributions of these integrals and their corresponding additive functionals coincide.
Our setup is as follows. We assume that the set can be represented as
where , , and , are smooth functions (i.e. , where is a class of functions with continuous first derivatives) except of the sets , where these functions have finite jumps. Here is the number of jumps of , see, for example, Figure 0(a). That is , , are jump points of the functions if but and both exist. Note that, by the homothety of , the set can be represented as where . As contains the origin in its interior, it follows that , , and as .
![]() |
![]() |
Let , be a real-valued homogeneous isotropic Gaussian random field satisfying Assumptions 3.1 and 3.1. We investigate the integrals
as , where is a non-random function such that when , and is a normalising factor.
We define the corresponding additive functional to , by
(4.1) |
where and , see Figure 0(b). It is assumed that
Example 4.1.
Figure 1(a) visualises a realisation of the long-range dependent Cauchy field over the set from Figure 0(a). This field satisfies Assumptions 3.1 and 3.1. The corresponding normal Q-Q plot of in Figure 1(b) was obtained by simulating times for the large value and . It is close to the asymptotic distribution and Figure 1(b) shows its departure from the Gaussian distribution.
![]() |
![]() |
Remark 4.1.
The functionals and include various important statistics. For example, the case of and corresponds to the sample mean estimator. High order sample moments can be expressed in terms of or by using the formula
Another important example, a level excess measure, can be found by using the Hermite series expansion (2.2) for the indicator function
where and are the cdf and pdf for respectively. The case of corresponds to weighted versions of the above statistics.
Let us define rectangles , as
Then, can be rewritten as
(4.2) |
where .
Assumption 4.1.
Let , be such that as and there exists a function such that for some uniformly for it holds
where and
Remark 4.2.
It follows from Assumption 4.1 that is bounded on .
Remark 4.3.
Note, that the conditions on the function in Assumption 4.1 are met by numerous types of functions that are important in solving various statistical problems, in particular, non-linear regression and M estimators. For example, the functions
(for some appropriate constants and ) can be chosen. The case of corresponds to the classical equally-weighted functionals and non-central limit theorems.
Remark 4.4.
To avoid degenerated cases, the condition as is essential to guarantee the boundedness of the variance of
Now, we proceed to the main result.
Remark 4.5.
Theorem 4.1 is also true if , are Lipschitz functions.
4.2 Multidimensional Case
This section gives a multidimensional version of Theorem 4.1 and a generalisation of Theorem 3.3. Then, we apply the obtained results to show that additive functionals have the same asymptotic distribution as the corresponding integral functionals.
We use the following notations that enable us to obtain the result of this section analogously to the two-dimensional case. Let , and the set .
First, we consider the case of . We assume that the set can be represented as
where , , and , are smooth functions except of the sets where these functions have finite jumps, i.e.
where and , are constants. Here and are the number of jumps of . Thus, consists of a finite number of two-dimensional line segments.
Note that, by the homothety of , the set can be represented as , where , .
For a real-valued homogeneous isotropic Gaussian random field , let
where , is a non-random scalar function such that , and is a normalising factor.
We define the corresponding additive functional to , by
where
and
Remark 4.6.
The intersection with is required to correctly define the infimum and supremum for the cases when some points in are outside of , which may happen for the boundary region.
For each , define three-dimensional parallelepipeds as
where