1. Introduction
We consider a -dimensional ergodic diffusion process defined by the following stochastic differential equation such that
where is an -dimensional Wiener process,
is a random variable independent of
, and are unknown parameters, and are bounded, open and convex sets in admitting Sobolev’s inequalities for embedding for , is the true value of the parameter, and and are known functions.A matter of interest is to estimate the parameter with partial and indirect observation of : the observation is discretised and contaminated by exogenous noise. The sequence of observation , which our parametric estimation is based on, is defined as
where is the discretisation step such that and , is an i.i.d. sequence of random variables independent of and such that and where
is the identity matrix in
for every , and is a positive semi-definite matrix which is the variance of noise term. We also assume that the half vectorisation of has bounded, open and convex parameter space , and let us denote . We also notate the true parameter of as , its half vectorisation as , and . That is to say, our interest is on parametric inference for an ergodic diffusion with long-term and high-frequency noised observation. One concrete example is the wind velocity data provided by NWTC Information Portal (2018) whose observation is contaminated by exogenous noise with statistical significance according to the test for noise detection (Nakakita and Uchida, 2018b).
![]() |
![]() |
As the existent discussion, Nakakita and Uchida (2018b) propose the following estimator , and such that
where for every matrix , is the transpose of and , and are the adaptive quasi-likelihood functions of and respectively defined in Section 3, is a tuning parameter, and Nakakita and Uchida (2018b) show these estimators are asymptotically normal and especially the drift one is asymptotically efficient. To obtain the convergence rates of the estimators, it is necessary to see the composition of the quasi-likelihood functions. Both of them are function of local means of observation defined as
where is the number of partition given for observation, is that of observation in each partition, is the time interval which each partition has, and note that these parameters have the properties , and . Intuitively speaking, and correspond to and in the observation scheme without exogenous noise, and divergence of
works to eliminate the influence of noise by law of large numbers. Hence it should be also easy to understand that we have the asymptotic normality with the convergence rates
and for and ; that is,where is an
-dimensional Gaussian distribution with zero-mean.
The statistical inference for diffusion processes with discretised observation has been investigated in these decades: see Florens-Zmirou (1989), Yoshida (1992), Bibby and Sørensen (1995), Kessler (1995, 1997). In practice, it is necessary to argue whether exogenous noise exists in observation, and it has been pointed out that the observational noise, known as microstructure noise, certainly exists in high-frequency financial data which is one of the major disciplines where statistics for diffusion processes is applied. Inference for diffusions under such the noisy and discretised observation in fixed time interval is discussed by Jacod et al. (2009), and also Favetto (2014, 2016) examine same problem as our study and shows simultaneous ML-type estimation has consistency under the situation where the variance of noise is unknown and asymptotic normality under the situation where the variance is known. As mentioned above, Nakakita and Uchida (2018b) propose adaptive ML-type estimation which has asymptotic normality even if we do not know the variance of noise, and test for noise detection which succeeds in showing the real data NWTC Information Portal (2018) which is contaminated by observational noise.
Our study aims at polynomial type large deviation inequalities for statistical random fields and construction of the estimators with not only asymptotic normality as shown in Nakakita and Uchida (2018b) but also a certain type of convergence of moments. Asymptotic normality is well-known as one of the hopeful properties that estimators are expected to have; for instance, Nakakita and Uchida (2018a) utilise this result to compose likelihood-ratio-type statistics and related ones for parametric test and proves the convergence in distribution to a
-distribution under null hypothesis and consistency of the test under alternative one. However, it is also known that asymptotic normality is not sufficient to develop some discussion requiring convergence of moments such as information criterion. In concrete terms, it is necessary to shows the convergence of moments such as for every
with at most polynomial growth and adaptive ML-type estimator and ,This property is stronger than mere asymptotic normality since if we take as a bounded and continuous function, then indeed asymptotic normality follows.
To see the convergence of moments for adaptive ML-type estimator, we can utilise polynomial-type large deviation inequalities (PLDI) and quasi-likelihood analysis (QLA) proposed by Yoshida (2011) which have been widely used to discuss convergence of moments of not only ML-type estimation but also Bayes-type one in statistical inference for continuous-time stochastic processes. This approach is developed from the exponential-type large deviation and likelihood analysis introduced by Ibragimov and Has’minskii (1972, 1973, 1981), and the polynomial-type one discussed by Kutoyants (1984, 1994, 2004). Yoshida (2011) itself discusses convergence of moments in adaptive maximum-likelihood-type estimation, simultaneous Bayes-type one, and adaptive Bayes-type one for ergodic diffusions with and . Uchida and Yoshida (2012, 2014) examine the same problem for adaptive ML-type and adaptive Bayes-type estimation for ergodic diffusions with more relaxed condition: and for some . Ogihara and Yoshida (2011) study convergence of moments for parametric estimators against ergodic jump-diffusion processes in the scheme of and . Other than diffusion processes or jump-diffusions, Clinet and Yoshida (2017) show PLDI for the quasi-likelihood function for ergodic point processes and the convergence of moments for the corresponding ML-type and Bayes-type estimators. As the applications of these discussions, Uchida (2010) composes AIC-type information criterion for ergodic diffusion processes, and Eguchi and Masuda (2018) propose BIC-type one for local-asymptotic quadratic statistical experiments including some schemes for diffusion processes. In this paper, we develop QLA for our ergodic diffusion plus noise model and propose the adaptive Bayes-type estimators of both drift and volatility parameters. Furthermore, we show the convergence of moments of both the adaptive ML-type estimators and the adaptive Bayes-type estimators for the ergodic diffusion plus noise model. Note that Bayes-type estimation itself is important to deal with non-linearity of parameters and multimodality of quasi-likelihood functions which sometimes appear in statistics for diffusion processes. In particular, the hybrid type estimators with initial Bayes-type estimators are considered for diffusion type processes, see Kamatani and Uchida (2015); Kaino and Uchida (2018a, b), and references therein. Moreover, as an application of the Bayes-type estimation proposed in this paper, Kaino et al. (2018) study the hybrid estimators with initial Bayes-type estimators for our ergodic diffusion plus noise model and give an example and simulation results of the hybrid estimator.
2. Notation and assumption
We set the following notations.
-
For every matrix , is the transpose of , and .
-
For every set of matrices and whose dimensions coincide, . Moreover, for any , and , .
-
For any vector and any matrix , and .
-
For every , is the -norm.
-
, , and .
-
For given , , , and , and we define the sequence of local means such that
where indicates an arbitrary sequence defined on the mesh such as , and .
Remark 1.
-
, , , , , and .
-
We define the real-valued function as for :
and with the function as for and ,
we define the matrix as for ,
-
Let
be sequences of -valued functions and -valued ones respectively such that the components of themselves and their derivative with respect to are polynomial growth functions for all and . Then we define the following matrix-valued functionals, for ,
where is the invariant measure of discussed in the following assumption [A1]-(iv), and for all function on , .
With respect to , we assume the following conditions.
-
-
.
-
For some constant , for all ,
-
For all , .
-
There exists an unique invariant measure on and for all and with polynomial growth,
-
For any polynomial growth function satisfying , there exist , with at most polynomial growth for such that for all ,
where is the infinitesimal generator of .
-
Remark 2.
-
There exists such that and have continuous derivatives satisfying
With the invariant measure , we define
where . For these functions, let us assume the following identifiability conditions hold.
-
For all , there exists a constant such that for all .
-
For all , there exists a constant such that for all .
The next assumption is with respect to the moments of noise.
-
For any , has -th moment and the components of are independent of the other components for all , and
. In addition, for all odd integer
, , , and , , and .
The assumption below determines the balance of convergence or divergence of several parameters. Note that is a tuning parameter and hence we can control it arbitrarily in its space .
-
, , , , , for . Furthermore, there exists such that for sufficiently large .
Remark 3.
Let us denote and where and the components of their derivatives are polynomial growth with respect to uniformly in . Then the discussion in Uchida (2010) verifies under [A1] and [A6], for all ,
3. Quasi-likelihood analysis
First of all, we introduce and analyse some quasi-likelihood functions and estimators which are defined in Nakakita and Uchida (2018b). The quasi-likelihood functions for the diffusion parameter and the drift one using this sequence are as follows:
where . We set the adaptive ML-type estimator , and such that
Assume that , are continuous and , and denote the adaptive Bayes-type estimators
Our purpose is to show the polynomial-type large deviation inequalities for the quasi-likelihood functions defined above in the framework introduced by Yoshida (2011), and the convergences of moments for these estimators as the application of them. Let us denote the following statistical random fields for and
and some sets
and for ,
We use the notation as Nakakita and Uchida (2018b) for the information matrices
where for ,
and for ,
We also denote and .
Theorem 1.
Under [A1]-[A6], we have the following results.
-
The polynomial-type large deviation inequalities hold: for all , there exists a constant such that for all ,
-
The convergences of moment hold:
where
and is an arbitrary continuous functions of at most polynomial growth.
3.1. Evaluation for local means
In the first place we give some evaluations related to local means. Some of the instruments are inherited from the previous studies by Nakakita and Uchida (2017) and Nakakita and Uchida (2018b). We define the following random variables:
The next lemma is Lemma 11 in Nakakita and Uchida (2018b).
Lemma 1.
and are -measurable, independent of and Gaussian.These variables have the next decompositions:
The evaluation of the following conditional expectations holds: