1 Introduction
One of the basic problems of quickest changepoint detection is the design of an optimal sequential test. If an alarm is too quickly, there is a high risk that the detection is false. And at the same time, if an alarm is too slowly, the detection delay would be too large. So an optimal sequential test is important, which is expected to have the smallest detection delay among all tests without exceeding a certain false alarm rate.
Moreover, an optimal sequential test is useful in earthquake monitoring, traffic monitoring, etc. And these data are often in the form of network data. There are many network data in our daily life, such as social networks, the representations are Facebook, wechat, etc. Hence, it is useful to analyze network data, such as the change detection of network data, which is in Amini A A , Nguyen X L.(2013), Keshavarz H, Michailidis G, Atchade Y. (2018) and Wei, C. , Wiesel, A. , and Blum, R. S. . (2012). These papers discuss the change detection problem of the Gaussian graphic model. For Gaussian graphic model, we can get accurate distribution conveniently, which makes the change point problem easy to deal with. The Gaussian graphic model is a very special case.
However there are many challenges and difficulties in dealing with general large network data. When the network is large, it is difficult to obtain the distribution of network data, because that the normalization coefficient is not easy to be calculated, since it needs exponential computation complexity. For example, the distribution of exponential family of random graph models(ERGMs), also known as models, which is introduced by Frank and Strauss(1986), Frank(1991) and Wasserman and Pattison(1996), is , represents for a network, are the features of the network, the normalization coefficient is difficult to obtain especially for large network. And this phenomenon also exsits for the Markov networks in probabilistic graphical model which is introduced by Koller D, Friedman N(2009). It is common in network data, which urges us to find a method to overcome this difficulty.
At the same time, in the literature, there are mainly four kinds of optimal sequential tests in infinite independent observation sequence: Shiryaev test , SLR (sum of the log likelihood ratio) test , the conventional CUSUM test introduced by Page (1954) and ShiryaevRoberts test for an infinite independent observation sequence. The tests , , , , and were proven by Shiryaev (1963, 1978), Chow, Robbins and Siegmund (1971, P.108), Frisén (2003), Moustakides (1986) and Polunchenko and Tartakovsky (2010), to be optimal, and the control line are all constants. All of the optimal tests are based on likelihood ratio statistics, and the analyzed observation sequences are onedimensional. Moreover, to obtain infinite sequnce of networks is not realistic. So it is necessary to deal with the changepoint problem for finite sequences of networks about how to give the corresponding optimal sequential tests.
In this paper, in order to overcome the above difficulties, we give a finite measure space to replace the traditional probability space by discarding the normailization coefficient. And a new measure of detection delay, which is based on the corresponding measure space, is proposed. The likelihood ratio statistics are replaced by measure ratio statistics. In a word, the method we proposed are based on the finite measure not the probability space. Moreover, we know that the difference between measure space and probability space is the normalization coefficient, so traditional likelihood ratio statistics are special cases of our method. Note that, we consider unweighted network sequences here.
The rest of this paper is organized as follows. Section 2 states the method we use in this paper. Section 2.1 presents a performance measure for finite network data, , to evaluate how well a sequential test detects changes in a finite network observations. Section 2.2 gives the optimal sequential test under the performance measure we defined. Our results are illustrated by comparison of the numerical simulations of different sequential tests in Section 3, and the optimal sequential test we give obviously have better peformance than other 3 sequential tests. Section 3.1 is the case for independent and dependent finite sequences for networks in finite measure space. Section 3.2 is the case for independent and dependent finite sequences for networks in traditional probability space. Section 4 includes some concluding remarks. Proofs of the theorems are given in the Appendix.
2 Optimal sequential tests for finite sequences of network under finite measure
In this section, we first briefly describe the differences between probability space and measure space. Then we construct the optimal sequential tests for network observations.
Consider the cases such as the exponential family of random graph models(ERGMs) which is proposed in Frank and Strauss(1986), Frank(1991) and Wasserman and Pattison(1996).
represents a network, are the crucial features of the network.
When the normalization coefficient is difficult to deal with, in the probability space, the statistics, such as CUSUMtype, cannot be used. To avoid this, we consider finite measure space , we just discard the normalization coefficient, since there is no need to guarantee that in finite measure space.
And as in the probabiliy space, we also have measure density functions and conditional measure density functions for network objects as follows.

Suppose that there is a network set , let be a finite measure, assume that is differentiable in the usual sense in calculus. Let be the derivative of .
is the network object, is field. In the discrete case, we use to replace . Then is the corresponding measure density function.

The conditional measure density functions
Moreover, in this paper, we consider unweighted network sequences. The joint meansure density function is similar. Suppose that there are finite dependent network observations, denoted by . Then let denotes the number of nodes in each network observation. In this paper, we consider the changepoint problem for network sequences as follows

All the networks in sequences have a common number of nodes , which is a constant. Moreover, without loss of generality, we assume .

Prechange and postchange measure density functions are and , and conditional measure density functions of the th observation for are and , which are known. And the measures above are all finite.
Let () be the changepoint time, then the joint measure density function can be written as
where and are respectively the prechange and the postchange conditional measure densities of the th observation for . When , i.e., a change never occurs in observations . And the corresponding joint measure function is denoted by .
2.1 Performance measures of sequential tests
There are some different performance measures for sequential test in Shiryaev (1963, 1978), Chow, Robbins and Siegmund (1971, P.108), Frisén (2003), Moustakides (1986) and Polunchenko and Tartakovsky (2010). These performance measures relate to the probability distribution
and the probability expectation in probability space. So in finite measure space, we also have and when the changepoint time ().Similarity,
is the finite measure expectation, which is a first order moment based on joint measure density function. That is
(1) 
where is the joint measure when .
Futhermore, denotes the measure expectation with the prechange joint measure density .
Remark 1
It is easy to known that the measure expectations have the similar properties as in probability space.
Let be a sequential test, the definition is
where , is a set of all of the sequential tests satisfying and for . Following the idea of the randomization probability of the change time and the definition of describing the average detection delay proposed by Moustakides (2008), we define a performance measure , which is based on measure expectations, to evaluate the detection performance of each sequential test in the following
(2) 
where are weights, which are functions of .
For the CUSUM test statistics, which is widely used in quality detection. Moreover, it is simple, which is as follows
(3) 
for , where . And it is a special case with .
The performance measure in (2), denotes the detection delay. Note that when , or, , that is, we will newly detect the change of network starting from . So it is reasonable to let the weight of the detection delay as . At the same time, if we set the weight , then the numerator and denominator of the meansure in (2) can be regarded as the generalized outofcontrol average run length and incontrol average run length . For the numrical simulation in Section 3, we choose the CUSUM test statistics with .
2.2 Optimal sequential tests
In order to give the optimal network sequential tests, we fisrt give a definition about the optimal criterion of any sequential tests for finite observations under finite measure space. After defining a new performance measures under finite measure, we give the optimal sequential test, which is related to the control limit , so in this section, we show the optimal control limit under the performance measure we defined.
Definition 1
A sequential test with is optimal under the performance measure if
(4) 
where satisfies .
Before giving the optimal sequential test, we need to show the corresponding conditional measure expectations w.r.t. , , which is similar as in probability space. More details are as follow:
Let
be an integrable random variable on
. Let be a subfield of . The conditional measure expectation of given under measure , denoted by , is the a.s.unique random variable satisfying the following two conditions:is measurable in ;
for any
Remark 2
It is easy to obtain that the conditional measure expectations have the similar properties as in probability space. In the proofs of theorems, we need to use these properties.
Motivated by Chow, Robbins and Siegmund (1971) we present a nonnegative random dynamic control limit that is defined by the following recursive equations
(5)  
for , where is a constant,
Now, by using the test statistics and the control limits we define a sequential test as follows
(6) 
It is clear that , and for . The positive number can be regarded as an adjustment coefficient of the random dynamic control limit, as is increasing on with and for . And c control the value of .
The following theorem shows that the sequential test is optimal under the measure .
Theorem 1
Let be a positive number satisfying . There exists a positive number such that is optimal in the sense of (2) with ; that is,
(7) 
In particular, if satisfies , that is, and , then
(8) 
Here, the random dynamic control limit of the optimal control chart can be called an optimal dynamic control limit.
From Theorem 1 we know that for every measure corresponding to the weights , we can construct an optimal control chart under the measure .
While Markov process is common, so we give a simple version of Theorem 1 when the at most order of Markov process is fixed. Since and are measurable with respect to , there are functions , and such that
for . Therefore, the optimal control limit in (5) can be written as
for .
Note that the optimal control limit depends on the sequence of networks, for . We let the sequence of networks be at most a porder Markov process; that is, both the prechange network observations and the postchange network observations are order and order Markov processes with transition measure density functions and , respectively, where and
for and . We known that , means that the sequence of networks are mutually independent.
Theorem 2
Let the sequence of networks be at most a porder Markov process for and the weights and for . Then
Let . The optimal control limit can be written as
for .
Let and , then we have
for .
Theorem 2 is a special case of Theorem 1 when the observation sequence is a Markov process and at most order of this Markov process is fixed.
3 Comparison and analysis of simulation results
Suppose that there is a finite sequence of network and . In section 3.1, under finite measure space, we first give an example for independent samples condition by comparing four different sequential tests, with constant control limits, with linear decrease control limits, with linear increase control limits, and with optimal control limits which is defined in this paper. Then we give an example for dependent samples condition, which is 1order Markov process and we also compare the four different sequential tests. All of the above is based on CUSUMtype statistics and the number of nodes .
Note that traditional probability space is a special case, so when the normalization coefficient is computable for a network distribution, then we can use the probability space, we also give examples in Section 3.2 about independent and dependent ErdosRényi network sequences.
Moreover, in simulation, the performance measure with and for . And note that ME is proportitional to E, the ratio is fixed and unknown. For any test ,
is easily estimated by sample mean through simulation. So in the third column of tables, we replace
with . For all of the results, , and the four tests all have adjustment coefficient c to guarantee , which is denoted by in tables.The explanation of the simulation results is at the end of this section.
3.1 Comparison of simulation of indenpedent network observations in finite measure space
Let be an independent network observations sequence with a prechange measure density function and a postchange measure density function for . The function is a term used in Exponential Family Random Graph Models, which means the number of edges in a network.
There are many terms in ERGMs and many linear combination of these terms. The example we give is the simplest. The ratio of the prechange and postchange measure density functions and is
for . We compare the performance of four tests, the first one is with constant control limits
The second test, denotes a test with decrease control lmits as follows
And denotes a test with increase control lmits as follows
The last one is the optimal test with .
The simulation result for independent finite ERGM network observations is shown in Table 1. All of the results are based on repititions.
Table 1 Simulation of four tests for independent finite network observations.
Sequential tests  

0.3679  41.8339  0.353611  
0.2165  40.0626  0.201657  
0.3277  39.7875  0.314400  
0.1422  40.1929  0.130419 
3.2 Comparison of simulation of depedent network observations in finite measure space
Let be a dependent network observation sequence. We note that if there is no cross term of and in , then the network observations are independent obviously. So, here we let prechange measure density function and a postchange measure density function for . The ratio of the prechange and postchange measure density functions is
for . We compare the performance of four tests, which the same as indenpendent case above.
The simulation result of dependent finite ERGM network observations is shown in Table 2. All of the results are based on repititions.
Table 2 Simulation of four tests for dependent finite network observations.
Sequential tests  

0.725  40.0135  0.694468  
0.700  39.8105  0.6640197  
0.730  39.9017  0.698247  
0.1295  39.9192  0.073666 
3.3 Comparison of simulation of indepedent network observations in probability space
Let be an independent network observation sequence. Moreover, before the change point, the networks are ErdosRényi networks with link probability 0.5, and the link probability equals to 0.6 after the change point. We compare the performance of four tests, the first one is with constant control limits
The second test, denotes a test with decrease control lmits as follows
And denotes a test with increase control lmits as follows
The last one is the optimal test with
The simulation result of independent finite ErdosRényi network observations is shown in Table 3. All of the results are based on repititions.
Table 3 Simulation of four tests for independent finite network observations.
Sequential tests  

13.780  41.3517  13.136680  
11.750  40.3647  11.117500  
12.875  40.3902  12.258420  
2.050  40.0067  1.230929 
3.4 Comparison of simulation of depedent network observations in probability space
Let be a dependent network observation sequence. Given , is a ErdosRényi network. Before the change point, the link probability is , and the link probability equals to after the change point.
The simulation result of dependent finite ErdosRényi network observations is shown in Table 4. All of the results are based on repititions.
Table 4 Simulation of four tests for dependent finite network observations.
Sequential tests  

8.850  39.9266  8.425866  
7.000  39.9224  6.593795  
8.410  40.2549  8.008693  
4.000  40.0501  1.293891 
Next, we give a explanation about the numerical simulation results in the above four tables. The fist two tables are for general independent and dependent ERGMs. The last two tables are for specific ErdosRényi random networks. The first column shows that we compare four different sequential tests, the adjustment coefficient is shown in the second column to guarantee that , in tables we use to replace it. The fourth column is the focus of our attention, the smallest of , the better of the sequential test. We find that the optimal sequential tests in four tables all have the smallest , which is consistent with Theorem 1. Note that the method in this paper can be applied in many networks, as long as we know the distribution of the networks either in probability space or in general finite measure space. Futher, the sequence of networks in Section 3.2 and Section 3.4 can be order, while in simulation we choose 1order networks. And in simulation, we try to make sure , but in Table 1 and Table 3, for the constant control limits, , that’s because if we adjust the constant c a little bit, the corresponding is far from 40. There are many different control limits, in this paper we choose relatively representative control limits.
The usual CUSUM control chart we use in quality detection is constant control limits. From the simulation, we can conclude that under the performance measure we defined, constant control limits maybe not good enough, somtimes the increase control limits and decrease control limits have better performance, which indicates that finding an optimal dynamic control limits is necessary. The optimal sequential test we defined is the guidance for how to give the suitable or optimal dynamic control limits. Indeed we can conclude that the optimal control limits are a.s. decrease.
4 Concluding remarks
For the ERGMs models, and probabilistic graphical models, the normalization coefficient is difficult to compute, there are many method to approximate them, which needs complexty computations. We choose a finite measure to define corresponding measure ratio statistics for sequential test problem is a method to solve the difficulty. Measure ratio statistics can also show the difference between prechange and postchange distribution. And we can also give optimal sequential test under finite measure. All of the definition about finite measure space comes from Shao, J. (2003).
When the prechange and postchange measure density functions are known, it is simple. However, if the postchange measure density function is unknown, the problem may be more difficult, and would be considered later. If the number of parameters in ERGMs is large, it maybe results in dimension curse, which is also worthy to be considered.
The proofs of Thoerem in Appendix refers to Dong Han, Fugee Tsung, and Jinguo Xian(2019). The expectations and conditional expectations for the changepoint k() are extend to measure expectations and conditional measure expectations .
References
 [1] Amini A A , Nguyen X L.(2013) Sequential detection of multiple change points in networks: a graphical model approach. Transactions on information theory. IEEE, 59, 58245841.
 [2] Chow,Y.S., Robbins, H.and Siegmund, D.(1971) The Theory of Optimal Stopping. Dover Publications, INC. New York.
 [3] Frank, O. (1991). Statistical analysis of change in networks. Statistica Neerlandica, 45, 283293.
 [4] Frank, O., Strauss, D. (1986). Markov graphs. Journal of the American Statistical Association, 81, 832842.
 [5] Frisén, M. (2003) Statistical Surveillance, Optimality and Methods. Int. Statist. Rev., 71, 403434.
 [6] Han, D. and Tsung, F. G. (2004) A generalized EWMA control chart and its comparison with the optimal EWMA, CUSUM and CLR schemes.Ann. Statist., 32, 316339.
 [7] Han, D., Tsung, F. G. and Xian, J. G. (2017) On the optimality of Bayesian changepoint detection. Ann. Statist., 45, 13751402.
 [8] Han, D., Tsung, F. G. and Xian, J. G. (2019) Optimal Sequential Tests for Monitoring Changes in the Distribution of Finite Observation Sequences. arxiv.org
 [9] Keshavarz H, Michailidis G, Atchade Y. (2018) Sequential changepoint detection in highdimensional Gaussian graphical models. arxiv.org
 [10] Koller D, Friedman N. (2009) Probabilistic Graphical Models: Principles and Techniques.MIT Press.
 [11] Moustakides, G. V. (1986) Optimal stopping times for detecting changes in distribution. Ann. Statist., 14, 13791387.
 [12] Polunchenko, A. S. and Tartakovsky, A. G. (2010). On optimality of the ShiryaevRoberts procedure for detecting a change in distribution. Ann. Statist., 38, 34453457.
 [13] Page, E. S. (1954) Continuous inspection schemes. Biometrika,41 100114.
 [14] Shao, J. (2003). Mathematical Statistics. 2nd ed. Springer texts in statistics
 [15] Shiryaev, A. N. (1963) On optimum methods in quickest detection problems. Theory Probab. Appli., 13, 2246
 [16] Shiryaev, A. N. (1978) Optimal Stopping Rules. SpringerVerlag, New York.

[17]
Wasserman, S., Pattison, P.E. (1996). Logit models and logistic regressions for social networks: I. An introduction to Markov graphs and p*.
Psychometrika, 61, 401425.  [18] Wei, C. , Wiesel, A. , and Blum, R. S. . (2012). Distributed change detection in Gaussian graphical models. Information Sciences and Systems. IEEE.