1 Theoretical properties
One of the main strengths of the proposed methodology, possibly due to the SDLL, is that it can work well both in a change point regime as well as in a frequent jump regime: In a change point regime the minimum distance to the next change, , is reasonably large while the magnitude of the change is bounded from above and can be small (even tend to zero as ). In a frequent jump regime
is small (related to outlier detection) and necessarily corresponding jumpsneed to be large to be detectable. In both situations, an adaptation of Lemma 1 of Wang et al. (2018)
shows that no consistent estimator of the locations of change point exists when.
While WBS2.SDLL is shown to perform well in both regimes numerically, the paper does not provide a theoretical underpinning of this good behaviour, in the sense that only a linear-time change point setting with being of the same order as the sample size is considered: Such an assumption is not necessary for consistent change point detection and, moreover, it excludes models such as extreme.teeth (ET) and extreme.extreme.teeth (EET), which are reasonably considered as belonging to the frequent jump regime with . In the future, it will be very exciting to see which theoretical framework will help us to better understand the performance of statistical procedures that aim at handling both regimes simultaneously.
In addition, the best currently available results for the localisation rate attained by WBS as well as the requirement on the magnitude of changes for their detection, are sub-optimal when (see Proposition 3.4 of Cho and Kirch (2020)). Baranowski et al. (2019) and Wang et al. (2018) suggest modifications of WBS that alleviate the sub-optimality at the cost of introducing additional tuning parameters such as a threshold or an upper bound on the length of random intervals. However, even in these papers, the assumptions are formulated in terms of , which does not reflect that the strength of multiscale procedures lies in their ability to handle data sets containing both small changes with long distances to neighbouring change points, as well as large changes with shorter distances (see e.g., the mix model). Cho and Kirch (2020) consider multiscale change point situations by working with in the theoretical investigation of a more systematic moving sum (MOSUM)-type procedure for candidate generation.
2 SDLL with alternative candidate generation methods
As already pointed out by the author, both components of the proposed algorithm, i.e., candidate generation and model section, can be used in combination with other methods. For example, in Cho and Kirch (2020), a version of WBS2 has been adopted as a candidate generation method for the localised pruning method proposed for model selection. We will now show that deterministic candidate generation methods, such as the multiscale MOSUM procedure (Chan and Chen, 2017; Cho and Kirch, 2020), can be used with SDLL. Our first tentative attempt at generating a complete solution path of candidates with a reasonable measure of importance attached, is described in Section 3 below. Based on some initial simulation results reported in Table 3.1, we conclude that deterministic candidate generation methods can be a good alternative, and that this approach merits further research. Such a deterministic method will always yield the same result when applied to the same data set, whereas WBS-based methods can produce different outcomes in different runs (as observed in Cho and Kirch (2020) on array comparative genomic hybridization data sets). In particular, WBS-based results are reproducible only if the seed of the random number generation is also reported. In Section 4.1 of the present paper, the use of a ‘median’ of several runs is proposed to reduce this problem, which clearly comes at the cost of additional computation time.
3 MOSUM-candidate generation and some simulations
Many of the methods included in the comparative simulation studies of the present paper
have been designed for the change point regime with their default parameters chosen accordingly,
e.g., to save computation time.
For example, the algorithm referred to as ‘MOSUM’ in the present paper,
implemented in the R package mosum (Meier et al., 2019a),
has a tuning parameter that relates to the smallest permitted,
and its default value is set at , which we consider as a reasonable lower bound for a change point problem.
Also, the default choice of the parameter ,
which stems from change point testing and sets a threshold for candidate generation in the algorithm,
is somewhat conservative () and not very meaningful in the frequent jump regime.
Moving away from the change point regime,
we set the minimum bandwidth as small as possible in generating the bandwidth set ,
111We generate as detailed in Section 3.5 of Meier et al. (2019b) with
but only use bandwidths due to the necessity of local variance estimation.
due to the necessity of local variance estimation.and also set a more liberal threshold with . With these choices, MOSUM shows much better performance than that reported in the present paper, see Table 3.1 below.
Additionally, we explore the possibility of deterministic candidate generation based on moving sum statistics for a given set of bandwidth pairs :
At each scale , we identify all which maximises locally within , denote the collection of such by , and set . We aggregate the MOSUM statistics generated at multiple scales as , and then generate a solution path as in Algorithm 1, which is complete if the scale is included.
Referring to the methodology combining Algorithm 1 with SDLL as MOSUM.SDLL, Table 3.1 shows the results from applying WBS2.SDLL, MOSUM.SDLL (both with ) and MOSUM (with the aforementioned choice of parameters) to ET and EET summarised over
realisations. All methods perform better for EET than for ET since the signal-to-noise ratiois greater for ET (see also Section 1 above).
As already mentioned, MOSUM adapted for the frequent jump regime works considerably better than the default version calibrated for the change point regime. While being more conservative than the SDLL-based methods for ET, MOSUM still outperforms the others in terms of the absolute and the squared error measures and overall, it returns reasonably good estimators at a fraction of the time. MOSUM.SDLL shows that the deterministic candidate generation provides a promising alternative to WBS2: It performs slightly worse than WBS2.SDLL in identifying the correct number of change points () but the mean squared error of indicates that MOSUM.SDLL may return estimators of better localisation accuracy.
- Baranowski et al. (2019) Baranowski, R., Chen, Y., and Fryzlewicz, P. (2019), “Narrowest-over-threshold detection of multiple change-points and change-point-like features,” Journal of the Royal Statistical Society: Series B, 81, 649–672.
- Chan and Chen (2017) Chan, H. P. and Chen, H. (2017), “Multi-sequence segmentation via score and higher-criticism tests,” arXiv preprint, arXiv:1706.07586.
- Cho and Kirch (2020) Cho, H. and Kirch, C. (2020), “Two-stage data segmentation permitting multiscale change points, heavy tails and dependence,” arXiv preprint arXiv:1910.12486.
- Fryzlewicz (2014) Fryzlewicz, P. (2014), “Wild Binary Segmentation for multiple change-point detection,” The Annals of Statistics, 42, 2243–2281.
- Fryzlewicz (2020) — (2020), “Detecting possibly frequent change-points: Wild Binary Segmentation 2 and steepest-drop model selection,” Journal of the Korean Statistical Society.
- Meier et al. (2019a) Meier, A., Cho, H., and Kirch, C. (2019a), mosum: Moving sum based procedures for changes in the mean, R package version 1.2.3.
- Meier et al. (2019b) Meier, A., Kirch, C., and Cho, H. (2019b), “mosum: A package for moving sums in change point analysis,” To appear in the Journal of Statistical Software.
- Wang et al. (2018) Wang, D., Yu, Y., and Rinaldo, A. (2018), “Univariate mean change point detection: penalization, CUSUM and optimality,” arXiv preprint arXiv:1810.09498.