The non-tightness of the reconstruction threshold of a 4 states symmetric model with different in-block and out-block mutations
The tree reconstruction problem is to collect and analyze massive data at the nth level of the tree, to identify whether there is non-vanishing information of the root, as n goes to infinity. Its connection to the clustering problem in the setting of the stochastic block model, which has wide applications in machine learning and data mining, has been well established. For the stochastic block model, an "information-theoretically-solvable-but-computationally-hard" region, or say "hybrid-hard phase", appears whenever the reconstruction bound is not tight of the corresponding reconstruction on the tree problem. Although it has been studied in numerous contexts, the existing literature with rigorous reconstruction thresholds established are very limited, and it becomes extremely challenging when the model under investigation has 4 states (the stochastic block model with 4 communities). In this paper, inspired by the newly proposed q_1+q_2 stochastic block model, we study a 4 states symmetric model with different in-block and out-block transition probabilities, and rigorously give the conditions for the non-tightness of the reconstruction threshold.
READ FULL TEXT