Optimal partitioning of directed acyclic graphs with dependent costs between clusters

08/08/2023
by   Paul Pao-Yen Wu, et al.
0

Many statistical inference contexts, including Bayesian Networks (BNs), Markov processes and Hidden Markov Models (HMMS) could be supported by partitioning (i.e. mapping) the underlying Directed Acyclic Graph (DAG) into clusters. However, optimal partitioning is challenging, especially in statistical inference as the cost to be optimised is dependent on both nodes within a cluster, and the mapping of clusters connected via parent and/or child nodes, which we call dependent clusters. We propose a novel algorithm called DCMAP for optimal cluster mapping with dependent clusters. Given an arbitrarily defined, positive cost function based on the DAG and cluster mappings, we show that DCMAP converges to find all optimal clusters, and returns near-optimal solutions along the way. Empirically, we find that the algorithm is time-efficient for a DBN model of a seagrass complex system using a computation cost function. For a 25 and 50-node DBN, the search space size was 9.91× 10^9 and 1.51×10^21 possible cluster mappings, respectively, but near-optimal solutions with 88% and 72% similarity to the optimal solution were found at iterations 170 and 865, respectively. The first optimal solution was found at iteration 934 (95% CI 926,971), and 2256 (2150,2271) with a cost that was 4% and 0.2% of the naive heuristic cost, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2018

A New Cost Function for Hierarchical Cluster Trees

Hierarchical clustering has been a popular method in various data analys...
research
02/18/2021

Gradient-Tracking over Directed Graphs for solving Leaderless Multi-Cluster Games

We are concerned with finding Nash Equilibria in agent-based multi-clust...
research
07/11/2018

A PTAS for Minimum Makespan Vehicle Routing in Trees

We consider a variant of the vehicle routing problem on trees in which t...
research
09/25/2017

Evolutionary Acyclic Graph Partitioning

Directed graphs are widely used to model data flow and execution depende...
research
06/24/2023

Partitioning-Guided K-Means: Extreme Empty Cluster Resolution for Extreme Model Compression

Compactness in deep learning can be critical to a model's viability in l...
research
05/05/2021

Optimally partitioning signed networks based on generalized balance

Signed networks, which contain both positive and negative edges, are now...
research
03/26/2020

A Two-Stage Reconstruction of Microstructures with Arbitrarily Shaped Inclusions

The main goal of our research is to develop an effective method with a w...

Please sign up or login with your details

Forgot password? Click here to reset