Learning Data Dependency with Communication Cost

04/29/2018
by Hyeryung Jang, et al.

In this paper, we consider the problem of recovering, from a set of data samples generated by nodes, a graph that represents the statistical data dependency among those nodes. This graph provides the basic structure needed to perform an inference task such as MAP (maximum a posteriori) estimation, and the problem of recovering it is referred to as structure learning. When nodes are spatially separated, running an inference algorithm requires a non-negligible amount of message passing, which incurs a communication cost. Because the learnt data dependency graph and the physical connectivity graph often differ substantially, there is an inevitable trade-off between the accuracy of structure learning and the cost of performing a given message-passing based inference task. We formalize this trade-off as an optimization problem whose output is a data dependency graph that jointly accounts for learning accuracy and message-passing cost. We focus on distributed MAP as the target inference task and consider two implementations, ASYNC-MAP and SYNC-MAP, which have different message-passing mechanisms and thus different cost structures. For ASYNC-MAP, we propose an optimal polynomial-time learning algorithm, motivated by the problem of finding a maximum weight spanning tree. For SYNC-MAP, we first prove that the problem is NP-hard and then propose a greedy heuristic. For both implementations, we use the large deviation principle to quantify how the probability that the learnt data graph differs from the ideal data graph decays as the number of data samples grows. The decay rate is characterized by topological properties of both the original data dependency graph and the physical connectivity graph, as well as by the degree of the trade-off. We validate our theoretical findings through extensive simulations, which show a good match with the analysis.
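To make the maximum weight spanning tree connection concrete, here is a minimal sketch in the style of the classical Chow-Liu procedure. It is not the ASYNC-MAP learning algorithm from the paper, whose objective the abstract does not spell out. The edge score used here (empirical mutual information minus `gamma` times a per-edge communication cost) is an assumption made for illustration, and the names `cost_aware_tree`, `cost`, and `gamma` are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log


def empirical_mutual_information(samples, i, j):
    """Plug-in estimate (in nats) of I(X_i; X_j) from a list of sample tuples."""
    n = len(samples)
    joint = Counter((s[i], s[j]) for s in samples)
    marg_i = Counter(s[i] for s in samples)
    marg_j = Counter(s[j] for s in samples)
    mi = 0.0
    for (a, b), c in joint.items():
        # p(a,b) * log( p(a,b) / (p(a) p(b)) ), written with raw counts
        mi += (c / n) * log(c * n / (marg_i[a] * marg_j[b]))
    return mi


def cost_aware_tree(samples, cost, gamma):
    """Spanning tree maximizing the sum of (MI - gamma * cost) over its edges.

    ASSUMPTION: this linear score is illustrative only; `cost[i][j]` is a
    hypothetical communication cost for using edge (i, j).
    """
    d = len(samples[0])
    scored = sorted(
        (
            (empirical_mutual_information(samples, i, j) - gamma * cost[i][j], i, j)
            for i, j in combinations(range(d), 2)
        ),
        reverse=True,  # Kruskal for a maximum weight tree: best edges first
    )

    parent = list(range(d))  # union-find forest for cycle detection

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    tree = []
    for _, i, j in scored:
        ri, rj = find(i), find(j)
        if ri != rj:  # edge joins two components, so it creates no cycle
            parent[ri] = rj
            tree.append((i, j))
    return sorted(tree)
```

With `gamma = 0` this reduces to the usual maximum mutual information tree. Increasing `gamma` shifts the result toward edges that are cheap to communicate over, even when they carry less statistical dependence, which is the trade-off the abstract describes.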


