Tackling Provably Hard Representative Selection via Graph Neural Networks

by   Seyed Mehran Kazemi, et al.

Representative selection (RS) is the problem of finding a small subset of exemplars from an unlabeled dataset, and has numerous applications in summarization, active learning, data compression and many other domains. In this paper, we focus on finding representatives that optimize the accuracy of a model trained on the selected representatives. We study RS for data represented as attributed graphs. We develop RS-GNN, a representation learning-based RS model based on Graph Neural Networks. Empirically, we demonstrate the effectiveness of RS-GNN on problems with predefined graph structures as well as problems with graphs induced from node feature similarities, by showing that RS-GNN achieves significant improvements over established baselines that optimize surrogate functions. Theoretically, we establish a new hardness result for RS by proving that RS is hard to approximate in polynomial time within any reasonable factor, which implies a significant gap between the optimum solution of widely-used surrogate functions and the actual accuracy of the model, and provides justification for the superiority of representation learning-based approaches such as RS-GNN over surrogate functions.


page 1

page 2

page 3

page 4


Measuring and Improving the Use of Graph Information in Graph Neural Networks

Graph neural networks (GNNs) have been widely used for representation le...

Distill n' Explain: explaining graph neural networks using simple surrogates

Explaining node predictions in graph neural networks (GNNs) often boils ...

Break the Wall Between Homophily and Heterophily for Graph Representation Learning

Homophily and heterophily are intrinsic properties of graphs that descri...

Learning Robust Node Representation on Graphs

Graph neural networks (GNN), as a popular methodology for node represent...

Detecting Relevant Feature Interactions for Recommender Systems via Graph Neural Networks

Feature interactions are essential for achieving high accuracy in recomm...

An Arithmetic Theory for the Poly-Time Random Functions

We introduce a new bounded theory RS^1_2 and show that the functions whi...

Please sign up or login with your details

Forgot password? Click here to reset