Active Discovery of Network Roles for Predicting the Classes of Network Nodes

12/27/2013
by Leto Peel, et al.

Nodes in real-world networks often have class labels, or underlying attributes, that are related to the way in which they connect to other nodes. Sometimes this relationship is simple; for instance, nodes of the same class may be more likely to be connected. In other cases, however, this is not true, and the way that nodes link in a network exhibits a different, more complex relationship to their attributes. Here, we consider networks in which we know how the nodes are connected, but we do not know the class labels of the nodes or how class labels relate to the network links. We wish to identify the best subset of nodes to label in order to learn this relationship between node attributes and network links. We can then use this discovered relationship to accurately predict the class labels of the rest of the network nodes. We present a model that identifies groups of nodes with similar link patterns, which we call network roles, using a generative blockmodel. The model then predicts labels by learning the mapping from network roles to class labels using a maximum margin classifier. We choose a subset of nodes to label according to an iterative margin-based active learning strategy. By integrating the discovery of network roles with the classifier optimisation, the active learning process can adapt the network roles to better represent the network for node classification. We demonstrate the model by exploring a selection of real-world networks, including a marine food web and a network of English words. We show that, in contrast to other network classifiers, this model achieves good classification accuracy for a range of networks with different relationships between class labels and network links.
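
The abstract does not spell out the exact query rule, so the following is only a minimal sketch of one common margin-based selection step, not the authors' method. It assumes a hypothetical `scores` mapping from each node to its per-class classifier scores, and picks the unlabeled node whose two highest scores are closest together, i.e. the node the classifier is least certain about. The function name and the toy numbers are illustrative assumptions.

```python
def smallest_margin_node(scores, labeled):
    """Return the unlabeled node whose top two class scores are closest.

    scores:  dict mapping node -> list of per-class classifier scores
             (hypothetical input; at least two classes per node).
    labeled: set of nodes whose labels are already known.
    Returns None when every node is already labeled.
    """
    best_node, best_margin = None, float("inf")
    for node, node_scores in scores.items():
        if node in labeled:
            continue  # only unlabeled nodes are candidates for a query
        top, runner_up = sorted(node_scores, reverse=True)[:2]
        margin = top - runner_up  # small margin = uncertain prediction
        if margin < best_margin:
            best_node, best_margin = node, margin
    return best_node


# Toy scores for four nodes and two classes (made-up numbers).
scores = {
    "a": [2.0, -1.0],
    "b": [0.3, 0.1],
    "c": [-0.5, 1.5],
    "d": [0.9, 0.8],
}
query = smallest_margin_node(scores, labeled={"a"})
```

In an iterative setting, the queried node would be labeled, added to `labeled`, and the classifier (and, in the model described above, the network roles) refitted before the next query.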


research
09/15/2011

Active Learning for Node Classification in Assortative and Disassortative Networks

In many real-world networks, nodes have class labels, attributes, or var...
research
10/05/2015

Learning in Unlabeled Networks - An Active Learning and Inference Approach

The task of determining labels of all network nodes based on the knowled...
research
10/22/2020

Joint Use of Node Attributes and Proximity for Semi-Supervised Classification on Graphs

The node classification problem is to infer unknown node labels in a gra...
research
02/27/2012

Protocols for Learning Classifiers on Distributed Data

We consider the problem of learning classifiers for labeled data that ha...
research
02/18/2014

A Bayesian Model of node interaction in networks

We are concerned with modeling the strength of links in networks by taki...
research
06/13/2020

Top influencers can be identified universally by combining classical centralities

Information flow, opinion, and epidemics spread over structured networks...
research
05/24/2012

Language-Constraint Reachability Learning in Probabilistic Graphs

The probabilistic graphs framework models the uncertainty inherent in re...
