Towards Improved Illicit Node Detection with Positive-Unlabelled Learning

03/04/2023
by Junliang Luo, et al.

Detecting illicit nodes on blockchain networks is a valuable task for strengthening future regulation. Recent machine learning-based methods proposed for this task use blockchain transaction datasets in which a small portion of samples is labelled positive and the rest are unlabelled (positive-unlabelled, PU). Although some works assume that a random sample of unlabelled nodes consists of normal nodes, we argue that the labelling mechanism assumed for the hidden positive labels, and its effect on the evaluation metrics, is worth considering. We further explore how PU classifiers, which account for potential hidden positive labels, can improve performance over regular machine learning models. To make the results more reliable, we test the PU classifiers with several graph representation learning methods, which yield different feature distributions for the same data.
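To make the point about evaluation metrics concrete, the following is a minimal sketch rather than the authors' method. It assumes, for illustration only, that labelled positives are selected completely at random from all illicit nodes with a fixed label frequency of 0.5. The node counts, the toy classifier output, and the names `y_true`, `s_observed`, and `y_pred` are all hypothetical. It compares precision and recall measured against the observed labels (unlabelled treated as negative) with the values measured against the hidden ground truth.

```python
import numpy as np


def precision_recall(y_ref, y_pred):
    """Precision and recall of y_pred against the reference labels y_ref."""
    tp = int(np.sum((y_ref == 1) & (y_pred == 1)))
    predicted_pos = int(np.sum(y_pred == 1))
    ref_pos = int(np.sum(y_ref == 1))
    precision = tp / predicted_pos if predicted_pos else 0.0
    recall = tp / ref_pos if ref_pos else 0.0
    return precision, recall


# Hypothetical toy data: 1000 nodes, of which the first 100 are truly illicit.
n_nodes, n_illicit = 1000, 100
y_true = np.zeros(n_nodes, dtype=int)
y_true[:n_illicit] = 1

# Observed PU labels: only every second illicit node carries a positive label
# (assumed label frequency c = 0.5); every other node is unlabelled (0).
s_observed = np.zeros(n_nodes, dtype=int)
s_observed[0:n_illicit:2] = 1

# Hypothetical classifier output: flags 80 illicit nodes and 20 licit nodes.
y_pred = np.zeros(n_nodes, dtype=int)
y_pred[:80] = 1
y_pred[100:120] = 1

true_precision, true_recall = precision_recall(y_true, y_pred)
pu_precision, pu_recall = precision_recall(s_observed, y_pred)
label_frequency = s_observed.sum() / y_true.sum()

print(f"true precision={true_precision:.2f}, true recall={true_recall:.2f}")
print(f"PU   precision={pu_precision:.2f}, PU   recall={pu_recall:.2f}")
print(f"PU precision / label frequency = {pu_precision / label_frequency:.2f}")
```

In this toy setting, recall measured on the observed labels matches the true recall, while precision is understated by the label frequency, because flagged nodes that are illicit but unlabelled count as false positives. Under a different labelling mechanism the relationship would generally not be this simple.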


research
11/08/2021

Inferential SIR-GN: Scalable Graph Representation Learning

Graph representation learning methods generate numerical vector represen...
research
06/14/2021

PI-GNN: A Novel Perspective on Semi-Supervised Node Classification against Noisy Labels

Semi-supervised node classification, as a fundamental problem in graph l...
research
10/07/2019

On the Interpretability and Evaluation of Graph Representation Learning

With the rising interest in graph representation learning, a variety of ...
research
03/08/2023

Automatic Debiased Learning from Positive, Unlabeled, and Exposure Data

We address the issue of binary classification from positive and unlabele...
research
08/27/2018

Learning from Positive and Unlabeled Data under the Selected At Random Assumption

For many interesting tasks, such as medical diagnosis and web page class...
research
04/26/2015

Assessing binary classifiers using only positive and unlabeled data

Assessing the performance of a learned model is a crucial part of machin...
research
02/13/2022

Vital Node Identification in Complex Networks Using a Machine Learning-Based Approach

Vital node identification is the problem of finding nodes of highest imp...
