Generative Adversarial Networks for Bitcoin Data Augmentation

05/27/2020
by   Francesco Zola, et al.
0

In Bitcoin entity classification, results are strongly conditioned by the ground-truth dataset, especially when applying supervised machine learning approaches. However, these ground-truth datasets are frequently affected by significant class imbalance as generally they contain much more information regarding legal services (Exchange, Gambling), than regarding services that may be related to illicit activities (Mixer, Service). Class imbalance increases the complexity of applying machine learning techniques and reduces the quality of classification results, especially for underrepresented, but critical classes. In this paper, we propose to address this problem by using Generative Adversarial Networks (GANs) for Bitcoin data augmentation as GANs recently have shown promising results in the domain of image classification. However, there is no "one-fits-all" GAN solution that works for every scenario. In fact, setting GAN training parameters is non-trivial and heavily affects the quality of the generated synthetic data. We therefore evaluate how GAN parameters such as the optimization function, the size of the dataset and the chosen batch size affect GAN implementation for one underrepresented entity class (Mining Pool) and demonstrate how a "good" GAN configuration can be obtained that achieves high similarity between synthetically generated and real Bitcoin address data. To the best of our knowledge, this is the first study presenting GANs as a valid tool for generating synthetic address data for data augmentation in Bitcoin entity classification.

READ FULL TEXT
research
04/17/2020

YuruGAN: Yuru-Chara Mascot Generator Using Generative Adversarial Networks With Clustering Small Dataset

A yuru-chara is a mascot character created by local governments and comp...
research
07/07/2021

GAN-based Data Augmentation for Chest X-ray Classification

A common problem in computer vision – particularly in medical applicatio...
research
01/13/2021

Sequential IoT Data Augmentation using Generative Adversarial Networks

Sequential data in industrial applications can be used to train and eval...
research
07/21/2018

Conditional Infilling GANs for Data Augmentation in Mammogram Classification

Deep learning approaches to breast cancer detection in mammograms have r...
research
10/14/2021

IB-GAN: A Unified Approach for Multivariate Time Series Classification under Class Imbalance

Classification of large multivariate time series with strong class imbal...
research
12/20/2022

Conditioned Generative Transformers for Histopathology Image Synthetic Augmentation

Deep learning networks have demonstrated state-of-the-art performance on...
research
02/26/2019

Realistic Ultrasonic Environment Simulation Using Conditional Generative Adversarial Networks

Recently, realistic data augmentation using neural networks especially g...

Please sign up or login with your details

Forgot password? Click here to reset