VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks

07/05/2023
by Zhaomin Wu, et al.

Vertical Federated Learning (VFL) is a crucial paradigm for training machine learning models on feature-partitioned, distributed data. However, due to privacy restrictions, few public real-world VFL datasets exist for algorithm evaluation, and these represent a limited array of feature distributions. Existing benchmarks often resort to synthetic datasets, derived from arbitrary feature splits from a global set, which only capture a subset of feature distributions, leading to inadequate algorithm performance assessment. This paper addresses these shortcomings by introducing two key factors affecting VFL performance - feature importance and feature correlation - and proposing associated evaluation metrics and dataset splitting methods. Additionally, we introduce a real VFL dataset to address the deficit in image-image VFL scenarios. Our comprehensive evaluation of cutting-edge VFL algorithms provides valuable insights for future research in the field.
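To make the "arbitrary feature splits from a global set" concrete, the following is a minimal sketch of the kind of uniform random vertical partition that the abstract describes as the common baseline. It is not the paper's importance-based or correlation-based splitting method. The helper names `random_vertical_split` and `party_view`, the toy data, and the round-robin assignment are all illustrative assumptions.

```python
import random


def random_vertical_split(n_features, n_parties, seed=0):
    """Assign feature indices to parties uniformly at random.

    Hypothetical helper for illustration only; it ignores feature
    importance and correlation entirely.
    """
    rng = random.Random(seed)
    indices = list(range(n_features))
    rng.shuffle(indices)
    # Deal the shuffled indices round-robin so party sizes differ by at most one.
    return [sorted(indices[p::n_parties]) for p in range(n_parties)]


def party_view(rows, feature_ids):
    """Project every sample onto the feature columns held by one party."""
    return [[row[j] for j in feature_ids] for row in rows]


# Toy global dataset: 2 samples, 6 features (made-up values).
rows = [
    [0.1, 0.2, 0.3, 0.4, 0.5, 0.6],
    [1.1, 1.2, 1.3, 1.4, 1.5, 1.6],
]

split = random_vertical_split(n_features=6, n_parties=3)
views = [party_view(rows, ids) for ids in split]
```

Each party sees every sample but only its own columns. Because the assignment is uniform, such a split gives no control over how informative or how mutually correlated the parties' features are, which is the gap the abstract points to.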

research
06/14/2020

The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems

This paper presents and characterizes an Open Application Repository for...
research
12/01/2022

Hijack Vertical Federated Learning Models with Adversarial Embedding

Vertical federated learning (VFL) is an emerging paradigm that enables c...
research
06/10/2021

Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

Vertical Federated Learning (VFL) refers to the collaborative training o...
research
12/22/2022

Federated Learning – Methods, Applications and beyond

In recent years the applications of machine learning models have increas...
research
01/30/2020

Multi-Participant Multi-Class Vertical Federated Learning

Federated learning (FL) is a privacy-preserving paradigm for training co...
research
07/14/2021

IFedAvg: Interpretable Data-Interoperability for Federated Learning

Recently, the ever-growing demand for privacy-oriented machine learning ...
research
09/30/2022

Vertical Semi-Federated Learning for Efficient Online Advertising

As an emerging secure learning paradigm in leveraging cross-silo private...
