Data collaboration analysis for distributed datasets

02/20/2019
by   Akira Imakura, et al.
0

In this paper, we propose a data collaboration analysis method for distributed datasets. The proposed method is a centralized machine learning while training datasets and models remain distributed over some institutions. Recently, data became large and distributed with decreasing costs of data collection. If we can centralize these distributed datasets and analyse them as one dataset, we expect to obtain novel insight and achieve a higher prediction performance compared with individual analyses on each distributed dataset. However, it is generally difficult to centralize the original datasets due to their huge data size or regarding a privacy-preserving problem. To avoid these difficulties, we propose a data collaboration analysis method for distributed datasets without sharing the original datasets. The proposed method centralizes only intermediate representation constructed individually instead of the original dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2022

Another Use of SMOTE for Interpretable Data Collaboration Analysis

Recently, data collaboration (DC) analysis has been developed for privac...
research
08/31/2022

Non-readily identifiable data collaboration analysis for multiple datasets including personal information

Multi-source data fusion, in which multiple data sources are jointly ana...
research
12/29/2020

Privacy-Preserving Methods for Vertically Partitioned Incomplete Data

Distributed health data networks that use information from multiple sour...
research
11/09/2020

Interpretable collaborative data analysis on distributed data

This paper proposes an interpretable non-model sharing collaborative dat...
research
12/06/2022

Achieving Transparency in Distributed Machine Learning with Explainable Data Collaboration

Transparency of Machine Learning models used for decision support in var...
research
02/27/2019

Decentralized Evolution and Consolidation of RDF Graphs

The World Wide Web and the Semantic Web are designed as a network of dis...
research
05/07/2019

Collaborative and Privacy-Preserving Machine Teaching via Consensus Optimization

In this work, we define a collaborative and privacy-preserving machine t...

Please sign up or login with your details

Forgot password? Click here to reset