Measuring the quality of synthetic data for use in competitions

06/29/2018
by James Jordon, et al.

Machine learning has the potential to help many communities make use of the large datasets that are becoming increasingly available. Unfortunately, much of that potential goes unrealized because it would require sharing data in a way that compromises privacy. To overcome this hurdle, several methods have been proposed that generate synthetic data while preserving the privacy of the real data. In this paper we consider a key characteristic that synthetic data should have in order to be useful for machine learning researchers: the relative performance of two algorithms when trained and tested on the synthetic dataset should be the same as their relative performance when trained and tested on the original dataset.
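
The characteristic described in the abstract can be illustrated with a small sketch. It assumes each algorithm has already been trained and tested once on the original dataset and once on the synthetic dataset, giving one score per algorithm per dataset (higher is better). The function name `ranking_agreement`, the algorithm names, and the scores are all hypothetical and chosen for illustration; this is one simple way to quantify the idea, not necessarily the metric defined in the full paper.

```python
from itertools import combinations


def ranking_agreement(real_scores, synthetic_scores):
    """Fraction of algorithm pairs ordered the same way on both datasets.

    Both arguments map a (hypothetical) algorithm name to a score where
    higher is better, e.g. test accuracy after training on that dataset.
    """
    if real_scores.keys() != synthetic_scores.keys():
        raise ValueError("both score tables must cover the same algorithms")
    pairs = list(combinations(sorted(real_scores), 2))
    if not pairs:
        raise ValueError("need at least two algorithms to compare")

    def sign(x):
        # -1, 0, or 1 depending on the sign of x
        return (x > 0) - (x < 0)

    # A pair agrees when the better algorithm is the same on both datasets.
    agree = sum(
        sign(real_scores[a] - real_scores[b])
        == sign(synthetic_scores[a] - synthetic_scores[b])
        for a, b in pairs
    )
    return agree / len(pairs)


# Made-up illustrative scores, not results from the paper.
real = {"logistic": 0.81, "forest": 0.86, "mlp": 0.84}
synthetic = {"logistic": 0.74, "forest": 0.79, "mlp": 0.72}
print(ranking_agreement(real, synthetic))
```

A value of 1.0 would mean every pair of algorithms is ranked the same way on the synthetic data as on the original data. In the made-up example, "logistic" and "mlp" swap order on the synthetic data, so one of the three pairs disagrees.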

research
12/21/2021

Synthetic Data and Simulators for Recommendation Systems: Current State and Future Directions

Synthetic data and simulators have the potential to markedly improve the...
research
03/10/2022

Conditional Synthetic Data Generation for Personal Thermal Comfort Models

Personal thermal comfort models aim to predict an individual's thermal c...
research
04/13/2022

Enabling Synthetic Data adoption in regulated domains

The switch from a Model-Centric to a Data-Centric mindset is putting emp...
research
09/20/2019

BinarySDG: binary sensor data generation with R

The scarcity of Smart Home data is still a pretty big problem, and in a ...
research
05/30/2023

How Generative Models Improve LOS Estimation in 6G Non-Terrestrial Networks

With the advent of 5G and the anticipated arrival of 6G, there has been ...
research
11/08/2021

Spirometry-based airways disease simulation and recognition using Machine Learning approaches

The purpose of this study is to provide means to physicians for automate...
research
12/08/2020

Synthetic Data: Opening the data floodgates to enable faster, more directed development of machine learning methods

Many ground-breaking advancements in machine learning can be attributed ...
