Synthcity: facilitating innovative use cases of synthetic data in different data modalities

01/18/2023
by   Zhaozhi Qian, et al.
9

Synthcity is an open-source software package for innovative use cases of synthetic data in ML fairness, privacy and augmentation across diverse tabular data modalities, including static data, regular and irregular time series, data with censoring, multi-source data, composite data, and more. Synthcity provides the practitioners with a single access point to cutting edge research and tools in synthetic data. It also offers the community a playground for rapid experimentation and prototyping, a one-stop-shop for SOTA benchmarks, and an opportunity for extending research impact. The library can be accessed on GitHub (https://github.com/vanderschaarlab/synthcity) and pip (https://pypi.org/project/synthcity/). We warmly invite the community to join the development effort by providing feedback, reporting bugs, and contributing code.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2023

TemporAI: Facilitating Machine Learning Innovation in Time Domain Tasks for Medicine

TemporAI is an open source Python software library for machine learning ...
research
07/10/2023

Badgers: generating data quality deficits with Python

Generating context specific data quality deficits is necessary to experi...
research
10/24/2017

Synthetic Data for Social Good

Data for good implies unfettered access to data. But data owners must be...
research
06/27/2023

On the Usefulness of Synthetic Tabular Data Generation

Despite recent advances in synthetic data generation, the scientific com...
research
05/19/2023

TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series

Temporally indexed data are essential in a wide range of fields and of i...
research
11/17/2020

FTK: A Simplicial Spacetime Meshing Framework for Robust and Scalable Feature Tracking

We present the Feature Tracking Kit (FTK), a framework that simplifies, ...
research
01/17/2022

OmniPrint: A Configurable Printed Character Synthesizer

We introduce OmniPrint, a synthetic data generator of isolated printed c...

Please sign up or login with your details

Forgot password? Click here to reset