DP-TBART: A Transformer-based Autoregressive Model for Differentially Private Tabular Data Generation

07/19/2023
by Rodrigo Castellon, et al.

Generating synthetic tabular data that preserves differential privacy is a problem of growing importance. Traditional marginal-based methods have achieved impressive results, and recent work has shown that deep learning-based approaches tend to lag behind them. In this work, we present Differentially-Private TaBular AutoRegressive Transformer (DP-TBART), a transformer-based autoregressive model that maintains differential privacy and achieves performance competitive with marginal-based methods on a wide variety of datasets, even outperforming state-of-the-art methods in certain settings. We also provide a theoretical framework for understanding the limitations of marginal-based approaches and where deep learning-based approaches stand to contribute most. These results suggest that deep learning-based techniques should be considered a viable alternative to marginal-based methods for generating differentially private synthetic tabular data.
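
For context, the two notions the abstract relies on can be written out as below. This is a sketch using standard textbook definitions, not notation taken from the paper: the symbols $x_1, \dots, x_d$ (the columns of one table row), $\theta$ (model parameters), $\mathcal{M}$ (the randomized mechanism that trains the model and releases synthetic data), and $(\varepsilon, \delta)$ (the privacy budget) are assumptions introduced here for illustration. The abstract does not say which column ordering or which variant of differential privacy is used.

```latex
% Autoregressive factorization of one table row with d columns
% (chain rule of probability; the column ordering is an assumption):
p_\theta(x_1, \dots, x_d) = \prod_{i=1}^{d} p_\theta(x_i \mid x_1, \dots, x_{i-1})

% Standard (\varepsilon, \delta)-differential privacy: for all neighboring
% datasets D, D' (differing in one record) and all measurable output sets S,
\Pr[\mathcal{M}(D) \in S] \le e^{\varepsilon} \, \Pr[\mathcal{M}(D') \in S] + \delta
```

Under this reading, a synthetic row would be drawn one column at a time, each conditioned on the columns already sampled, while the privacy guarantee constrains how much any single training record can influence the released output.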
