A Privacy-Preserving Walk in the Latent Space of Generative Models for Medical Applications

07/06/2023
by   Matteo Pennisi, et al.
0

Generative Adversarial Networks (GANs) have demonstrated their ability to generate synthetic samples that match a target distribution. However, from a privacy perspective, using GANs as a proxy for data sharing is not a safe solution, as they tend to embed near-duplicates of real samples in the latent space. Recent works, inspired by k-anonymity principles, address this issue through sample aggregation in the latent space, with the drawback of reducing the dataset by a factor of k. Our work aims to mitigate this problem by proposing a latent space navigation strategy able to generate diverse synthetic samples that may support effective training of deep models, while addressing privacy concerns in a principled way. Our approach leverages an auxiliary identity classifier as a guide to non-linearly walk between points in the latent space, minimizing the risk of collision with near-duplicates of real samples. We empirically demonstrate that, given any random pair of points in the latent space, our walking strategy is safer than linear interpolation. We then test our path-finding strategy combined to k-same methods and demonstrate, on two benchmarks for tuberculosis and diabetic retinopathy classification, that training a model using samples generated by our approach mitigate drops in performance, while keeping privacy preservation.

READ FULL TEXT

page 3

page 8

research
11/03/2017

Metrics for Deep Generative Models

Neural samplers such as variational autoencoders (VAEs) or generative ad...
research
11/17/2016

Inverting The Generator Of A Generative Adversarial Network

Generative adversarial networks (GANs) learn to synthesise new samples f...
research
11/06/2017

Optimal transport maps for distribution preserving operations on latent spaces of Generative Models

Generative models such as Variational Auto Encoders (VAEs) and Generativ...
research
08/24/2022

GAN-based generative modelling for dermatological applications – comparative study

The lack of sufficiently large open medical databases is one of the bigg...
research
03/29/2021

Bayesian Attention Networks for Data Compression

The lossless data compression algorithm based on Bayesian Attention Netw...
research
06/07/2021

Double Descent and Other Interpolation Phenomena in GANs

We study overparameterization in generative adversarial networks (GANs) ...
research
09/12/2022

Generate novel and robust samples from data: accessible sharing without privacy concerns

Generating new samples from data sets can mitigate extra expensive opera...

Please sign up or login with your details

Forgot password? Click here to reset