DP-Select: Improving Utility and Privacy in Tabular Data Synthesis with Differentially Private Generative Adversarial Networks and Differentially Private Selection
Loading...
Date
2023-05-29
Authors
Ebrahimianghazani, Faezeh
Advisor
Kerschbaum, Florian
Journal Title
Journal ISSN
Volume Title
Publisher
University of Waterloo
Abstract
This thesis proposes DP-Select, a novel approach to tabular data synthesis that combines DP-GAN and differentially private selection. We develop a mutual information-based selection method that is flexible and scalable for high-dimensional data and large numbers of marginals while being differentially private. We evaluate DP-Select on various datasets and demonstrate its effectiveness and utility compared to existing DP-GAN methods. Our results indicate that DP-Select significantly enhances the utility of synthesized data while maintaining strong privacy guarantees, making it a promising extension of DP-GANs for privacy-preserving data synthesis in terms of differential privacy. We also show that DP-Select performs better for smaller privacy budgets, making it an attractive option for scenarios with limited privacy budgets.
Description
Keywords
data synthesis, differential privacy, generative adversarial network, differentially private selection, exponential mechanism, data privacy, tabular data