Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Generative Models, Synthetic Tabular Data, and Differential Privacy: An Overview and Synthesis (2307.15424v2)

Published 28 Jul 2023 in cs.LG, stat.AP, stat.CO, and stat.ML

Abstract: This article provides a comprehensive synthesis of the recent developments in synthetic data generation via deep generative models, focusing on tabular datasets. We specifically outline the importance of synthetic data generation in the context of privacy-sensitive data. Additionally, we highlight the advantages of using deep generative models over other methods and provide a detailed explanation of the underlying concepts, including unsupervised learning, neural networks, and generative models. The paper covers the challenges and considerations involved in using deep generative models for tabular datasets, such as data normalization, privacy concerns, and model evaluation. This review provides a valuable resource for researchers and practitioners interested in synthetic data generation and its applications.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Conor Hassan (6 papers)
  2. Robert Salomone (17 papers)
  3. Kerrie Mengersen (82 papers)
Citations (3)