DSF-GAN: DownStream Feedback Generative Adversarial Network (2403.18267v1)
Abstract: Utility and privacy are two crucial measurements of the quality of synthetic tabular data. While significant advancements have been made in privacy measures, generating synthetic samples with high utility remains challenging. To enhance the utility of synthetic samples, we propose a novel architecture called the DownStream Feedback Generative Adversarial Network (DSF-GAN). This approach incorporates feedback from a downstream prediction model during training to augment the generator's loss function with valuable information. Thus, DSF-GAN utilizes a downstream prediction task to enhance the utility of synthetic samples. To evaluate our method, we tested it using two popular datasets. Our experiments demonstrate improved model performance when training on synthetic samples generated by DSF-GAN, compared to those generated by the same GAN architecture without feedback. The evaluation was conducted on the same validation set comprising real samples. All code and datasets used in this research will be made openly available for ease of reproduction.
- Wasserstein GAN, January 2017. URL https://arxiv.org/abs/1701.07875v3.
- Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records. In 2017 IEEE International Conference on Data Mining (ICDM), pp. 787–792, November 2017. doi: 10.1109/ICDM.2017.93. ISSN: 2374-8486.
- Generation of Heterogeneous Synthetic Electronic Health Records using GANs. December 2019. doi: 10.3929/ETHZ-B-000392473. URL http://hdl.handle.net/20.500.11850/392473. Medium: application/pdf,5 p. accepted version Publisher: ETH Zurich.
- Generative adversarial networks. Commun. ACM, 63(11):139–144, October 2020. ISSN 0001-0782, 1557-7317. doi: 10.1145/3422622. URL https://dl.acm.org/doi/10.1145/3422622.
- Feedback-AVPGAN: Feedback-guided generative adversarial network for generating antiviral peptides. J. Bioinform. Comput. Biol., 20(06):2250026, December 2022. ISSN 0219-7200. doi: 10.1142/S0219720022500263. URL https://www.worldscientific.com/doi/abs/10.1142/S0219720022500263. Publisher: World Scientific Publishing Co.
- Feedback Adversarial Learning: Spatial Feedback for Improving Generative Adversarial Networks. pp. 1476–1485, 2019. URL https://openaccess.thecvf.com/content_CVPR_2019/html/Huh_Feedback_Adversarial_Learning_Spatial_Feedback_for_Improving_Generative_Adversarial_Networks_CVPR_2019_paper.html.
- Ensemble synthetic ehr generation for increasing subpopulation model’s performance. arXiv preprint arXiv:2305.16363, 2023.
- TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks. Machine Learning and Knowledge Extraction, 4(2):488–501, June 2022. ISSN 2504-4990. doi: 10.3390/make4020022. URL https://www.mdpi.com/2504-4990/4/2/22. Number: 2 Publisher: Multidisciplinary Digital Publishing Institute.
- Modeling Tabular data using Conditional GAN, October 2019. URL http://arxiv.org/abs/1907.00503. arXiv:1907.00503 [cs, stat].
- Anonymization Through Data Synthesis Using Generative Adversarial Networks (ADS-GAN). IEEE Journal of Biomedical and Health Informatics, 24(8):2378–2388, August 2020. ISSN 2168-2208. doi: 10.1109/JBHI.2020.2980262. Conference Name: IEEE Journal of Biomedical and Health Informatics.
- CTAB-GAN: Effective Table Data Synthesizing. In Proceedings of The 13th Asian Conference on Machine Learning, pp. 97–112. PMLR, November 2021. URL https://proceedings.mlr.press/v157/zhao21a.html. ISSN: 2640-3498.
- CTAB-GAN+: Enhancing Tabular Data Synthesis, April 2022. URL http://arxiv.org/abs/2204.00401. arXiv:2204.00401 [cs].
- Oriel Perets (3 papers)
- Nadav Rappoport (8 papers)