Synthetic Data Generation and Privacy-Preserving AI

Authors

  • Sreeprasad Govindankutty Rochester Institute of Technology, Rochester, New York, United States Author

DOI:

https://doi.org/10.47392/IRJAEM.2025.0270

Keywords:

Synthetic Data Generation, Privacy-Preserving AI, Generative Adversarial Networks (GANs), Differential Privacy, Data Anonymization, Machine Learning Security, Ethical AI, Data Utility, Membership Inference Attacks

Abstract

Synthetic data generation has rapidly emerged as a cornerstone technology for achieving privacy-preserving artificial intelligence (AI). In light of tightening data protection regulations and the growing ethical emphasis on safeguarding personal information, researchers have developed a range of methods to synthesize realistic datasets without compromising individual privacy. This review presents a comprehensive synthesis of existing approaches, focusing on generative adversarial networks (GANs), variational autoencoders (VAEs), and Bayesian techniques. We systematically evaluate these models based on data utility, privacy guarantees, and vulnerability to adversarial attacks. Despite significant progress, challenges such as utility-privacy trade-offs, model bias, and lack of standard evaluation metrics persist. This paper highlights these gaps and proposes strategic future directions for the research community, advocating for hybrid models, interpretability-focused synthetic generation, and cross-disciplinary collaborations to achieve more trustworthy AI ecosystems.

Downloads

Download data is not yet available.

Downloads

Published

2025-05-05