Synthetic Data Generation and Privacy-Preserving AI
DOI:
https://doi.org/10.47392/IRJAEM.2025.0270Keywords:
Synthetic Data Generation, Privacy-Preserving AI, Generative Adversarial Networks (GANs), Differential Privacy, Data Anonymization, Machine Learning Security, Ethical AI, Data Utility, Membership Inference AttacksAbstract
Synthetic data generation has rapidly emerged as a cornerstone technology for achieving privacy-preserving artificial intelligence (AI). In light of tightening data protection regulations and the growing ethical emphasis on safeguarding personal information, researchers have developed a range of methods to synthesize realistic datasets without compromising individual privacy. This review presents a comprehensive synthesis of existing approaches, focusing on generative adversarial networks (GANs), variational autoencoders (VAEs), and Bayesian techniques. We systematically evaluate these models based on data utility, privacy guarantees, and vulnerability to adversarial attacks. Despite significant progress, challenges such as utility-privacy trade-offs, model bias, and lack of standard evaluation metrics persist. This paper highlights these gaps and proposes strategic future directions for the research community, advocating for hybrid models, interpretability-focused synthetic generation, and cross-disciplinary collaborations to achieve more trustworthy AI ecosystems.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Research Journal on Advanced Engineering and Management (IRJAEM)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.