Membership Inference Attacks on Machine Learning: A Survey (2103.07853v4)

Published 14 Mar 2021 in cs.LG and cs.CR

Abstract: Machine learning (ML) models have been widely applied to various applications, including image classification, text generation, audio recognition, and graph data analysis. However, recent studies have shown that ML models are vulnerable to membership inference attacks (MIAs), which aim to infer whether a data record was used to train a target model or not. MIAs on ML models can directly lead to a privacy breach. For example, via identifying the fact that a clinical record that has been used to train a model associated with a certain disease, an attacker can infer that the owner of the clinical record has the disease with a high chance. In recent years, MIAs have been shown to be effective on various ML models, e.g., classification models and generative models. Meanwhile, many defense methods have been proposed to mitigate MIAs. Although MIAs on ML models form a newly emerging and rapidly growing research area, there has been no systematic survey on this topic yet. In this paper, we conduct the first comprehensive survey on membership inference attacks and defenses. We provide the taxonomies for both attacks and defenses, based on their characterizations, and discuss their pros and cons. Based on the limitations and gaps identified in this survey, we point out several promising future research directions to inspire the researchers who wish to follow this area. This survey not only serves as a reference for the research community but also provides a clear description for researchers outside this research domain. To further help the researchers, we have created an online resource repository, which we will keep updated with future relevant work. Interested readers can find the repository at https://github.com/HongshengHu/membership-inference-machine-learning-literature.

Authors (6)
  1. Hongsheng Hu (27 papers)
  2. Zoran Salcic (4 papers)
  3. Lichao Sun (186 papers)
  4. Gillian Dobbie (21 papers)
  5. Philip S. Yu (592 papers)
  6. Xuyun Zhang (21 papers)
Citations (336)

Summary

  • The paper presents a comprehensive taxonomy categorizing membership inference attacks and defenses across various ML models.
  • It details methodologies from black-box to white-box attacks, emphasizing how overfitting increases vulnerability.
  • The study highlights practical defense strategies, including confidence score masking and differential privacy, to safeguard sensitive data.

An In-depth Analysis of "Membership Inference Attacks on Machine Learning: A Survey"

This essay provides a comprehensive review and analysis of the paper "Membership Inference Attacks on Machine Learning: A Survey," authored by Hongsheng Hu et al. The paper is pioneering in its consolidation of research on membership inference attacks (MIAs) against a variety of ML models and of the defenses proposed against such attacks. MIAs seek to determine whether a particular data record was included in a model's training set, thereby raising privacy concerns. The survey not only categorizes existing work but also charts a path for future research in this burgeoning field.

Overview of Membership Inference Attacks

The paper begins by establishing the vulnerability of various ML models, ranging from classifiers to generative models, to MIAs. These attacks infer, from a model's observable behavior, whether individual data records were included in its training dataset. The review categorizes MIAs along several dimensions: the type of learning model under attack, the knowledge available to the adversary, and the methodology of the attack. For instance, black-box attacks require only access to the model's outputs, while white-box attacks can also leverage model internals such as gradients.
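To make the black-box setting concrete, here is a minimal sketch of a metric-based attack that simply thresholds the target model's top confidence score. The `target_predict` callable and the threshold value are illustrative assumptions rather than an interface described in the paper; in practice the threshold would be calibrated on auxiliary data.

```python
import numpy as np

def confidence_threshold_attack(target_predict, records, threshold=0.9):
    """Minimal black-box, metric-based membership inference sketch.

    target_predict: callable returning a class-probability vector for a record
                    (hypothetical stand-in for query access to the target model).
    records: iterable of candidate data records.
    threshold: top-class confidence above which a record is guessed to be a member.
    """
    guesses = []
    for x in records:
        probs = np.asarray(target_predict(x), dtype=float)
        # Overfitted models tend to assign higher top-class confidence to
        # records they were trained on than to unseen records.
        guesses.append(bool(probs.max() >= threshold))
    return guesses
```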

Comprehensive Taxonomies and Methodologies

Crucially, the paper develops detailed taxonomies of both MIAs and potential defenses. The attacks reviewed include those on classification models and generative adversarial networks (GANs), whether driven by shadow models or by exploiting properties such as overfitting. Binary classifier-based and metric-based attacks, which exploit differences in the model's behavior on training versus non-training data, are also examined.
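As a rough illustration of the shadow-model idea, the sketch below trains several shadow classifiers on auxiliary data and then fits a single binary attack model on their prediction vectors labeled as member or non-member. It collapses the per-class attack models of the original shadow-training approach into one classifier for brevity, and it assumes the auxiliary data resembles the target model's training distribution; the specific model choices (random-forest shadows, logistic-regression attack model) are placeholders, not the survey's prescription.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def train_shadow_attack(shadow_X, shadow_y, n_shadows=5):
    """Sketch of a shadow-model membership inference attack.

    shadow_X, shadow_y: auxiliary data assumed to follow roughly the same
    distribution as the target model's training data. Returns an attack
    classifier mapping a prediction vector to a member/non-member guess.
    """
    attack_features, attack_labels = [], []
    for _ in range(n_shadows):
        # Split the auxiliary data into a shadow training half ("members")
        # and a held-out half ("non-members").
        X_in, X_out, y_in, _ = train_test_split(shadow_X, shadow_y, test_size=0.5)
        shadow = RandomForestClassifier().fit(X_in, y_in)
        # (Assumes every split contains all classes so probability vectors align.)
        for X, member in ((X_in, 1), (X_out, 0)):
            attack_features.append(shadow.predict_proba(X))
            attack_labels.append(np.full(len(X), member))
    # Binary attack model: prediction vector -> membership guess.
    return LogisticRegression(max_iter=1000).fit(
        np.vstack(attack_features), np.concatenate(attack_labels))
```

The attack model is then queried with the target model's prediction vector for a candidate record; a positive prediction is a membership guess.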

For defenses, the survey discusses several approaches such as:

  • Confidence Score Masking: Limits or distorts the confidence information in the model's output, for example by returning only the top-k classes or coarsely rounded probabilities (a minimal sketch follows this list).
  • Regularization: Attempts to minimize overfitting.
  • Differential Privacy: A robust method offering theoretical backing against MIAs at the potential cost of model performance.
  • Knowledge Distillation: Transfers knowledge from a teacher model trained on the private data to a student model, reducing the student's direct dependence on individual training records.
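The masking idea is the easiest of these to illustrate. The sketch below truncates a model's output to its top-k classes and rounds the remaining probabilities, removing much of the fine-grained confidence signal that metric-based attacks exploit; the function name and the particular top-k/rounding choices are illustrative assumptions, not settings taken from the survey.

```python
import numpy as np

def mask_confidences(probs, top_k=3, decimals=2):
    """Confidence score masking sketch: expose only the top-k classes,
    with probabilities rounded to a coarse precision, so that attackers
    observe less of the confidence signal that MIAs rely on."""
    probs = np.asarray(probs, dtype=float)
    top = np.argsort(probs)[::-1][:top_k]  # indices of the k largest scores
    return {int(i): round(float(probs[i]), decimals) for i in top}

# Example: a 10-class posterior collapses to something like {7: 0.62, 2: 0.21, 0: 0.1}.
```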

Theoretical and Practical Implications

In its theoretical discussion, the paper addresses the fundamental question "Why do MIAs work?" It attributes their success largely to overfitting and to intrinsic differences in how models treat training data versus unseen data. The paper challenges the community to rethink standard assumptions about model security and privacy disclosure.
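One common way this intuition is operationalized in the literature is a loss-threshold rule in the spirit of Yeom et al.: guess that a record is a member whenever its loss under the target model falls below the model's average training loss, so that a larger generalization gap translates directly into a more successful attack. The sketch below illustrates that rule; it is an example from this line of work, not a procedure prescribed by the survey, and it assumes the adversary can compute per-record losses and estimate the average training loss.

```python
import numpy as np

def loss_threshold_attack(per_record_losses, avg_train_loss):
    """Guess membership from losses: records whose loss under the target
    model falls below the model's average training loss are flagged as
    likely members. The wider the train/test loss gap (i.e., the more
    overfitted the model), the more accurate this simple rule becomes."""
    losses = np.asarray(per_record_losses, dtype=float)
    return losses < avg_train_loss  # boolean array: True = guessed member
```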

From a practical perspective, the implications of MIAs extend to potential data leaks in healthcare, finance, and other sensitive applications where privacy is paramount. As models are deployed in critical real-world applications, protecting against MIAs becomes integral to safeguarding user data privacy.

Future Directions

The authors critically assess current research gaps and propose several future directions:

  1. Reducing ML model overfitting without sacrificing utility remains a key challenge.
  2. Exploring MIAs on self-supervised and meta-learning models.
  3. Enhancing robustness against MIAs through innovative paradigms stemming from adversarial machine learning.
  4. Investigating the use of surrogate datasets via generative models as a means to protect against MIAs.
  5. Developing specific defenses for federated learning infrastructures, which raise unique privacy challenges.

Conclusion

This survey serves as a foundational reference for ongoing and future research on understanding and countering membership inference attacks. By synthesizing current methodologies and articulating a trajectory for future exploration, the paper contributes significantly to the dialogue on data privacy in machine learning. Its detailed taxonomies and comprehensive evaluations make it a valuable resource for researchers, equipping them with the insights needed to push forward the boundaries of privacy protection in AI.
