
Machine Unlearning of Features and Labels (2108.11577v4)

Published 26 Aug 2021 in cs.LG and cs.CR

Abstract: Removing information from a machine learning model is a non-trivial task that requires to partially revert the training process. This task is unavoidable when sensitive data, such as credit card numbers or passwords, accidentally enter the model and need to be removed afterwards. Recently, different concepts for machine unlearning have been proposed to address this problem. While these approaches are effective in removing individual data points, they do not scale to scenarios where larger groups of features and labels need to be reverted. In this paper, we propose the first method for unlearning features and labels. Our approach builds on the concept of influence functions and realizes unlearning through closed-form updates of model parameters. It enables to adapt the influence of training data on a learning model retrospectively, thereby correcting data leaks and privacy issues. For learning models with strongly convex loss functions, our method provides certified unlearning with theoretical guarantees. For models with non-convex losses, we empirically show that unlearning features and labels is effective and significantly faster than other strategies.

Citations (139)

Summary

  • The paper proposes an influence-function-based approach that efficiently unlearns features and labels from ML models, with certified guarantees for strongly convex models.
  • It demonstrates empirical performance gains on non-convex models, notably reducing computational cost compared to full retraining.
  • The method addresses privacy concerns by enabling group-level unlearning of features and labels, supporting compliance with the GDPR and similar regulations.

Overview of "Machine Unlearning of Features and Labels"

The paper "Machine Unlearning of Features and Labels" presents a novel approach in the domain of privacy protection in ML. It addresses the critical need to remove inadvertently captured sensitive information in ML models, which is increasingly important due to privacy regulations like GDPR. The authors propose techniques for unlearning features and labels—a task where existing methods are inefficient, especially when dealing with multiple or distributed data entries.

The core contribution of the paper is a method that uses influence functions to retrospectively adjust a model's parameters via closed-form updates. Unlike prior approaches that focus on removing individual data points, this method scales to scenarios requiring the unlearning of larger groups of features and labels. For models with strongly convex loss functions, the approach provides certified unlearning with theoretical guarantees; for non-convex models, empirical results demonstrate effectiveness and substantial speedups over full retraining.
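To make the closed-form, influence-function-style updates concrete, the sketch below is a minimal illustration assuming an L2-regularized logistic regression implemented with NumPy: a set of training points is replaced by corrected versions (for instance, with a leaked feature zeroed out or a label fixed), and the parameters are adjusted either with a first-order gradient step or a second-order inverse-Hessian step. The function names and the toy setup are assumptions for illustration, not the authors' reference implementation.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def grad(theta, X, y, lam):
    """Gradient of the L2-regularized logistic loss, summed over the given points."""
    p = sigmoid(X @ theta)
    return X.T @ (p - y) + lam * theta

def hessian(theta, X, lam):
    """Hessian of the L2-regularized logistic loss over the full training set."""
    p = sigmoid(X @ theta)
    w = p * (1.0 - p)                                  # per-sample curvature weights
    return (X * w[:, None]).T @ X + lam * np.eye(X.shape[1])

def unlearn_update(theta, X_full, lam, X_old, y_old, X_new, y_new, order=2, tau=0.01):
    """Closed-form update replacing the points (X_old, y_old) by their
    corrected versions (X_new, y_new) without retraining from scratch.

    order=1: first-order (gradient) update with step size tau.
    order=2: second-order (inverse-Hessian) update.
    """
    # Gradient difference induced by swapping the old points for corrected ones;
    # the regularizer cancels in this difference, hence lam=0 here.
    delta = grad(theta, X_new, y_new, 0.0) - grad(theta, X_old, y_old, 0.0)
    if order == 1:
        return theta - tau * delta
    H = hessian(theta, X_full, lam)
    return theta - np.linalg.solve(H, delta)
```

For large models, the inverse-Hessian product would typically be approximated, e.g. with conjugate gradients or stochastic Hessian-vector products, rather than formed explicitly.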

Key Findings

  • Certified Unlearning for Convex Models: For models with strongly convex losses, such as logistic regression and SVMs, the authors prove that their method performs certified unlearning. This is realized through first-order and second-order closed-form updates whose gradient residuals remain bounded after unlearning, guaranteeing that the updated model stays provably close to one retrained on the corrected data (see the sketch after this list).
  • Empirical Performance with Non-Convex Models: For non-convex losses, such as those of deep neural networks, theoretical guarantees are harder to establish. Nonetheless, empirical evaluations show that the method removes the targeted information effectively, with a significant reduction in computational cost and runtime while maintaining performance comparable to full retraining.
  • Efficient Handling of Privacy Requests: By framing unlearning at the granularity of features and labels, the authors avoid the inefficiencies of instance-based unlearning methods and of costly full retraining. The method reduces the risk of privacy leakage by efficiently updating only the affected components of the model.
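The certification result for strongly convex losses hinges on the gradient residual: the norm of the gradient of the corrected training objective evaluated at the updated parameters. If this residual stays below a chosen budget, the update is provably close to full retraining on the corrected data. The hedged check below continues the toy logistic-regression sketch above; the threshold `epsilon` and the noise calibration used in the paper's certified variant are assumptions left abstract here.

```python
def gradient_residual(theta_updated, X_corrected, y_corrected, lam):
    """Norm of the gradient of the corrected, regularized objective at the
    updated parameters. An exact minimizer of the corrected problem has zero
    residual; for strongly convex losses, a small residual bounds the distance
    to the model one would obtain by retraining from scratch."""
    return np.linalg.norm(grad(theta_updated, X_corrected, y_corrected, lam))

# Illustrative usage: accept the closed-form update only if the residual
# stays within the certification budget, otherwise fall back to retraining.
# residual = gradient_residual(theta_new, X_corr, y_corr, lam=1.0)
# if residual > epsilon:
#     retrain_from_scratch()
```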

Implications

The implications of this research are twofold. Practically, the introduction of a scalable and efficient unlearning mechanism offers ML practitioners a viable tool for addressing privacy demands, significantly reducing the burden associated with model retraining. Theoretically, the work broadens the scope of machine unlearning from a predominantly instance-level focus to include features and labels, paving the way for further exploration into more granular unlearning methods in complex environments.

Future Directions

The paper outlines several promising avenues for future research. One potential area is extending certified unlearning to non-convex models, aiming to establish comparable theoretical guarantees. Additionally, refining the influence-function framework to further reduce computational overhead and improve the accuracy of influence estimates remains an open challenge, and progress there could enhance unlearning efficiency on high-dimensional datasets or under stricter privacy constraints.

Overall, this paper contributes a significant advancement in privacy-preserving machine learning, offering a solution that balances efficiency, efficacy, and theoretical rigor in the unlearning of sensitive ML features and labels.
