Data Selection for Transfer Unlearning (2405.10425v1)
Abstract: As deep learning models become larger and more data-hungry, there are growing ethical, legal, and technical concerns over the use of data: in practice, agreements on data use may change over time, rendering previously used training data impermissible for training purposes. These issues have driven increased attention to machine unlearning: removing "the influence of" a subset of training data from a trained model. In this work, we advocate for a relaxed definition of unlearning that does not address privacy applications but targets a scenario where a data owner withdraws permission to use their data for training purposes. In this context, we consider the important problem of *transfer unlearning*, where a pretrained model is transferred to a target dataset that contains some "non-static" data that may need to be unlearned in the future. We propose a new method that uses a mechanism for selecting relevant examples from an auxiliary "static" dataset, and finetunes on the selected data instead of the "non-static" target data, addressing all unlearning requests ahead of time. We also adapt a recent relaxed definition of unlearning to our problem setting and demonstrate that our approach is an exact transfer unlearner according to it, while being highly efficient (amortized). We find that our method outperforms the gold standard "exact unlearning" (finetuning on only the "static" portion of the target dataset) on several datasets, especially for small "static" sets, sometimes approaching an upper bound on test accuracy. We also analyze factors influencing the accuracy boost obtained by data selection.
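The abstract does not specify the selection mechanism itself. As a rough illustration of the general idea only, the sketch below selects auxiliary "static" examples by cosine similarity to the "non-static" target examples in a pretrained feature space, then forms a training set from the static target data plus the selected auxiliary data. All function names, the similarity criterion, and the hyperparameter `k_per_query` are assumptions for illustration, not the paper's method.

```python
# Hypothetical sketch of transfer unlearning via auxiliary data selection.
# The selection rule (nearest neighbours by cosine similarity in a pretrained
# feature space) is an assumption, not necessarily the paper's mechanism.
import torch
import torch.nn.functional as F


@torch.no_grad()
def embed(backbone, loader, device="cpu"):
    """Extract L2-normalized features for every example in a DataLoader.
    Assumes `backbone` maps a batch of inputs to a (batch, dim) feature tensor."""
    backbone.eval().to(device)
    feats = []
    for x, _ in loader:
        f = backbone(x.to(device))
        feats.append(F.normalize(f, dim=1).cpu())
    return torch.cat(feats)


def select_auxiliary(aux_feats, nonstatic_feats, k_per_query=10):
    """For each non-static target example, pick its k most similar auxiliary
    examples (cosine similarity of normalized features); return the union of
    selected auxiliary indices."""
    sims = nonstatic_feats @ aux_feats.T              # (n_nonstatic, n_aux)
    topk = sims.topk(k_per_query, dim=1).indices      # nearest auxiliary examples
    return torch.unique(topk.flatten())


# Usage (datasets and loaders are placeholders):
#   aux_idx = select_auxiliary(embed(backbone, aux_loader),
#                              embed(backbone, nonstatic_loader))
#   train_set = torch.utils.data.ConcatDataset(
#       [static_target_set, torch.utils.data.Subset(aux_set, aux_idx.tolist())])
#   # Finetune on `train_set`; since no non-static data is ever used for
#   # training, future unlearning requests for it are handled ahead of time.
```

Because the non-static data only guides selection and never enters the finetuning set, removing it later requires no retraining, which is what makes the approach an exact (and amortized-efficient) transfer unlearner under the relaxed definition described in the abstract.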
Authors: Nazanin Mohammadi Sepahvand, Vincent Dumoulin, Eleni Triantafillou, Gintare Karolina Dziugaite