Recommendation Unlearning via Influence Function (2307.02147v4)
Abstract: Recommendation unlearning is an emerging task that lets users erase unusable data (e.g., some historical behaviors) from a well-trained recommender model. Existing methods process unlearning requests by fully or partially retraining the model after removing the unusable data. However, these methods are impractical because full retraining is computationally expensive and partial retraining is highly likely to damage performance. A desirable recommendation unlearning method should therefore obtain a model similar to the fully retrained one in a more efficient manner, i.e., achieve complete, efficient, and harmless unlearning. In this work, we propose a new Influence Function-based Recommendation Unlearning (IFRU) framework, which efficiently updates the model without retraining by estimating the influence of the unusable data on the model via the influence function. Because recent recommender models use historical data to construct both the optimization loss and the computational graph (e.g., neighborhood aggregation), IFRU jointly estimates the direct influence of unusable data on the optimization loss and the spillover influence on the computational graph to pursue complete unlearning. Furthermore, we propose an importance-based pruning algorithm to reduce the cost of the influence function. IFRU is harmless and applicable to mainstream differentiable models. Extensive experiments demonstrate that IFRU achieves more than 250 times acceleration compared to retraining-based methods, with recommendation performance comparable to full retraining. Code is available at https://github.com/baiyimeng/IFRU.
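To make the idea of updating a model without retraining concrete, the following is a minimal sketch of a generic one-step influence-style update on a toy ridge regression. It is not the IFRU implementation: it covers only the direct influence on the loss, and omits the spillover influence on the computational graph and the importance-based pruning. All names (`solve`, `hessian`, `fit`, `unlearn`) and the synthetic data are assumptions made for illustration. The update adds the summed gradients of the removed samples, preconditioned by the inverse Hessian of the remaining data, to the trained parameters.

```python
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def hessian(X, lam):
    """Hessian of 0.5*sum((x.theta - y)^2) + 0.5*lam*|theta|^2, i.e. X^T X + lam*I."""
    d = len(X[0])
    H = [[lam if i == j else 0.0 for j in range(d)] for i in range(d)]
    for x in X:
        for i in range(d):
            for j in range(d):
                H[i][j] += x[i] * x[j]
    return H

def fit(X, y, lam):
    """Exact ridge minimizer: solve (X^T X + lam*I) theta = X^T y."""
    d = len(X[0])
    rhs = [sum(x[i] * t for x, t in zip(X, y)) for i in range(d)]
    return solve(hessian(X, lam), rhs)

def unlearn(theta, X, y, removed, lam):
    """One influence-style step: theta + H_keep^{-1} * (sum of removed-sample gradients)."""
    d = len(theta)
    keep = [i for i in range(len(X)) if i not in removed]
    g = [0.0] * d
    for i in removed:
        resid = sum(a * b for a, b in zip(X[i], theta)) - y[i]
        for j in range(d):
            g[j] += resid * X[i][j]
    delta = solve(hessian([X[i] for i in keep], lam), g)
    return [t + dl for t, dl in zip(theta, delta)]

# Synthetic data (hypothetical): 50 samples, 3 features, 3 samples to forget.
random.seed(0)
X = [[random.gauss(0, 1) for _ in range(3)] for _ in range(50)]
y = [2 * x[0] - x[1] + 0.5 * x[2] + random.gauss(0, 0.1) for x in X]
removed = {3, 17, 42}

theta = fit(X, y, lam=0.1)
theta_unlearned = unlearn(theta, X, y, removed, lam=0.1)

# Reference: retrain from scratch on the remaining samples.
keep = [i for i in range(50) if i not in removed]
theta_retrained = fit([X[i] for i in keep], [y[i] for i in keep], lam=0.1)
gap = max(abs(a - b) for a, b in zip(theta_unlearned, theta_retrained))
```

Because the ridge objective is quadratic, this single step coincides with retraining up to floating-point error, so `gap` should be negligible. For non-quadratic recommender losses the same update is only a local approximation, and forming or inverting the Hessian becomes the dominant cost, which is the kind of cost the abstract's pruning step is meant to reduce.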
Authors:
- Yang Zhang
- Zhiyu Hu
- Yimeng Bai
- Fuli Feng
- Jiancan Wu
- Qifan Wang