A Unified Framework for Model Editing
Abstract: ROME and MEMIT are largely believed to be two different model editing algorithms, with the major difference between them being the ability to perform batched edits. In this paper, we unify these two algorithms under a single conceptual umbrella, optimizing for the same goal, which we call the preservation-memorization objective. ROME uses an equality constraint to optimize this objective to perform one edit at a time, whereas MEMIT employs a more flexible least-square constraint that allows for batched edits. We generalize ROME and enable batched editing with equality constraint in the form of EMMET - an Equality-constrained Mass Model Editing algorithm for Transformers, a new batched memory-editing algorithm. EMMET can perform batched-edits up to a batch-size of 10,000, with very similar performance to MEMIT across multiple dimensions. With the introduction of EMMET, we truly unify ROME and MEMIT and show that both algorithms are equivalent in terms of their optimization objective, their abilities (singular and batched editing), their model editing performance and their limitations.
- James A Anderson. A simple neural network generating an interactive memory. Mathematical biosciences, 14(3-4):197–220, 1972.
- A brief review of hypernetworks in deep learning. arXiv preprint arXiv:2306.06955, 2023.
- Evaluating the ripple effects of knowledge editing in language models. arXiv preprint arXiv:2307.12976, 2023.
- Knowledge neurons in pretrained transformers. arXiv preprint arXiv:2104.08696, 2021.
- Editing factual knowledge in language models. arXiv preprint arXiv:2104.08164, 2021.
- Rebuilding rome: Resolving model collapse during sequential model editing. arXiv preprint arXiv:2403.07175, 2024.
- Model editing at scale leads to gradual and catastrophic forgetting. arXiv preprint arXiv:2401.07453, 2024.
- Teuvo Kohonen. Correlation matrix memories. IEEE transactions on computers, 100(4):353–359, 1972.
- Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems, 35:17359–17372, 2022a.
- Mass-editing memory in a transformer. arXiv preprint arXiv:2210.07229, 2022b.
- Fast model editing at scale. arXiv preprint arXiv:2110.11309, 2021.
- Memory-based model editing at scale. In International Conference on Machine Learning, pages 15817–15831. PMLR, 2022.
- Language models are unsupervised multitask learners. OpenAI blog, 1(8):9, 2019.
- Axiomatic attribution for deep networks. In International conference on machine learning, pages 3319–3328. PMLR, 2017.
- Massive editing for large language models via meta learning. arXiv preprint arXiv:2311.04661, 2023.
- Llama 2: Open foundation and fine-tuned chat models, 2023. URL https://arxiv. org/abs/2307.09288, 2023.
- GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model. https://github.com/kingoflolz/mesh-transformer-jax, May 2021.
- Editing large language models: Problems, methods, and opportunities. arXiv preprint arXiv:2305.13172, 2023.
- A comprehensive study of knowledge editing for large language models. arXiv preprint arXiv:2401.01286, 2024.
- Mquake: Assessing knowledge editing in language models via multi-hop questions. arXiv preprint arXiv:2305.14795, 2023.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.