PMET: Precise Model Editing in a Transformer (2308.08742v6)
Abstract: Model editing techniques modify a small proportion of the knowledge in LLMs at relatively low cost and have demonstrated notable success. Existing methods assume that Transformer Layer (TL) hidden states are the values of the key-value memories of the Feed-Forward Network (FFN). They usually optimize the TL hidden states to memorize target knowledge and use them to update the FFN weights in LLMs. However, the information in TL hidden states flows from three sources: Multi-Head Self-Attention (MHSA), the FFN, and residual connections. Existing methods neglect the fact that TL hidden states contain information not specifically required by the FFN, and editing performance suffers as a result. To achieve more precise model editing, we analyze the hidden states of the MHSA and FFN and find that the MHSA encodes certain general knowledge-extraction patterns. This implies that the MHSA weights do not require updating when new knowledge is introduced. Based on these findings, we introduce PMET, which simultaneously optimizes the Transformer Component (TC, namely MHSA and FFN) hidden states, while using only the optimized FFN hidden states to precisely update the FFN weights. Our experiments demonstrate that PMET achieves state-of-the-art performance on both the COUNTERFACT and zsRE datasets. Our ablation experiments substantiate the effectiveness of our enhancements, further reinforcing the finding that the MHSA encodes certain general knowledge-extraction patterns and indicating that it stores only a small amount of factual knowledge. Our code is available at https://github.com/xpq-tech/PMET.
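The sketch below is a minimal, toy illustration (not the authors' implementation) of the two-step idea described in the abstract: first jointly optimize perturbations to both TC hidden states (MHSA and FFN) so the residual stream encodes the target knowledge, then discard the MHSA perturbation and use only the optimized FFN hidden state as the target value in a MEMIT-style closed-form least-squares update of the FFN down-projection weights. The tensor shapes, the surrogate MSE objective, and names such as `W_out` and `v_star` are assumptions for illustration; the real objective scores the target tokens under the full language model.

```python
# Toy sketch of the PMET editing idea, assuming a MEMIT-style rank-one update.
import torch

torch.manual_seed(0)
d_model = 16

# Frozen pre-edit hidden states at the subject's last token (toy stand-ins).
h_mhsa = torch.randn(d_model)      # MHSA output at the edited layer
h_ffn = torch.randn(d_model)       # FFN output at the edited layer
residual = torch.randn(d_model)    # incoming residual stream
target = torch.randn(d_model)      # residual state that encodes the new fact

# Step 1: jointly optimize additive perturbations on both TC hidden states.
delta_mhsa = torch.zeros(d_model, requires_grad=True)
delta_ffn = torch.zeros(d_model, requires_grad=True)
opt = torch.optim.Adam([delta_mhsa, delta_ffn], lr=5e-2)
for _ in range(500):
    opt.zero_grad()
    out = residual + (h_mhsa + delta_mhsa) + (h_ffn + delta_ffn)
    loss = torch.nn.functional.mse_loss(out, target)  # surrogate objective (assumption)
    loss.backward()
    opt.step()

# Step 2: discard delta_mhsa; use only the optimized FFN value to update the
# FFN down-projection so that W_out_new @ k == v_star (rank-one least squares).
v_star = (h_ffn + delta_ffn).detach()   # target FFN "value"
k = torch.randn(d_model)                # FFN "key" for the subject (toy)
W_out = torch.randn(d_model, d_model)   # FFN down-projection weights (toy)
C = torch.eye(d_model)                  # key covariance (identity here)

resid_v = v_star - W_out @ k
c_inv_k = torch.linalg.solve(C, k)
W_out_new = W_out + torch.outer(resid_v, c_inv_k) / (k @ c_inv_k)
assert torch.allclose(W_out_new @ k, v_star, atol=1e-4)
```

Only `W_out` changes; the MHSA weights are left untouched, matching the paper's finding that the MHSA encodes reusable knowledge-extraction patterns rather than the edited fact itself.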
Authors: Xiaopeng Li, Shasha Li, Shezheng Song, Jing Yang, Jun Ma, Jie Yu