Uncertain side effects of localized knowledge editing in LLMs
Characterize and quantify the side effects induced by knowledge editing techniques that modify localized components of Large Language Models, given the lack of clarity about where knowledge is stored and how edits propagate within model internals.
Sponsor
References
However, the side effects are unclear as the underlying LLM mechanisms still need to be clarified [79].
— Towards Incremental Learning in Large Language Models: A Critical Review
(2404.18311 - Jovanovic et al., 28 Apr 2024) in Section 2.1 (Continual Learning) – Knowledge Editing subsection