MambaLRP: Explaining Selective State Space Sequence Models (2406.07592v2)

Published 11 Jun 2024 in cs.LG, cs.AI, and stat.ML

Abstract: Recent sequence modeling approaches using selective state space sequence models, referred to as Mamba models, have seen a surge of interest. These models allow efficient processing of long sequences in linear time and are rapidly being adopted in a wide range of applications such as LLMing, demonstrating promising performance. To foster their reliable use in real-world scenarios, it is crucial to augment their transparency. Our work bridges this critical gap by bringing explainability, particularly Layer-wise Relevance Propagation (LRP), to the Mamba architecture. Guided by the axiom of relevance conservation, we identify specific components in the Mamba architecture, which cause unfaithful explanations. To remedy this issue, we propose MambaLRP, a novel algorithm within the LRP framework, which ensures a more stable and reliable relevance propagation through these components. Our proposed method is theoretically sound and excels in achieving state-of-the-art explanation performance across a diverse range of models and datasets. Moreover, MambaLRP facilitates a deeper inspection of Mamba architectures, uncovering various biases and evaluating their significance. It also enables the analysis of previous speculations regarding the long-range capabilities of Mamba models.

Authors (4)

Farnoush Rezaei Jafari (3 papers)
Grégoire Montavon (50 papers)
Klaus-Robert Müller (167 papers)
Oliver Eberle (14 papers)

Citations (4)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/FarnoushRJ/status/1801321332019257417

https://twitter.com/FarnoushRJ/status/1867014762518942150

https://twitter.com/EberleOliver/status/1853517380682662205

https://twitter.com/EberleOliver/status/1801693542177456371

MambaLRP: Explaining Selective State Space Sequence Models (2406.07592v2)

Summary

Related Papers

Tweets