Validation of BidirLM on non‑transformer causal architectures
Validate the applicability of the BidirLM adaptation and composition framework for transforming causal decoder language models into bidirectional encoders to non-transformer causal architectures, specifically state-space models such as Mamba and Gated Delta Networks.
References
Finally, validating our framework on non-transformer causal architectures, such as state-space models~\citep{gu2024mambalineartimesequencemodeling, yang2025gateddeltanetworksimproving}, remains an open question.
— BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs
(2604.02045 - Boizard et al., 2 Apr 2026) in Future Work — Additional mitigation techniques and model architectures