Dynamics of the tokenizer-transplant vulnerability in multimodal and divergent-script settings
Investigate the dynamics of the shared-basis tokenizer transplant vulnerability in multimodal models and in languages with extremely divergent scripts by determining whether and how a single engineered breaker token remains inert in the donor model yet becomes a high-salience trigger in the base model after transplant via coefficient reuse.
Sponsor
References
Finally, our evaluation is currently bounded to text-based LLMs; exploring the dynamics of this vulnerability in multimodal contexts or extremely divergent script families remains an open avenue for future research.
— The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition
(2601.00065 - Liu et al., 31 Dec 2025) in Limitations