Characterizing additional non-lexical pathways by which VA steering modulates behavior
Characterize the mechanisms, beyond lexical mediation, through which steering along the valence-arousal (VA) subspace affects large language model outputs. In particular, identify whether and how higher-level planning processes and attention patterns mediate VA-induced changes in token probabilities and downstream behaviors.
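One way to start probing the attention pathway is to compare attention distributions with and without a steering offset. The sketch below is illustrative only and is not the method of the cited paper: it assumes a single causal attention head with hypothetical projection matrices `w_q` and `w_k`, a hypothetical steering `direction` added uniformly to every position's hidden state, and per-position Jensen-Shannon divergence as the measure of attention shift. All function names are invented for this example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax; masked (-inf) entries become exactly 0.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention_weights(hidden, w_q, w_k):
    # Single-head causal attention weights, shape (seq, seq).
    q = hidden @ w_q
    k = hidden @ w_k
    scores = q @ k.T / np.sqrt(q.shape[-1])
    future = np.triu(np.ones(scores.shape, dtype=bool), k=1)
    scores = np.where(future, -np.inf, scores)
    return softmax(scores, axis=-1)

def js_divergence(p, q, eps=1e-12):
    # Row-wise Jensen-Shannon divergence in nats (upper bound: ln 2).
    m = 0.5 * (p + q)

    def kl(a, b):
        return np.sum(a * (np.log(a + eps) - np.log(b + eps)), axis=-1)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def attention_shift(hidden, direction, alpha, w_q, w_k):
    # Assumption: steering adds alpha * unit(direction) to every position.
    unit = direction / np.linalg.norm(direction)
    base = attention_weights(hidden, w_q, w_k)
    steered = attention_weights(hidden + alpha * unit, w_q, w_k)
    return js_divergence(base, steered)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    seq, d_model, d_head = 6, 16, 8
    hidden = rng.normal(size=(seq, d_model))      # stand-in hidden states
    w_q = rng.normal(size=(d_model, d_head))      # hypothetical query projection
    w_k = rng.normal(size=(d_model, d_head))      # hypothetical key projection
    direction = rng.normal(size=d_model)          # stand-in for a VA direction
    for alpha in (0.0, 1.0, 4.0):
        shift = attention_shift(hidden, direction, alpha, w_q, w_k)
        print(alpha, np.round(shift, 4))
```

A nonzero shift in a real model would show only that steering changes where a head attends, not that the change mediates the behavioral effect. Establishing mediation would require an intervention, for example holding attention patterns at their unsteered values while steering and checking whether the output change persists.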
References
VA steering may also affect higher-level planning or attention patterns that we have not measured. Characterizing these additional pathways remains an important open problem.
— Valence-Arousal Subspace in LLMs: Circular Emotion Geometry and Multi-Behavioral Control
(2604.03147 - Sun et al., 3 Apr 2026), Section: Limitations