Why some animal preferences do not transmit across models
Determine why certain animal preferences fail to transmit via subliminal learning in some language models when the student is trained on number sequences generated by a teacher that holds the corresponding preference.
References
We do not know why some animals are not transmitted by some models (see the paper's appendix on open-model transmission).
Source: Subliminal Learning: Language models transmit behavioral traits via hidden signals in data (2507.14805, Cloud et al., 20 Jul 2025), Section 7 (Discussion), Limitations