Cause of Llama 3 70B’s superior SDF alignment
Determine why Llama 3 70B shows the highest implanted-fact alignment under Synthetic Document Finetuning (SDF). Either belief implantation is intrinsically easier in that model family, or the result reflects method-specific overfitting, since the SDF pipeline was developed by iterating against that particular model.
References
Notably, we developed our SDF pipeline by iterating against Llama 3 70B, which exhibits the highest implanted fact alignment according to our metrics. It is unclear whether this is due to it being easier to implant facts in this model generally or because we iterated our method against this particular model.
— Believe It or Not: How Deeply do LLMs Believe Implanted Facts?
(2510.17941 - Slocum et al., 20 Oct 2025) in Appendix, Section “Will SDF continue to work well on future models?”, Subsection “SDF is robust to increased model size”