Explaining BIF’s Advantage over EK-FAC in Small-Data Regimes
Determine why the local Bayesian influence function (BIF) achieves a higher Linear Datamodelling Score (LDS) than EK-FAC when the retrain subset size is small. In particular, ascertain whether the advantage arises because BIF captures higher-order sensitivity of the loss landscape, or because EK-FAC’s Kronecker-factored curvature model introduces approximation errors.
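For readers unfamiliar with the metric, the following is a minimal sketch of how a Linear Datamodelling Score is commonly computed in the data-attribution literature: for one query example, the attribution scores are summed over each retrain subset to get a predicted output, and the score is the Spearman rank correlation between those predictions and the outputs of models actually retrained on the subsets. It assumes the source paper uses this standard definition. The function names (`average_ranks`, `spearman`, `lds_for_query`) and the toy numbers are hypothetical and chosen only for illustration.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks, assigning tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        avg = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = avg
        start = end + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(vx * vy)


def lds_for_query(attributions, subsets, retrained_outputs):
    """LDS for a single query example.

    attributions: per-training-example scores for this query (e.g. from BIF or EK-FAC).
    subsets: list of index lists, one per retrain subset.
    retrained_outputs: measured query output of the model retrained on each subset.
    """
    # Linear datamodel prediction: sum the attribution scores over each subset.
    predicted = [sum(attributions[i] for i in subset) for subset in subsets]
    return spearman(predicted, retrained_outputs)


# Toy illustration with made-up numbers (not results from the paper).
attributions = [0.5, -0.2, 0.1, 0.3]
subsets = [[0, 1], [1, 2], [2, 3], [0, 3]]
retrained_outputs = [1.0, 0.2, 1.5, 2.0]
lds = lds_for_query(attributions, subsets, retrained_outputs)
```

In practice the score is averaged over many query examples, and the small-data regime in question corresponds to subsets that contain only a small fraction of the training set.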
References
"It remains unclear why the BIF outperforms EK-FAC in the small-data regime."
Source: Bayesian Influence Functions for Hessian-Free Data Attribution (arXiv:2509.26544, Kreer et al., 30 Sep 2025), Appendix: Retraining Experiments, LDS Results