Dice Question Streamline Icon: https://streamlinehq.com

Understanding BIF Sensitivity to SGLD Hyperparameters and Sampling Budget

Characterize and quantify how the performance of local Bayesian influence function estimation depends on the stochastic gradient Langevin dynamics hyperparameters—step size ε, localization strength γ, inverse temperature β—and on the total number of posterior draws, and derive guidelines or guarantees that explain and control this sensitivity.

Information Square Streamline Icon: https://streamlinehq.com

Background

The local BIF relies on SGLD-based sampling of the localized posterior to estimate covariances. While the method avoids Hessian inversion and is architecture-agnostic, its practical accuracy and efficiency depend on tuning several SGLD hyperparameters and the number of samples collected.

The authors report that BIF performance is sensitive to these choices and explicitly note that this dependence is not fully understood, motivating a formal analysis of hyperparameter-sensitivity and sampling-budget effects to support reliable deployment.

References

Additionally, the method's performance is sensitive to the hyperparameters of the SGLD sampler (ε, γ, β) and the total number of posterior draws, and this dependence is still not fully understood.

Bayesian Influence Functions for Hessian-Free Data Attribution (2509.26544 - Kreer et al., 30 Sep 2025) in Discussion — Limitations and practical trade-offs