
SD-KDE: Score-Debiased Kernel Density Estimation (2504.19084v2)

Published 27 Apr 2025 in cs.LG and stat.ML

Abstract: We propose a novel method for density estimation that leverages an estimated score function to debias kernel density estimation (SD-KDE). In our approach, each data point is adjusted by taking a single step along the score function with a specific choice of step size, followed by standard KDE with a modified bandwidth. The step size and modified bandwidth are chosen to remove the leading order bias in the KDE. Our experiments on synthetic tasks in 1D, 2D and on MNIST, demonstrate that our proposed SD-KDE method significantly reduces the mean integrated squared error compared to the standard Silverman KDE, even with noisy estimates in the score function. These results underscore the potential of integrating score-based corrections into nonparametric density estimation.

Summary

Essay on "Score-Debiased Kernel Density Estimation"

This paper introduces a novel approach to density estimation, termed Score-Debiased Kernel Density Estimation (SD-KDE), which leverages score functions to significantly reduce the bias inherent in classical kernel density estimators (KDEs). Traditional KDE methods face a well-known bias-variance trade-off in the choice of kernel bandwidth: a larger bandwidth reduces variance in the estimate but increases bias, and vice versa. This trade-off is often suboptimal, particularly for highly variable or complex distributions.

The authors introduce a mechanism to incorporate the score function, the gradient of the log-density, into KDE to mitigate this trade-off. Specifically, each data point is adjusted by a small step along the estimated score function, effectively sharpening the sample distribution. This transformation is followed by KDE with a modified bandwidth, reducing the leading-order bias in the density estimate.
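As a minimal sketch of the two-step procedure just described, assuming a Gaussian kernel and tying the step size to the bandwidth as delta = h^2/2 (an illustrative choice; the paper derives its own optimal constants, which are not reproduced here):

```python
import numpy as np

def sd_kde(x, score, h, delta, grid):
    """Score-debiased KDE sketch: shift each sample one step along the
    estimated score, then run a standard Gaussian KDE on the result."""
    x_shift = x + delta * score(x)                  # one step along the score
    d = (grid[:, None] - x_shift[None, :]) / h      # pairwise scaled differences
    return np.exp(-0.5 * d**2).mean(axis=1) / (h * np.sqrt(2 * np.pi))

# Toy run on N(0, 1), whose true score is s(x) = -x.
rng = np.random.default_rng(0)
x = rng.standard_normal(500)
grid = np.linspace(-4, 4, 201)
h = 0.3  # bandwidth (assumed for illustration, not the paper's optimum)
est = sd_kde(x, lambda t: -t, h, delta=h**2 / 2, grid=grid)
```

The resulting `est` is a density estimate on `grid`; with an accurate score, the shift slightly sharpens the sample cloud, counteracting the kernel's leading-order smoothing bias.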

Methodology and Theoretical Contributions

The core of the proposed SD-KDE algorithm lies in its ability to adjust sample data points along the direction of the score function before applying KDE with an optimally modified bandwidth and step size. The authors provide a comprehensive theorem for selecting the asymptotically optimal bandwidth and step size, achieving a mean integrated squared error (MISE) convergence rate of $O(n^{-8/(d+8)})$, significantly improving upon the classical KDE's rate of $O(n^{-4/(d+4)})$. This result marks a substantial theoretical advancement in non-parametric density estimation.
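To make the improvement concrete, instantiating the two rates at low dimensions (simple arithmetic on the exponents) gives:

```latex
% d = 1:  SD-KDE: n^{-8/9} \approx n^{-0.89}   vs.  KDE: n^{-4/5} = n^{-0.80}
% d = 2:  SD-KDE: n^{-8/10} = n^{-0.80}        vs.  KDE: n^{-4/6} \approx n^{-0.67}
\underbrace{O\bigl(n^{-8/(d+8)}\bigr)}_{\text{SD-KDE}}
\quad \text{vs.} \quad
\underbrace{O\bigl(n^{-4/(d+4)}\bigr)}_{\text{standard KDE}}
```

In one dimension, SD-KDE thus attains the rate a standard KDE would only reach in a strictly easier problem, and the gap widens as $n$ grows.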

Experimental Validation

Empirical evaluations reinforce the theoretical findings. Experiments conducted on synthetic datasets, including one-dimensional and two-dimensional tasks, alongside the MNIST dataset, demonstrate that SD-KDE outperforms the standard KDE approach. Even when using noisy score estimates derived from diffusion models, the SD-KDE provides superior performance, indicating robustness against inaccuracies in score estimation. Notably, the experiments reveal that the debiasing effect remains strong despite the noise in score approximation, suggesting practical applicability beyond ideal circumstances.
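A toy version of the 1D comparison can be run with the true score of a standard normal. The Silverman bandwidth is the standard rule of thumb; the step size and the reuse of the same bandwidth for both estimators are simplifying assumptions rather than the paper's exact setup:

```python
import numpy as np

def gauss_kde(x, h, grid):
    # Standard KDE with a Gaussian kernel, evaluated on `grid`.
    d = (grid[:, None] - x[None, :]) / h
    return np.exp(-0.5 * d**2).mean(axis=1) / (h * np.sqrt(2 * np.pi))

rng = np.random.default_rng(1)
n = 2000
x = rng.standard_normal(n)
grid = np.linspace(-4, 4, 401)
true_pdf = np.exp(-0.5 * grid**2) / np.sqrt(2 * np.pi)

h = 1.06 * x.std() * n ** (-1 / 5)  # Silverman's rule for a 1D Gaussian kernel
plain = gauss_kde(x, h, grid)

# Score-debiased variant: shift samples along the true score (-x for N(0,1)),
# then apply KDE. Step size and bandwidth here are illustrative assumptions,
# not the paper's exact constants.
delta = h**2 / 2
debiased = gauss_kde(x + delta * (-x), h, grid)

# Integrated squared error against the known density (Riemann sum).
dx = grid[1] - grid[0]
ise_plain = float(np.sum((plain - true_pdf) ** 2) * dx)
ise_debiased = float(np.sum((debiased - true_pdf) ** 2) * dx)
```

In runs like this, the debiased estimate typically attains a somewhat lower integrated squared error, since shifting samples toward the mode offsets the kernel's tendency to over-smooth the peak.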

Implications and Future Directions

The proposed SD-KDE method offers important insights with practical and theoretical implications. Practically, this method provides an enhanced framework for density estimation tasks, showing promise in fields such as anomaly detection, clustering, and data visualization, where accurate density estimation is crucial. Theoretically, the work illustrates the potential of integrating score-based adjustments within non-parametric estimation frameworks, thereby bridging the gap between score-based generative modeling and sample-based density estimation.

Looking forward, a promising direction lies in exploring multi-step and higher-order discretization approaches for further debiasing, which may incur additional computational cost but also improve estimation accuracy. Furthermore, the SD-KDE framework could be extended to more complex generative models capable of providing more accurate score estimates, thereby enhancing the efficacy and scope of the debiasing approach.

In summary, this paper makes a significant contribution to the field of density estimation by proposing a novel method that effectively reduces bias while remaining computationally feasible. It paves the way for future research that could leverage similar score functions to combat the limitations of traditional density estimation techniques.