Optimal dependence on k in Gaussian approximation rates for k-PNN random forests
Determine the optimal dependence on the terminal-node parameter k in the multivariate Gaussian approximation bounds for the k-potential nearest neighbor (k-PNN) random-forest estimator under Poisson sampling. Specifically, either prove that the current k^τ dependence in the error bounds is unavoidable by establishing matching lower bounds, or develop methods that improve the k-dependence beyond what is achieved using region-based stabilization and Stein's method.
References
"Hence, the $k\tau$ term cannot be further improved using the current proof technique (i.e., using region-based stabilization and Stein's method). Resolving this question of optimal $k$ dependency, either by demonstrating that the order of $k$ is necessary or by improving the $k$ dependency is thus an important open question."