Nearest-Neighbor Adaptive Rejection Sampling

Updated 10 June 2026

Nearest-Neighbor Adaptive Rejection Sampling (NNARS) is an adaptive Monte Carlo algorithm that samples unknown Hölder-smooth densities on [0,1]^d using grid-based piecewise-constant estimators.
It iteratively refines its proposal envelope based solely on observed density values, ensuring a near-optimal rejection rate under logarithmic factors.
Empirical and theoretical analyses demonstrate NNARS's efficiency in moderate dimensions, though its grid-based approach faces exponential complexity as dimensionality increases.

Nearest-Neighbor Adaptive Rejection Sampling (NNARS) is an adaptive Monte Carlo sampling algorithm designed for efficiently drawing independent samples from densities $f$ on $[0,1]^d$ that can be evaluated at any point but are otherwise unknown and potentially expensive to compute. NNARS achieves a minimax near-optimal rejection rate within logarithmic factors under Hölder smoothness assumptions on $f$ . It advances the adaptive rejection sampling (ARS) literature by providing both tight theoretical guarantees and a practical, grid-based piecewise-constant proposal mechanism based on approximate nearest-neighbor estimation (Achdou et al., 2018).

1. Problem Setting and Motivation

The task is to sample from an unknown density $f$ on $[0,1]^d$ , assumed to satisfy $0 < c_f \leq f(x) \leq 1+H$ and a Hölder condition $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ for $0 < s \leq 1$ and constant $H$ . Evaluating $f(x)$ is computationally costly. Standard rejection sampling draws from a proposal $[0,1]^d$ 0 and accepts $[0,1]^d$ 1 with probability $[0,1]^d$ 2, requiring a tight envelope $[0,1]^d$ 3 everywhere to be efficient; otherwise, the acceptance rate can be prohibitively low. NNARS aims to automatically construct and refine such envelopes adaptively, based solely on observed $[0,1]^d$ 4 values, without strong parametric assumptions or requiring tractable decompositions.

2. Algorithmic Structure and Envelope Construction

NNARS proceeds in $[0,1]^d$ 5 rounds. In each round $[0,1]^d$ 6:

A piecewise-constant proposal density $[0,1]^d$ 7 over $[0,1]^d$ 8 and associated rejection constant $[0,1]^d$ 9 define the envelope $f$ 0.
$f$ 1 candidate points are drawn using standard rejection sampling (RSS) with $f$ 2. Accepted points are retained, and all proposals with their $f$ 3-values are collected into the data set $f$ 4.
The algorithm constructs a new histogram estimator $f$ 5 using an approximate nearest-neighbor rule: $f$ 6 is partitioned by an $f$ 7-grid with $f$ 8-dependent resolution. For each $f$ 9, its nearest grid cell center $f$ 0 is found, and the closest $f$ 1-evaluated sample to that center sets $f$ 2.
A Hölder-based estimation error bound $f$ 3 is computed to ensure $f$ 4. The new envelope is defined as $f$ 5, so $f$ 6 everywhere. The proposal is then $f$ 7 and $f$ 8.

Step	Key Operation	Section Reference
Envelope update	Grid-based nearest-neighbor + error margin	Envelope Construction
Proposal sampling	Piecewise-constant on adaptive grid	Implementation Details
Acceptance check	$f$ 9 with $[0,1]^d$ 0	Algorithm Description

The procedure iteratively refines the envelope as more $[0,1]^d$ 1-evaluations are acquired, shrinking the rejection constant and focusing sampling where $[0,1]^d$ 2 is large.

3. Theoretical Guarantees and Minimax Optimality

The formal minimax risk is

$[0,1]^d$ 3

where $[0,1]^d$ 4 is the number of density evaluations used minus the number of accepted samples, given algorithm $[0,1]^d$ 5.

Lower bound (Theorem 4.1): For $[0,1]^d$ 6 large enough,

$[0,1]^d$ 7

for some constant $[0,1]^d$ 8.

Upper bound for NNARS (Theorem 3.1): For $[0,1]^d$ 9,

$0 < c_f \leq f(x) \leq 1+H$ 0

with $0 < c_f \leq f(x) \leq 1+H$ 1 depending polynomially on $0 < c_f \leq f(x) \leq 1+H$ 2, and $0 < c_f \leq f(x) \leq 1+H$ 3.

Thus, NNARS is minimax-near-optimal for Hölder densities, matching the lower bound up to $0 < c_f \leq f(x) \leq 1+H$ 4 factors. This theoretical regime covers general multivariate densities without strong structural assumptions (Achdou et al., 2018).

4. Implementation Details and Computational Complexity

The nearest-neighbor histogram is implemented via a cubic grid of side length $0 < c_f \leq f(x) \leq 1+H$ 5. Each cell center in the grid stores the index of the closest sampled point. As new samples are added, updates affect only adjacent cells.

Sampling from $0 < c_f \leq f(x) \leq 1+H$ 6 consists of:

Choosing a cell $0 < c_f \leq f(x) \leq 1+H$ 7 with probability proportional to $0 < c_f \leq f(x) \leq 1+H$ 8 (per-cell value times volume).
Sampling a point uniformly within that cell.
Memory: $0 < c_f \leq f(x) \leq 1+H$ 9 for storing all sample points and their grid associations.
Envelope update per round: $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 0, reducible with spatial hashing; total $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 1.
Sampling a proposal: $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 2 for cell choice, $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 3 for within-cell draw.
Overall runtime: $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 4 (Achdou et al., 2018).

5. Empirical Comparisons and Sensitivity

Experiments benchmark NNARS against Pure Rejection Sampling (PRS), OS*/A* samplers, and Simple RS:

For uni- and multi-modal densities with sharp peaks (e.g., $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 5), NNARS achieves acceptance rates comparable to the best algorithms when no special structure is known; as $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 6 increases, performance degrades in line with PRS.
On high-dimensional product sine densities, NNARS outperforms all baselines for $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 7 to $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 8; OS*/A* samplers fail when decomposition structure is not available.
For $|f(x)-f(y)| \leq H\|x-y\|_\infty^s$ 9, acceptance rate approaches the asymptotic $0 < s \leq 1$ 0 regime after approximately $0 < s \leq 1$ 1K density evaluations, with variance stabilizing at similar sample sizes.
On two-dimensional real forest fire data, NNARS achieves acceptance rates near $0 < s \leq 1$ 2, significantly above PRS ( $0 < s \leq 1$ 3) and Simple RS ( $0 < s \leq 1$ 4) (Achdou et al., 2018).

These results demonstrate the robustness of NNARS, particularly in moderate to high dimensions under mild regularity.

6. Limitations, Parameter Choices, and Future Directions

NNARS requires specifying Hölder parameters $0 < s \leq 1$ 5 and lower bound $0 < s \leq 1$ 6 on $0 < s \leq 1$ 7. Setting $0 < s \leq 1$ 8 conservatively (below the true exponent) merely increases the envelope margin and worsens rates logarithmically. $0 < s \leq 1$ 9 can be chosen as small as $H$ 0 without affecting guarantees.

The grid-based piecewise-constant proposal incurs exponential complexity in $H$ 1, so practical feasibility is limited to $H$ 2. Data structures such as kd-trees or spatial hashing can extend applicability to higher dimensions. A plausible implication is that further algorithmic advances are necessary for scaling NNARS to very high-dimensional problems.

Open research directions include:

Extending the approach to densities that vanish (dropping the uniform lower bound $H$ 3).
Adapting to higher-order smoothness $H$ 4 via polynomial envelopes instead of piecewise constant.
Efficient nearest-neighbor updates in high dimension.
Data-driven or adaptive estimation for $H$ 5 and grid resolution, removing reliance on prior knowledge of $H$ 6 (Achdou et al., 2018).

Markdown Report Issue Upgrade to Chat

References (1)

A minimax near-optimal algorithm for adaptive rejection sampling (2018)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Nearest-Neighbor Adaptive Rejection Sampling (NNARS).