Reduced Nearest Neighbour with Weighted Condensing
- Reduced Nearest Neighbour (RNN) is a method that generalizes classical NN condensing by assigning positive weights to samples, enhancing data compression.
- It utilizes a weighted distance metric to improve compression ratios and maintain classification accuracy, with guarantees such as Bayes consistency.
- A greedy heuristic algorithm efficiently selects representative points, balancing computational complexity with near-optimal performance.
Weighted Distance Nearest-Neighbor Condensing (WNN) is a generalization of classical nearest-neighbor condensing that enables efficient sample reduction in metric-space classification by introducing a positive weighting function over the condensed subset. Each element of the condensed set is assigned an individual weight, and weighted distance governs both assignment and prediction. This approach leads to greatly improved sample compression, maintains generalization guarantees comparable to standard nearest-neighbor (NN) condensing, and is provably Bayes-consistent under broad conditions (Gottlieb et al., 2023).
1. Formal Definition and Problem Formulation
Let $(\mathcal{X}, \rho)$ be a separable metric space and $S = \{(x_1, y_1), \dots, (x_n, y_n)\} \subset \mathcal{X} \times \{0, 1\}$ a labeled sample. The condensed set is a subset $\tilde{S} \subseteq S$ with a positive weighting function $w : \tilde{S} \to (0, \infty)$, extended to all of $\mathcal{X}$ by $w(x) = 1$ for $x \notin \tilde{S}$. The weighted distance between two points is

$$\rho_w(x, x') = \frac{\rho(x, x')}{w(x)\, w(x')}.$$

When classifying a query $q \in \mathcal{X}$, the weighted distance to a condensed point $p \in \tilde{S}$ is $\rho(q, p)/w(p)$, as $w(q) = 1$. The associated classifier is

$$h_{\tilde{S}, w}(q) = y_{p^{*}}, \qquad p^{*} = \arg\min_{p \in \tilde{S}} \rho_w(q, p).$$

A pair $(\tilde{S}, w)$ is a consistent WNN condensing if for every sample $(x_i, y_i) \in S$,

$$h_{\tilde{S}, w}(x_i) = y_i.$$

The principal optimization is to find, out of all consistent pairs, one minimizing $|\tilde{S}|$.
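As a concrete illustration of this rule, the following Python sketch implements weighted nearest-neighbour prediction under a Euclidean base metric; the function name, the toy data, and the metric choice are illustrative assumptions rather than part of the original formulation.

```python
import numpy as np

def weighted_nn_predict(query, prototypes, labels, weights):
    """Weighted NN rule: return the label of the prototype p minimizing rho(q, p) / w(p)."""
    # Euclidean base metric rho; any metric could be substituted here.
    dists = np.linalg.norm(prototypes - query, axis=1)
    return labels[np.argmin(dists / weights)]

# Toy usage: the heavily weighted prototype "attracts" queries that are
# geometrically closer to the other one.
prototypes = np.array([[0.0, 0.0], [4.0, 0.0]])
labels = np.array([0, 1])
weights = np.array([3.0, 1.0])            # prototype 0 carries a large weight
print(weighted_nn_predict(np.array([2.5, 0.0]), prototypes, labels, weights))  # -> 0
```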
2. Theoretical Properties and Generalization Bounds
Separation of Power
A strict power separation exists between unweighted and weighted condensing. For arbitrarily large $n$, there are $n$-point datasets for which any consistent unweighted NN condensing requires $\Omega(n)$ points, whereas weighted condensing achieves consistency with only two points. The construction involves two interleaved geometric "bananas" of opposite labels, where large weights at the two extremes enable circular decision regions under WNN, compressing the data to two points.
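To see why heavily weighted points carve out circular regions (a standard property of multiplicatively weighted nearest-neighbor rules, worked out here as an illustration rather than quoted from the source), consider two prototypes $p, q \in \mathbb{R}^{d}$ with weights $w_p > w_q$ under the Euclidean metric. The region won by $p$ is

$$\Bigl\{x : \tfrac{\|x - p\|}{w_p} \le \tfrac{\|x - q\|}{w_q}\Bigr\} \;=\; \bigl\{x : t\,\|x - p\|^{2} \le \|x - q\|^{2}\bigr\}, \qquad t = (w_q / w_p)^{2} < 1,$$

which rearranges to $\|x - c\|^{2} \ge r^{2}$ with $c = (q - t\,p)/(1 - t)$. The lighter prototype's cell is therefore a disk (bounded by an Apollonius circle) and the heavier prototype claims its exterior, which is how two high-weight extreme points can correctly label the interleaved bananas.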
Generalization Bounds
A sample-compression argument yields the following generalization bound. For any empirically consistent WNN classifier $h_{\tilde{S}, w}$ with $|\tilde{S}| = k$ on a size-$n$ i.i.d. sample, with probability at least $1 - \delta$:

$$\mathrm{err}(h_{\tilde{S}, w}) \;\le\; \frac{k \ln n + \ln(1/\delta)}{n - k}.$$

If reconstruction is permutation-invariant, the bound sharpens to

$$\mathrm{err}(h_{\tilde{S}, w}) \;\le\; \frac{\ln \binom{n}{k} + \ln(1/\delta)}{n - k}.$$
These bounds are quantitatively on par with those for unweighted NN condensing.
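For a sense of scale, the bound can be evaluated numerically; the snippet below assumes the compression-bound form stated above and uses hypothetical values of $n$, $k$, and $\delta$.

```python
import math

def compression_bound(n, k, delta, permutation_invariant=False):
    """Generalization bound for a consistent compression scheme of size k (form as sketched above)."""
    if permutation_invariant:
        # ln C(n, k) via log-gamma: ln n! - ln k! - ln (n-k)!
        complexity = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    else:
        complexity = k * math.log(n)
    return (complexity + math.log(1.0 / delta)) / (n - k)

# Hypothetical figures: n = 10000 training points, k = 200 condensed points, delta = 0.05.
print(round(compression_bound(10000, 200, 0.05), 3))                               # order-dependent form
print(round(compression_bound(10000, 200, 0.05, permutation_invariant=True), 3))   # sharper, permutation-invariant form
```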
3. Greedy Heuristic Algorithm for Weighted Condensing
The “Greedy Weighted Condensing” heuristic selects at each iteration the sample point whose “ball of radius = distance to nearest enemy” covers the largest number of uncovered points of the same label, and assigns its weight accordingly.
Algorithmic Structure
The algorithm maintains the set of still-uncovered training points and an initially empty condensed set $\tilde{S}$. At each iteration, the algorithm solves

$$x^{*} = \arg\max_{(x, y) \in S}\; \bigl|\{(x', y') \text{ uncovered} : y' = y,\ \rho(x, x') < d_{\mathrm{ne}}(x)\}\bigr|,$$

subject to the covering radius of $x^{*}$ remaining strictly below its nearest-enemy distance $d_{\mathrm{ne}}(x^{*})$; the selected point is added to $\tilde{S}$ with a weight reflecting this radius, and the points it covers are marked as covered. This matches a greedy set-cover approximation (Chvátal's algorithm) in which each center covers same-label points within its enemy-exclusion radius.
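A minimal Python sketch of the greedy loop follows. The weight assignment (setting each chosen center's weight to its nearest-enemy distance, so that the unit weighted ball coincides with the enemy-exclusion ball) is one natural reading of "assigns its weight accordingly" and is an assumption of this sketch, not necessarily the paper's exact rule.

```python
import numpy as np

def greedy_weighted_condense(X, y):
    """Greedy weighted condensing sketch (Euclidean metric assumed).

    Repeatedly pick the point whose enemy-exclusion ball covers the most
    still-uncovered same-label points; its weight is set to its
    nearest-enemy distance (a heuristic choice, see the text above).
    Returns indices of the condensed set and their weights.
    """
    n = len(X)
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)     # pairwise distances
    enemy = y[:, None] != y[None, :]
    ne = np.where(enemy, D, np.inf).min(axis=1)                    # nearest-enemy distance per point
    covers = [np.flatnonzero((D[i] < ne[i]) & (y == y[i])) for i in range(n)]

    uncovered = np.ones(n, dtype=bool)
    chosen, weights = [], []
    while uncovered.any():
        gains = [uncovered[c].sum() for c in covers]
        i = int(np.argmax(gains))                                  # best uncovered-coverage gain
        chosen.append(i)
        weights.append(ne[i])
        uncovered[covers[i]] = False
    return np.array(chosen), np.array(weights)

# Toy usage: two clusters of opposite labels condense to one weighted point each.
X = np.array([[0.0, 0.0], [0.2, 0.0], [1.0, 0.0], [1.2, 0.0]])
y = np.array([0, 0, 1, 1])
print(greedy_weighted_condense(X, y))   # e.g. indices [0, 2] with weights [1.0, 0.8]
```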
Computational Complexity
A naive implementation requires $O(n^2)$ distance computations per iteration and up to $n$ iterations, for an overall $O(n^3)$ complexity. Use of spatial data structures and careful maintenance of nearest-enemy distances can reduce the empirical computational burden to roughly $O(n^2)$ or better.
4. Bayes Consistency and Statistical Guarantees
Let $d_{\mathrm{ne}}(x)$ denote the minimal distance from $x$ to any sample of the opposite label (taken to be $\infty$ when no such sample exists). Consider the (intractable) condensing rule:

$$(\tilde{S}^{*}, w^{*}) \in \arg\min \bigl\{\, |\tilde{S}| : (\tilde{S}, w) \text{ is a consistent WNN condensing of } S \,\bigr\}.$$
Theorem (Bayes consistency):
Suppose $(\mathcal{X}, \rho)$ is separable, the marginal distribution of $X$ is atomless, and the conditional label probability $\eta(x) = \mathbb{P}(Y = 1 \mid X = x)$ is piecewise-continuous. Then as $n \to \infty$, the excess risk of the resulting classifier over the Bayes risk converges to zero, almost surely.
In other words, WNN condensing asymptotically attains the minimal possible classification error.
Corollary (Greedy heuristic):
Assuming an additional mild tail condition on the distribution over the metric space (e.g., bounded support or Gaussian tails), the greedy heuristic is also Bayes-consistent, since its solution size exceeds the optimum by at most the logarithmic factor inherited from the set-cover approximation.
5. Empirical Results and Comparative Analysis
Small-scale Evaluation
The following table summarizes condensed-set sizes obtained by four methods across three two-class datasets:
| Dataset | Points | MSS | RSS | IP (opt. NN) | WNN |
|---|---|---|---|---|---|
| Circle | 200 | 52 | 45 | 7 | 12 |
| Banana | 200 | 74 | 66 | 32 | 35 |
| Iris | 100 | 11 | 9 | 2 | 4 |
- MSS: modified selective subset
- RSS: relaxed selective subset
- IP: integer-programming optimum for unweighted NN
- WNN: greedy weighted
WNN condensing outperforms MSS and RSS in sample compression, closely approximating the optimal unweighted solution.
Large-scale Evaluation
For notMNIST (≈19,000 samples, 10 classes, dimensionality reduced via UMAP), with a 70/30 train/test split (10-fold), the following outcomes were observed:
- Test error: WNN matches 1-NN (no compression); both MSS and RSS yield increased error.
- Compression ratio: WNN retains ~20% (80% compression), with MSS slightly better compression but higher error, while RSS is inferior in both metrics.
This demonstrates that WNN yields significant reduction in stored samples without compromising classification accuracy.
6. Limitations, Open Problems, and Future Directions
Current Limitations
- The greedy heuristic lacks a constant-factor approximation guarantee for minimal weighted condensing.
- Computational complexity remains substantial for very large datasets in the absence of specialized search structures.
Open Questions
- The computational complexity of weighted condensing is not fully characterized; in particular, its approximation hardness is unresolved.
- Existence of improved approximation algorithms, e.g., constant-factor approximations for minimal weighted condensing.
- Extension to multiclass classification and to generalized distance metrics dependent on the weights.
Potential Extensions
- Developing fast search structures tailored for weighted-distance nearest-neighbor queries.
- Designing alternate greedy or local-search heuristic algorithms with provable approximation guarantees.
- Integrating metric learning with weighted condensing to adapt the base metric $\rho$ for further performance improvement.
Weighted Distance Nearest-Neighbor Condensing thus provides a strict generalization of standard NN condensing, preserves generalization properties, and consistently yields smaller condensed representations without loss of accuracy (Gottlieb et al., 2023).