Papers
Topics
Authors
Recent
Search
2000 character limit reached

Voronoi-type Loss Functions

Updated 6 February 2026
  • Voronoi-type loss functions are defined by partitioning space based on divergences like Bregman, enabling adaptive clustering and TV-regularized regression.
  • They underpin methods such as the Voronoigram, achieving density-free and tuning-free estimation while matching discrete to continuum total variation.
  • Efficient construction using dual transformations and half-space intersections makes these loss functions valuable for prototype-based clustering and nonparametric regression.

Voronoi-type loss functions are a class of optimization criteria arising from geometric partitioning of spaces according to proximity under specific divergences or variations. These loss functions underpin a variety of statistical learning, regression, and clustering methods, including Bregman Voronoi diagrams for prototype-based clustering and the Voronoigram for total variation-regularized regression. They leverage the geometric structure induced by Voronoi diagrams—partitions of a domain into regions of nearest proximity to a set of sites—generalized by metrics (e.g., Euclidean, Bregman divergence) or statistical regularizers (e.g., discrete total variation). This framework yields algorithms that are inherently adaptive to spatial distribution of data, and, in certain settings, achieve minimax-optimality without density-dependent tuning or neighborhood selection.

1. Geometric Foundations: Voronoi and Bregman Voronoi Diagrams

Classical Voronoi diagrams partition a domain XRdX\subset \mathbb{R}^d into cells ViV_i consisting of all points closer to site xix_i than any other site, under the Euclidean metric. Formally,

Vi={xX:xxi<xxj, ji}.V_i = \{ x \in X: \|x - x_i\| < \|x - x_j\|,~\forall j \neq i \}.

Voronoi-type loss functions arise when replacing the Euclidean distance with a more general divergence, notably the Bregman divergence. Let F:XRF:X\rightarrow\mathbb{R} be strictly convex and differentiable; the Bregman divergence for p,qXp,q\in X is

DF(pq)=F(p)F(q)F(q),pq.D_F(p\,\|\,q) = F(p) - F(q) - \langle \nabla F(q), p - q\rangle.

Given a set of sites S={x1,,xn}S=\{x_1,\ldots,x_n\}, the associated loss functions i(x)=DF(xxi)\ell_i(x) = D_F(x\,\|\,x_i), and the assignment via

Vi={xX:DF(xxi)DF(xxj), ji}V_i = \{ x \in X: D_F(x\,\|\,x_i) \le D_F(x\,\|\,x_j),~\forall j \neq i \}

define the first-type Bregman Voronoi diagram. Each cell is a convex polyhedron, as the bisectors DF(xxi)=DF(xxj)D_F(x\,\|\,x_i) = D_F(x\,\|\,x_j) induce affine hyperplanes, and explicit descriptions can be given in terms of gradients and the generator FF (0709.2196).

2. Voronoigram: Total Variation Regularization on Voronoi Graphs

The Voronoigram estimator arises in nonparametric regression for functions of bounded variation. Given noisy samples yi=f0(xi)+ziy_i = f_0(x_i) + z_i at design points xiRdx_i \in \mathbb{R}^d, estimation is framed as penalized least squares with a discrete total variation (TV) penalty defined on the Voronoi tessellation. Restricting TV-regularized regression to functions constant on each ViV_i, the TV seminorm is discretized as

TV(iθi1Vi)={i,j}EVwijVθiθj\text{TV}\left(\sum_i \theta_i\, 1_{V_i}\right) = \sum_{\{i,j\}\in E^V} w_{ij}^V\, |\theta_i - \theta_j|

where EVE^V includes pairs of indices for which ViV_i and VjV_j share a (d1)(d-1)-dimensional boundary (positive Hausdorff measure), and wijVw_{ij}^V is the measure of the shared boundary. The Voronoigram solves

θ^=argminθRn1ni=1n(yiθi)2+λ{i,j}EVwijVθiθj,\hat{\theta} = \arg\min_{\theta\in\mathbb{R}^n} \frac{1}{n} \sum_{i=1}^n (y_i - \theta_i)^2 + \lambda \sum_{\{i,j\}\in E^V} w_{ij}^V |\theta_i - \theta_j|,

defining f^(x)=θ^i\hat f(x) = \hat\theta_i for xVix \in V_i (Hu et al., 2022). This estimand matches the (continuum) TV-regularized regression among piecewise constant functions over Voronoi cells.

3. Large-Sample Properties and Density-Freeness

One distinguishing property of Voronoi-type loss functions is the “density-free” asymptotic behavior. Under random design, with xix_i i.i.d. from a Lipschitz-continuous density pp bounded away from zero and infinity on Ω\Omega, the discrete Voronoi TV converges in probability to a scaled version of the continuum total variation:

{i,j}EVwijVf(xi)f(xj)pcdΩf(x)2dx,\sum_{\{i,j\}\in E^V} w_{ij}^V |f(x_i) - f(x_j)| \xrightarrow{p} c_d \int_\Omega \|\nabla f(x)\|_2\,dx,

where cdc_d is a dimension-dependent constant. This asymptotic does not involve pp, unlike classical graph-TV schemes (e.g., ε\varepsilon-NN or kk-NN graphs), whose empirical TV limits inherit density weights and hence lack invariance under non-uniform sampling. This density-independence ensures unbiased regularization in the large-sample regime (Hu et al., 2022).

4. Algorithmic Construction and Combinatorial Complexity

Bregman Voronoi diagrams and their induced loss landscapes admit efficient construction through two principal methods: half-space intersection and reduction to power diagrams. For half-space intersection, each site xix_i defines a tangent hyperplane to z=F(x)z = F(x) at (xi,F(xi))(x_i, F(x_i)), and the associated Voronoi cell results from projecting the intersection of half-spaces onto XX. Alternatively, in the dual (gradient) coordinates after the Legendre transform FF^*, the power diagram of points ci=F(xi)c_i = \nabla F(x_i) with weights wiw_i yields the Voronoi segmentation, matching the combinatorial cell structure in xx-space. Both approaches require O(nlogn+n(d+1)/2)O(n \log n + n^{\lfloor (d+1)/2 \rfloor}) time (fixed dd) (0709.2196).

For the Voronoigram, efficient computation entails constructing the Voronoi tessellation of the points and computing the measures of shared boundaries. These edge weights—surface area in d=3d=3, length in d=2d=2—constitute the adjacency graph for the TV penalty in the optimization.

5. Extensions: Higher-Order and Composite Divergence Losses

Voronoi-type loss partitions generalize to kk-order and kk-bag diagrams. The kk-order Bregman diagram assigns each point to the kk nearest sites under DFD_F, with cells again characterized by power-diagram structures via specifically constructed centers and weights:

cT=1kjTF(xj),wT=F(cT)cT,F(cT)1kjT[F(xj)xj,F(xj)].c_T = \frac{1}{k} \sum_{j\in T} \nabla F(x_j), \quad w_T = F(c_T) - \langle c_T, \nabla F(c_T)\rangle - \frac{1}{k}\sum_{j\in T}[F(x_j)-\langle x_j,\nabla F(x_j)\rangle].

Composite “kk-bag” diagrams encode mixtures of divergences, defined by convex generators FF_\ell and site-dependent mixtures (α(i)\alpha^{(i)}), leading to partitions in lifted spaces of dimension d+kd+k. In both generalizations, the combinatorial and computational properties parallel those of the base diagrams, with complexity scaling as O(n(d+k)/2)O(n^{\lfloor (d+k)/2 \rfloor}) for kk-bag diagrams (0709.2196).

Bregman triangulations—the analogues of Delaunay triangulations—arise via lower convex hulls of lifted points (xi,F(xi))(x_i, F(x_i)) or, equivalently, as geodesic triangulations in the dual space. These structures generalize classical spatial interpolation and nearest-neighbor constructs, admitting rich geometric combinatorics.

6. Statistical and Practical Properties

The Voronoigram provides a conceptually clean, tuning-free, density-free discretization of the continuum total variation functional. Its fitted function admits one-to-one correspondence between discrete and continuum TV, thereby preserving complexity under extrapolation: the number of constant regions in the fitted fVf^V matches the number of connected components in the Voronoi graph. Importantly, the Voronoigram, with properly chosen regularization parameter λσn(d1)/d(logn)1/2+α\lambda\asymp\sigma n^{(d-1)/d}(\log n)^{1/2+\alpha}, achieves minimax risk up to logarithmic factors for bounded-variation regression:

EfVf0L2(P)2Cn1/d(logn)O(1).\mathbb{E}\|f^V - f_0\|_{L^2(P)}^2 \leq C n^{-1/d} (\log n)^{O(1)}.

Comparable rates are attainable for other graph-TV and wavelet schemes, but only the Voronoigram achieves this rate without auxiliary neighborhood tuning (e.g., ε\varepsilon, kk), and with exact preservation of TV under 1-NN extrapolation (Hu et al., 2022).

In first-type Bregman Voronoi diagrams, the minimum-loss assignment (x)=miniDF(xxi)\ell(x) = \min_i D_F(x\,\|\,x_i) serves as a natural prototype-based clustering rule, with polyhedral cell structure for computational tractability. The Legendre dual structure further enables handling of curved bisectors and symmetry in statistical learning applications (0709.2196).

7. Comparison and Context

The table below summarizes core distinctions among Voronoi-type loss functions in TV-regularized regression:

Method Density Dependence Tuning Parameter TV-Extrapolation Preservation
Voronoigram Density-free None Exact
ε\varepsilon-NN TV Density-weighted ε\varepsilon No
kk-NN TV Density-weighted kk No

The Voronoigram is uniquely tuning-free and density-free, with extrapolation matching the discrete TV structure precisely. In contrast, classical graph-TV regularizers require explicit parameter selection and yield continuum limits biased by sampling density.

Voronoi-type loss function frameworks—spanning Bregman divergence–based partitioning and TV-regularization on Voronoi graphs—offer principled, efficient, and statistically robust methodologies for loss landscape construction, spatial regularization, and high-dimensional function estimation (0709.2196, Hu et al., 2022).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Voronoi-type Loss Functions.