Papers
Topics
Authors
Recent
Search
2000 character limit reached

Map of Elections Framework

Updated 1 February 2026
  • Map of Elections Framework is a methodological paradigm that represents elections as high-dimensional vectors then embeds them into low-dimensional, interpretable maps.
  • It employs efficient distance metrics like positionwise Earth-Mover’s Distance and approval-wise ℓ1 distance to compare elections even when exact isomorphic matching is computationally infeasible.
  • The framework facilitates practical analysis of synthetic models and real-world elections by providing canonical compass points and robust embedding techniques for model benchmarking and visualization.

A map of elections framework is a methodological paradigm and visualization tool for representing, analyzing, and comparing elections (or election data) by embedding them into a metric or feature space, and constructing interpretable low-dimensional representations—often two-dimensional maps—where geometric proximity reflects high-dimensional similarities. These frameworks enable both abstract exploration of "election spaces" constructed from synthetic or real-world data and the systematic evaluation of model, computational, or social-choice properties associated with different types of elections (Szufa et al., 2022, Szufa, 2024, Szufa et al., 4 Apr 2025).

1. Representing Election Spaces

The foundational element is to formalize a space of elections. In the standard variant, an (ordinal) election E=(C,V)E=(C,V) is specified by a fixed set of mm candidates CC and a multiset VV of nn voters, each casting a linear order over CC. For approval ballots, each voter submits a subset A(v)CA(v)\subseteq C instead of a ranking. The totality of such elections (with given mm and nn) forms a discrete but very high-dimensional "election space".

Each election EE can be transformed into a high-dimensional vector (or matrix), such as the position-frequency matrix F(E)=(fi,j)F(E) = (f_{i,j}) with fi,jf_{i,j} equal to the fraction of voters ranking candidate cjc_j in position ii (ordinal), or the approval-wise vector av(E)=(x1,,xm)\mathrm{av}(E)=(x_1,\ldots,x_m) with xjx_j being the normalized approval fraction for cjc_j.

This vectorization is the basis for the subsequent definition of inter-election distances and the construction of low-dimensional maps (Szufa, 2024, Szufa et al., 4 Apr 2025, Szufa et al., 2022).

2. Election Distances and Metrics

To compare elections quantitatively, the framework defines a distance d(E,F)d(E,F) between any two elections EE and FF. The gold-standard isomorphic distance—minimizing, e.g., swap (Kendall-tau) distance under all voter/candidate permutations to align elections—is computationally infeasible for realistic dataset sizes, as it is NP-hard (GI-hard by Kemeny-score reduction) (Boehmer et al., 2022, Szufa et al., 4 Apr 2025).

As a tractable yet expressive alternative, Szufa et al. [AAMAS 2020] introduce the positionwise (EMD) distance dposd_{\mathrm{pos}}, defined as follows: For each pair of elections, the distance is the minimum (over candidate bijections) of the sum (over candidates) of Earth-Mover's Distances (EMD) between their frequency vectors. For approval elections, the approval-wise 1\ell_1 distance between sorted score vectors is employed (Szufa et al., 2022). These distances are true metrics (or pseudometrics), admit efficient computation via the Hungarian algorithm or prefix-sum methods, and exhibit high empirical correlation with the intractable isomorphic metrics (Boehmer et al., 2022, Szufa et al., 4 Apr 2025).

Summary of primary metrics:

Metric Type Complexity
Isomorphic swap NP-hard impractical
Positionwise (EMD) Polynomial recommended
Approval-wise 1\ell_1 Polynomial fast

Computing dposd_{\mathrm{pos}} typically involves: constructing the frequency matrices, deriving a cost matrix of vector distances between candidates (via EMD), and solving a min-cost matching (Boehmer et al., 2022, Szufa et al., 4 Apr 2025).

3. Low-Dimensional Embedding: Map Construction

Given a collection {E1,,Ek}\{E_1,\ldots,E_k\} and their pairwise distances, the next step is to embed them as points {xE1,,xEk}R2\{x_{E_1},\ldots,x_{E_k}\}\subset\mathbb{R}^2 such that xEixEjd(Ei,Ej)\|x_{E_i}-x_{E_j}\|\approx d(E_i,E_j).

Common dimensionality reduction and graph-drawing methods include:

  • Kamada–Kawai (spring energy minimization; preferred for monotonicity and low distortion)
  • Fruchterman–Reingold (force-directed; more evenly spreads points)
  • Metric Multidimensional Scaling (MDS; stress minimization)
  • Others: PCA, t-SNE, LLE (mainly for exploratory use)

Empirical and theoretical evaluations establish that e.g., Kamada–Kawai embeddings achieve \sim0.97 Pearson correlation between original and map distances, and average distortion ratios of \sim1.2, ensuring that map geometry is a faithful summary of intrinsic inter-election distances (Szufa, 2024, Szufa et al., 4 Apr 2025).

4. Canonical Structure and Interpretability: Compass and Skeleton Maps

A key innovation for interpretability is the inclusion of extremal ("compass") elections as canonical corner points:

  • Identity (ID): all voters perfectly agree
  • Uniformity (UN): all possible orders equally present
  • Stratification (ST): two internally cohesive camps, each with block preference
  • Antagonism (AN): two camps with orders exactly reversed

By analytically computing inter-compass distances (e.g., dpos(ID,UN)=(m21)/3d_{\rm pos}(\mathrm{ID},\mathrm{UN}) = (m^2-1)/3), convex-combination paths between them structure the map's backbone quadrilateral (the "skeleton"), and any election can be barycentrically projected onto the compass axes for interpretable “concentration” and “polarity” coordinates (Boehmer et al., 2021, Boehmer et al., 2022, Szufa et al., 4 Apr 2025).

The frequency-matrix “skeleton map” enables fast, geometric assessment of any dataset or model's placement relative to these archetypes (Boehmer et al., 2022).

5. Applications: Synthetic Models, Real Data, and Visualization

Synthetic election cultures (e.g., Impartial Culture, Mallows, urn, Euclidean, single-peaked) and real-world elections (political, sports, surveys) can be rendered and positioned on the map. Extensive benchmarks show:

  • Real elections typically cluster near the Mallows path with normalized dispersion parameters (θ[0.25,0.4]\theta\in[0.25,0.4]) (Boehmer et al., 2021, Boehmer et al., 2022).
  • Sporting events and indicator-based surveys spread from ID through UN, with varying degree of consensus.
  • Synthetic models often define distinctive map regions—e.g., IC near UN, low-dispersion Mallows near ID, Euclidean and urn models as intermediary points (Szufa, 2024, Szufa et al., 4 Apr 2025, Boehmer et al., 2022).

Map colorings facilitate further empirical and algorithmic studies, such as mapping computational hardness of winner determination, cohesiveness (e.g., \ell-cohesive groups in approval maps), Condorcet winner frequency, or approximation ratios in multiwinner rule heuristics (Szufa et al., 2022, Szufa et al., 4 Apr 2025).

6. Extensions: Varying Sizes, Incomplete Ballots, and Feature Maps

Faliszewski et al. extend the standard framework to handle varying numbers of candidates, voters, and top-truncated ballots. The extended positionwise distance d^pos\hat{d}_{\rm pos} employs matrix "stretching" for comparison, and the DAP (Diversity, Agreement, Polarization) feature metric provides a fast, robust alternative summary (Faliszewski et al., 25 Jan 2026, Faliszewski et al., 2023).

The framework also supports applications to approval ballots (using approval-wise 1\ell_1 distance with corresponding convex hulls and clustering interpretation) (Szufa et al., 2022), and construction of “maps of preference orders” at the voter-ballot level for fine-grained internal structure analysis (Faliszewski et al., 2023).

7. Theoretical Guarantees and Practical Guidance

Rigorous propositions validate the metric properties, embedding accuracy, and nearly lossless “backlifting” from frequency-matrix skeletons back to valid elections (Szufa et al., 4 Apr 2025, Boehmer et al., 2022). The framework provides robust empirical reliability across variable dataset sizes, candidate counts, and sampling errors (Boehmer et al., 2022).

From a design perspective, the map of elections guides experimental regime selection (e.g., sampling from a Mallows model with realistic θ\theta per survey and political data analysis), benchmarking of algorithmic performance, and diagnosis of structural “gaps” in the space for further model innovation.


References:

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Map of Elections Framework.