Map of Elections Framework
- Map of Elections Framework is a methodological paradigm that represents elections as high-dimensional vectors then embeds them into low-dimensional, interpretable maps.
- It employs efficient distance metrics like positionwise Earth-Mover’s Distance and approval-wise ℓ1 distance to compare elections even when exact isomorphic matching is computationally infeasible.
- The framework facilitates practical analysis of synthetic models and real-world elections by providing canonical compass points and robust embedding techniques for model benchmarking and visualization.
A map of elections framework is a methodological paradigm and visualization tool for representing, analyzing, and comparing elections (or election data) by embedding them into a metric or feature space, and constructing interpretable low-dimensional representations—often two-dimensional maps—where geometric proximity reflects high-dimensional similarities. These frameworks enable both abstract exploration of "election spaces" constructed from synthetic or real-world data and the systematic evaluation of model, computational, or social-choice properties associated with different types of elections (Szufa et al., 2022, Szufa, 2024, Szufa et al., 4 Apr 2025).
1. Representing Election Spaces
The foundational element is to formalize a space of elections. In the standard variant, an (ordinal) election is specified by a fixed set of candidates and a multiset of voters, each casting a linear order over . For approval ballots, each voter submits a subset instead of a ranking. The totality of such elections (with given and ) forms a discrete but very high-dimensional "election space".
Each election can be transformed into a high-dimensional vector (or matrix), such as the position-frequency matrix with equal to the fraction of voters ranking candidate in position (ordinal), or the approval-wise vector with being the normalized approval fraction for .
This vectorization is the basis for the subsequent definition of inter-election distances and the construction of low-dimensional maps (Szufa, 2024, Szufa et al., 4 Apr 2025, Szufa et al., 2022).
2. Election Distances and Metrics
To compare elections quantitatively, the framework defines a distance between any two elections and . The gold-standard isomorphic distance—minimizing, e.g., swap (Kendall-tau) distance under all voter/candidate permutations to align elections—is computationally infeasible for realistic dataset sizes, as it is NP-hard (GI-hard by Kemeny-score reduction) (Boehmer et al., 2022, Szufa et al., 4 Apr 2025).
As a tractable yet expressive alternative, Szufa et al. [AAMAS 2020] introduce the positionwise (EMD) distance , defined as follows: For each pair of elections, the distance is the minimum (over candidate bijections) of the sum (over candidates) of Earth-Mover's Distances (EMD) between their frequency vectors. For approval elections, the approval-wise distance between sorted score vectors is employed (Szufa et al., 2022). These distances are true metrics (or pseudometrics), admit efficient computation via the Hungarian algorithm or prefix-sum methods, and exhibit high empirical correlation with the intractable isomorphic metrics (Boehmer et al., 2022, Szufa et al., 4 Apr 2025).
Summary of primary metrics:
| Metric | Type | Complexity |
|---|---|---|
| Isomorphic swap | NP-hard | impractical |
| Positionwise (EMD) | Polynomial | recommended |
| Approval-wise | Polynomial | fast |
Computing typically involves: constructing the frequency matrices, deriving a cost matrix of vector distances between candidates (via EMD), and solving a min-cost matching (Boehmer et al., 2022, Szufa et al., 4 Apr 2025).
3. Low-Dimensional Embedding: Map Construction
Given a collection and their pairwise distances, the next step is to embed them as points such that .
Common dimensionality reduction and graph-drawing methods include:
- Kamada–Kawai (spring energy minimization; preferred for monotonicity and low distortion)
- Fruchterman–Reingold (force-directed; more evenly spreads points)
- Metric Multidimensional Scaling (MDS; stress minimization)
- Others: PCA, t-SNE, LLE (mainly for exploratory use)
Empirical and theoretical evaluations establish that e.g., Kamada–Kawai embeddings achieve 0.97 Pearson correlation between original and map distances, and average distortion ratios of 1.2, ensuring that map geometry is a faithful summary of intrinsic inter-election distances (Szufa, 2024, Szufa et al., 4 Apr 2025).
4. Canonical Structure and Interpretability: Compass and Skeleton Maps
A key innovation for interpretability is the inclusion of extremal ("compass") elections as canonical corner points:
- Identity (ID): all voters perfectly agree
- Uniformity (UN): all possible orders equally present
- Stratification (ST): two internally cohesive camps, each with block preference
- Antagonism (AN): two camps with orders exactly reversed
By analytically computing inter-compass distances (e.g., ), convex-combination paths between them structure the map's backbone quadrilateral (the "skeleton"), and any election can be barycentrically projected onto the compass axes for interpretable “concentration” and “polarity” coordinates (Boehmer et al., 2021, Boehmer et al., 2022, Szufa et al., 4 Apr 2025).
The frequency-matrix “skeleton map” enables fast, geometric assessment of any dataset or model's placement relative to these archetypes (Boehmer et al., 2022).
5. Applications: Synthetic Models, Real Data, and Visualization
Synthetic election cultures (e.g., Impartial Culture, Mallows, urn, Euclidean, single-peaked) and real-world elections (political, sports, surveys) can be rendered and positioned on the map. Extensive benchmarks show:
- Real elections typically cluster near the Mallows path with normalized dispersion parameters () (Boehmer et al., 2021, Boehmer et al., 2022).
- Sporting events and indicator-based surveys spread from ID through UN, with varying degree of consensus.
- Synthetic models often define distinctive map regions—e.g., IC near UN, low-dispersion Mallows near ID, Euclidean and urn models as intermediary points (Szufa, 2024, Szufa et al., 4 Apr 2025, Boehmer et al., 2022).
Map colorings facilitate further empirical and algorithmic studies, such as mapping computational hardness of winner determination, cohesiveness (e.g., -cohesive groups in approval maps), Condorcet winner frequency, or approximation ratios in multiwinner rule heuristics (Szufa et al., 2022, Szufa et al., 4 Apr 2025).
6. Extensions: Varying Sizes, Incomplete Ballots, and Feature Maps
Faliszewski et al. extend the standard framework to handle varying numbers of candidates, voters, and top-truncated ballots. The extended positionwise distance employs matrix "stretching" for comparison, and the DAP (Diversity, Agreement, Polarization) feature metric provides a fast, robust alternative summary (Faliszewski et al., 25 Jan 2026, Faliszewski et al., 2023).
The framework also supports applications to approval ballots (using approval-wise distance with corresponding convex hulls and clustering interpretation) (Szufa et al., 2022), and construction of “maps of preference orders” at the voter-ballot level for fine-grained internal structure analysis (Faliszewski et al., 2023).
7. Theoretical Guarantees and Practical Guidance
Rigorous propositions validate the metric properties, embedding accuracy, and nearly lossless “backlifting” from frequency-matrix skeletons back to valid elections (Szufa et al., 4 Apr 2025, Boehmer et al., 2022). The framework provides robust empirical reliability across variable dataset sizes, candidate counts, and sampling errors (Boehmer et al., 2022).
From a design perspective, the map of elections guides experimental regime selection (e.g., sampling from a Mallows model with realistic per survey and political data analysis), benchmarking of algorithmic performance, and diagnosis of structural “gaps” in the space for further model innovation.
References:
- (Szufa, 2024) "Map of Elections"
- (Szufa et al., 4 Apr 2025) "Drawing a Map of Elections"
- (Szufa et al., 2022) "How to Sample Approval Elections?"
- (Boehmer et al., 2021) "Putting a Compass on the Map of Elections"
- (Boehmer et al., 2022) "Collecting, Classifying, Analyzing, and Using Real-World Elections"
- (Boehmer et al., 2022) "Understanding Distance Measures Among Elections"
- (Boehmer et al., 2022) "Expected Frequency Matrices of Elections: Computation, Geometry, and Preference Learning"
- (Faliszewski et al., 25 Jan 2026) "Distances Between Top-Truncated Elections of Different Sizes"
- (Faliszewski et al., 2023) "Diversity, Agreement, and Polarization in Elections"