Papers
Topics
Authors
Recent
2000 character limit reached

Martingale Majority Certificate (MMC)

Updated 24 October 2025
  • Martingale Majority Certificate (MMC) is a framework that uses sequential martingale analysis to certify majority vote accuracy with non-asymptotic guarantees.
  • It leverages concentration inequalities, PAC-Bayes theory, and e-processes to provide finite-sample, anytime-valid error bounds.
  • MMC underpins reliable LLM inference, distributed protocols, and control verification by quantifying sample efficiency and decision reliability.

The Martingale Majority Certificate (MMC) is a statistical framework for certifying aggregate decisions made via majority vote or weighted voting in settings where outcomes are modeled by sequential stochastic processes, typically martingales or supermartingales. Originally introduced in the context of reliable inference for LLMs, MMC generalizes to a variety of sequential decision-making, verification, and protocol design scenarios. The certificate leverages martingale concentration inequalities, PAC-Bayes theory, and sequential e-processes to provide non-asymptotic, anytime-valid guarantees that the majority vote is correct or, more generally, that the group-level decision remains within a specified error envelope.

1. Mathematical Formulation and Core Principles

MMC is fundamentally based on constructing a sequential stopping rule and a family of nonnegative supermartingales (e-processes) that monitor the empirical margin between the “leader” (majority answer) and the runner-up (second-most frequent answer) across independent samples. Let X1,X2,,XnX_{1}, X_{2}, \ldots, X_{n} be i.i.d. draws from the terminal distribution pp of an LLM or, more generally, from the output of a stochastic process on a hypothesis space HH. The majority vote at time nn is c^n=argmaxcNc\hat{c}_n = \arg\max_{c} N_c where Nc=i=1n1{Xi=c}N_c = \sum_{i=1}^{n} \mathbf{1}\{X_i = c\}. MMC seeks to answer, for a given sample size nn, whether c^n\hat{c}_n coincides with the mode cc^* of pp.

The core idea is to construct e-processes EnE_n such that the maximum value over all nn is sub-exponential under the null hypothesis (i.e., the majority vote is not the true mode). Ville’s inequality then gives anytime-valid probabilistic guarantees that, upon crossing a preset threshold 1/ϵ1/\epsilon, the certificate guarantees with high probability (1ϵ1-\epsilon) that the majority decision matches the correct label. The construction typically uses cumulative log-margins or sign-margins (e.g., i=1n[1{Xi=c}1{Xi=j}]\sum_{i=1}^{n} [1\{X_i = c^*\} - 1\{X_i = j^*\}]).

2. Concentration Inequalities and Finite-Sample Certification

MMC leverages concentration bounds to quantify the sample complexity and reliability of the certification. For finite nn and kk candidate answers, Hoeffding’s inequality provides a basic guarantee: P[c^nc](k1)exp(n2δ2),\mathbb{P}\left[\hat{c}_n \neq c^*\right] \leq (k-1)\exp\left(-\frac{n}{2}\delta^2\right), with δ:=pcmaxjcpj\delta := p_{c^*} - \max_{j \neq c^*} p_j. Bernstein and Chernoff–Markov bounds yield sharper certificates incorporating variance:

  • Bernstein: uses σj2:=pc+pj(pcpj)2\sigma^2_j := p_{c^*} + p_j - (p_{c^*} - p_j)^2
  • Chernoff–Markov: P[c^nc]jexp{nlog[1(pcpj)2]}\mathbb{P}[\hat{c}_n \neq c^*] \leq \sum_{j} \exp\{n\log[1 - (\sqrt{p_{c^*}} - \sqrt{p_j})^2]\}

The bounds ensure that error probabilities decay exponentially with nn, with rates governed by the signal-to-noise ratio (SNR) of the margin variable and the separation δ\delta.

3. Sequential Decision Making and Anytime-Validity

MMC formalizes adaptive sample sufficiency through sequential analysis. Each new sample updates the e-processes, and the system monitors whether thresholds have been surpassed, thus triggering certification:

  • The stopping rule is governed by the martingale property of the e-processes: nonnegative supermartingales such that expected future increments, conditioned on the present, are non-increasing.
  • By Ville’s inequality, for any ϵ\epsilon, the probability that the running maximum of the e-process exceeds 1/ϵ1/\epsilon is at most ϵ\epsilon.
  • This mechanism yields an “anytime” guarantee: one can halt sampling as soon as certification is achieved, balancing computational cost and reliability.

In LLMs, this approach certifies self-consistency: the majority vote over rollouts matches the modal output with high probability (Cordero-Encinar et al., 20 Oct 2025).

4. Connection to Self-Consistency, Test-Time Training, and Post-Training Objectives

MMC analyzes and unifies the statistical justification for two widely used LLM reliability strategies:

  • Self-consistency sampling: Aggregates answers from independent chain-of-thought trajectories; MMC certifies the mode.
  • Test-Time Reinforcement Learning (TTRL): Adaptively modifies the terminal distribution via KL-regularized objectives to sharpen the mode.

Label-free post-training methods exponentially tilt the answer distribution, increasing the margin δ\delta and SNR, thus reducing sample requirements for certification. Proposed test-time objectives directly optimize SNR:

  • SNR-based reward: rn1(Y)=(NleaderNrunner-up)2n(Nleader+Nrunner-up)(NleaderNrunner-up)2r^1_n(\mathcal{Y}) = \frac{(N_{\text{leader}} - N_{\text{runner-up}})^2}{n(N_{\text{leader}} + N_{\text{runner-up}}) - (N_{\text{leader}} - N_{\text{runner-up}})^2}, converging to SNR(Δ\Delta).
  • Entropy-based reward: rn2(Y)=jNjnlogNjnr^2_n(\mathcal{Y}) = \sum_j \frac{N_j}{n} \log \frac{N_j}{n} (negative Shannon entropy).

These objectives are optimized under a KL-regularization constraint, balancing sharpness and bias.

5. Extension to Martingale Mixtures and General Sequential Protocols

MMC generalizes to ensemble decision-making and sequential protocols where aggregated behavior is modeled via martingale mixtures. Consider a hypothesis space HH with discrete-time martingales {Mt(h)}\{M_t(h)\} and mixing distribution ρ\rho on HH, the mixture process being Eρ[Mt]\mathbb{E}_\rho[M_t]. PAC-Bayes bounds provide uniform error guarantees over all mixtures: Eρ[Mt]6(e2)Eρ[Vt](2lnln3(e2)Eρ[Vt]Eρ[Mt]+ln(2/δ)+KL(ρπ)).\mathbb{E}_{\rho}[M_t] \leq \sqrt{6(e-2)\mathbb{E}_\rho[V_t]\left(2\ln \ln \frac{3(e-2)\mathbb{E}_\rho[V_t]}{\mathbb{E}_\rho[M_t]} + \ln(2/\delta) + \text{KL}(\rho\|\pi)\right)}. Here VtV_t is the cumulative conditional variance, and the iterated logarithm (ln ln) factor ensures optimal scale-adaptation, especially in low-variance regimes (Balsubramani, 2015). This approach enables majority certificate guarantees in distributed or ensemble protocols, including cryptographic coin-tossing, where security bounds can be sharply quantified in terms of martingale gap functions (susceptibility curves) and stopping times (Khorasgani et al., 2019).

6. Quantitative Supermartingale Certificates and Control Synthesis

The supermartingale certificate methodology generalizes MMC to quantitative model checking and control. Given a shift-invariant specification LL, one synthesizes a stochastic invariant II and value function V0V_0 such that:

  • PV0(s)V0(s)PV_0(s) \leq V_0(s) for sIs \in I
  • V0(s)1V_0(s) \geq 1 for sIs \notin I

The probability of remaining in II forever, P(ΦIω)P(\Phi \in I^\omega), lower-bounds the satisfaction probability of LL. In finite-state systems, the bound is tight. The decomposition strategy supports both quantitative verification and control, applying certificate search algorithms (using polynomial templates and SMT/Positivstellensatz tools) to synthesize invariants and supermartingales (Abate et al., 7 Apr 2025). This framework can be viewed as providing a “majority certificate” for the satisfaction of sequential (often temporal) specifications in both stochastic verification and control synthesis.

7. Significance, Applications, and Generalizations

The Martingale Majority Certificate framework has broad implications:

  • In LLM inference, MMC specifies when majority voting is statistically reliable and quantifies sample sufficiency.
  • In distributed randomness generation and cryptographic protocols, MMC-style bounds quantify adversarial susceptibility and optimize protocol fairness/security.
  • In verification and control, supermartingale certificates ensure satisfaction probabilities for complex temporal specifications.
  • The statistical unification of self-consistency and test-time adaptation connects heuristic aggregation strategies to formal martingale theory and PAC-Bayes analysis, enabling the design of certificate-based adaptive algorithms for reliability.

A plausible implication is that MMC methodologies will deepen the connection between sequential statistical guarantees, algorithmic adaptivity, and ensemble decision reliability across AI, cryptography, and formal verification domains.


Table: Core Components of MMC in Different Domains

Domain Certification Target Mechanism
LLMs Modal correctness of majority vote Sequential e-process, SNR/entropy optimization
Distributed protocols Security / fairness of outcomes Martingale mixture bounds, gap analysis
Verification/Control Satisfaction probability bound Supermartingale certificate, stochastic invariants

The Martingale Majority Certificate serves as a rigorous, adaptive mechanism for certifying group decisions within the statistical structure provided by martingale theory, PAC-Bayes bounds, and quantitative supermartingale analysis.

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Martingale Majority Certificate (MMC).