Claim-Conditioned Probability (CCP)
- Claim-Conditioned Probability (CCP) is a framework that generalizes traditional conditional probability through conditional expectations defined on plausible preorders over algebraic structures.
- It employs algebraic and measure-free constructions to extend inference to cases with zero-probability events and supports robust probabilistic reasoning.
- CCP is applied in neural language models to quantify token-level uncertainty, demonstrating improved performance in detecting hallucinations compared to standard metrics.
Claim-Conditioned Probability (CCP) is a formal framework designed to generalize the notion of conditional probability to settings where classical constructions break down, including cases involving conditioning on claims or events of probability zero and contexts such as uncertainty quantification in neural text generation. This framework arises both as a mathematical foundation for conditional inference and as a practical tool for evaluating the epistemic confidence of models or knowledge-based systems under various forms of evidence and logical structure.
1. Mathematical Foundations of Conditioning and Preorders
The core of CCP in recent mathematical treatments is the reinterpretation of probability as a special case of conditional expectation induced by a plausible preorder on a commutative real algebra of random quantities. A random quantity is an element of a commutative real algebra ; events are idempotent elements with , forming a Boolean algebra under the operations:
- Conjunction:
- Negation:
- Disjunction:
A plausible preorder on satisfies:
- Plausibility: If is an event, .
- Additivity: If 0 and 1, then 2.
- Homogeneity: If 3 and 4 (real), then 5.
- Extension: 6 iff 7.
For 8, the expectation induced by 9 is 0, with well-defined behavior for infinite or coincident extrema. Conditional expectation is defined for any event 1 by 2, and 3 is then the expectation under this conditional preorder.
A partial function 4 is called coherent if and only if it arises as the restriction of some 5 for a plausible preorder, yielding a general characterization theorem for conditional probabilities that includes degenerate cases 6 (Mečíř, 2019).
2. Coherent Conditional Probability and Claims
Applying this machinery, for any event 7 and "claim" 8 (not necessarily with 9), define the claim-conditioned probability as
0
with 1 as above. Properties include:
- 2 for any 3
- Monotonicity: if 4, 5
- Subadditivity and Bayes' rule hold in full generality
- 6 if 7 and 8 are disjoint; 9 if 0 implies 1
- Definition extends to 2 with 3 or 4 undefined
This construction provides a rigorous and order-theoretically sound basis for conditional reasoning not limited by the usual restrictions of the Kolmogorov framework (Mečíř, 2019).
3. Measure-Free Algebraic Constructions
Independently, a measure-free approach to conditional probability constructs an extended algebra of events and "conditional objects" 5 (for 6 events in a Boolean algebra 7), defined as principal-ideal cosets:
8
These new objects admit a semantics in which any (finitely additive) 9 consistently extends to 0 when 1. The construction is minimal and necessary for consistently interpreting inference rules as conditional statements in probabilistic expert systems (Goodman, 2013).
Key algebraic properties include:
- Identities
- 2
- 3
- 4
- Bayes’ rule and chaining law for conditional objects
- Computations on conditional objects are polynomial in the size of the Boolean representation of 5
This algebra provides a syntax and calculus for conditional reasoning, supporting applications in knowledge-based inference, evidence fusion, and sequential updating (Goodman, 2013).
4. Claim-Conditioned Probability in LLMs
Recently, CCP has been operationalized as a metric of token-level uncertainty in autoregressive neural LLMs, with direct application to hallucination detection and model calibration (Fadeeva et al., 2024). For a set of tokens associated with an atomic claim, CCP is defined as follows.
At each position 6, let 7 be the top-8 candidate tokens given context 9. For each 0, a pretrained NLI (Natural Language Inference) model determines the logical relation (entailment, contradiction, neutral) with the reference output 1, distinguishing between shared claim-type and factual value.
The token-level CCP is:
2
where the numerator sums probabilities over alternatives entailing 3 (same claim value), and the denominator over alternatives with the same claim type (could be contradictory values). This design factors out uncertainty about claim-type and surface form, isolating factual uncertainty.
Claim-level CCP aggregates as
4
A high 5 implicates high uncertainty and probable hallucination.
5. Empirical Evaluation and Implementation Notes
The CCP metric, as applied to LLM outputs, systematically outperforms baselines such as maximum token probability, entropy, and "ask-the-model" scores for factuality discrimination. Experimental setups have used various open and proprietary models (Vicuna 13b, Mistral 7b, Jais 13b, GPT-3.5, GPT-4, Yi 6B), with automatic and human annotation of claims in biography-generation tasks across multiple languages.
Quantitative results on ROC-AUC for discriminating supported vs. false claims show consistent superiority of CCP by 6–7 points over baselines. Example cases demonstrate that CCP successfully isolates uncertainty localized at factual content, while other scores respond to irrelevant surface variability or claim-type uncertainty. Implementation involves top-8 beam search, efficient NLI evaluation, and computational tractability with 9–0; product-based aggregation at claim level maximizes detection accuracy (Fadeeva et al., 2024).
6. Practical Applications and Theoretical Significance
CCP establishes a uniform formalism that:
- Admits order-theoretic, algebraic (measure-free), and neural uncertainty quantification instantiations
- Generalizes conditional probability for inference rules, evidence combination, and hypothesis updating even when conditioning on zero-probability or undefined events
- Provides rigorous, computable frameworks for expert systems and for white-box hallucination detection in generative neural models
Since CCP is underpinned by a minimal extension of classical structures (plausible preorders, coset algebras), its integrity and computational feasibility are maintained across logic-based and data-driven systems (Mečíř, 2019, Goodman, 2013, Fadeeva et al., 2024).
7. Relation to Broader Frameworks and Limitations
The CCP methodology unifies aspects of conditional probability under a common umbrella, admitting both order-theoretic and algebraic perspectives, and linking formal epistemic reasoning with statistical and neural machine learning contexts. It circumvents traditional singularities (e.g., 1) and supports generalization of inference rules. However, the operational definition of claims, and the practical implementation of NLI-based CCP in language generation, depend on system-specific architectures and assumptions about token-claim alignment and NLI reliability.
A plausible implication is that CCP provides a stable substrate for future research on fact-checking, uncertainty quantification, and reasoning under both symbolic and statistical paradigms. Limitations may arise insofar as the quality of the underlying models (e.g., NLI, atomization of claims) influences CCP’s discriminative power in empirical contexts (Mečíř, 2019, Fadeeva et al., 2024, Goodman, 2013).