Papers
Topics
Authors
Recent
Search
2000 character limit reached

Claim-Conditioned Probability (CCP)

Updated 19 February 2026
  • Claim-Conditioned Probability (CCP) is a framework that generalizes traditional conditional probability through conditional expectations defined on plausible preorders over algebraic structures.
  • It employs algebraic and measure-free constructions to extend inference to cases with zero-probability events and supports robust probabilistic reasoning.
  • CCP is applied in neural language models to quantify token-level uncertainty, demonstrating improved performance in detecting hallucinations compared to standard metrics.

Claim-Conditioned Probability (CCP) is a formal framework designed to generalize the notion of conditional probability to settings where classical constructions break down, including cases involving conditioning on claims or events of probability zero and contexts such as uncertainty quantification in neural text generation. This framework arises both as a mathematical foundation for conditional inference and as a practical tool for evaluating the epistemic confidence of models or knowledge-based systems under various forms of evidence and logical structure.

1. Mathematical Foundations of Conditioning and Preorders

The core of CCP in recent mathematical treatments is the reinterpretation of probability as a special case of conditional expectation induced by a plausible preorder on a commutative real algebra of random quantities. A random quantity is an element of a commutative real algebra TT; events are idempotent elements ATA\in T with AA=AA\cdot A=A, forming a Boolean algebra under the operations:

  • Conjunction: AB:=ABA\wedge B := A\cdot B
  • Negation: ¬A:=1A\neg A := 1-A
  • Disjunction: AB:=A+BABA\vee B := A+B-A\cdot B

A plausible preorder \leq on TT satisfies:

  1. Plausibility: If AA is an event, 0A0\leq A.
  2. Additivity: If ATA\in T0 and ATA\in T1, then ATA\in T2.
  3. Homogeneity: If ATA\in T3 and ATA\in T4 (real), then ATA\in T5.
  4. Extension: ATA\in T6 iff ATA\in T7.

For ATA\in T8, the expectation induced by ATA\in T9 is AA=AA\cdot A=A0, with well-defined behavior for infinite or coincident extrema. Conditional expectation is defined for any event AA=AA\cdot A=A1 by AA=AA\cdot A=A2, and AA=AA\cdot A=A3 is then the expectation under this conditional preorder.

A partial function AA=AA\cdot A=A4 is called coherent if and only if it arises as the restriction of some AA=AA\cdot A=A5 for a plausible preorder, yielding a general characterization theorem for conditional probabilities that includes degenerate cases AA=AA\cdot A=A6 (Mečíř, 2019).

2. Coherent Conditional Probability and Claims

Applying this machinery, for any event AA=AA\cdot A=A7 and "claim" AA=AA\cdot A=A8 (not necessarily with AA=AA\cdot A=A9), define the claim-conditioned probability as

AB:=ABA\wedge B := A\cdot B0

with AB:=ABA\wedge B := A\cdot B1 as above. Properties include:

  • AB:=ABA\wedge B := A\cdot B2 for any AB:=ABA\wedge B := A\cdot B3
  • Monotonicity: if AB:=ABA\wedge B := A\cdot B4, AB:=ABA\wedge B := A\cdot B5
  • Subadditivity and Bayes' rule hold in full generality
  • AB:=ABA\wedge B := A\cdot B6 if AB:=ABA\wedge B := A\cdot B7 and AB:=ABA\wedge B := A\cdot B8 are disjoint; AB:=ABA\wedge B := A\cdot B9 if ¬A:=1A\neg A := 1-A0 implies ¬A:=1A\neg A := 1-A1
  • Definition extends to ¬A:=1A\neg A := 1-A2 with ¬A:=1A\neg A := 1-A3 or ¬A:=1A\neg A := 1-A4 undefined

This construction provides a rigorous and order-theoretically sound basis for conditional reasoning not limited by the usual restrictions of the Kolmogorov framework (Mečíř, 2019).

3. Measure-Free Algebraic Constructions

Independently, a measure-free approach to conditional probability constructs an extended algebra of events and "conditional objects" ¬A:=1A\neg A := 1-A5 (for ¬A:=1A\neg A := 1-A6 events in a Boolean algebra ¬A:=1A\neg A := 1-A7), defined as principal-ideal cosets:

¬A:=1A\neg A := 1-A8

These new objects admit a semantics in which any (finitely additive) ¬A:=1A\neg A := 1-A9 consistently extends to AB:=A+BABA\vee B := A+B-A\cdot B0 when AB:=A+BABA\vee B := A+B-A\cdot B1. The construction is minimal and necessary for consistently interpreting inference rules as conditional statements in probabilistic expert systems (Goodman, 2013).

Key algebraic properties include:

  • Identities
    • AB:=A+BABA\vee B := A+B-A\cdot B2
    • AB:=A+BABA\vee B := A+B-A\cdot B3
    • AB:=A+BABA\vee B := A+B-A\cdot B4
  • Bayes’ rule and chaining law for conditional objects
  • Computations on conditional objects are polynomial in the size of the Boolean representation of AB:=A+BABA\vee B := A+B-A\cdot B5

This algebra provides a syntax and calculus for conditional reasoning, supporting applications in knowledge-based inference, evidence fusion, and sequential updating (Goodman, 2013).

4. Claim-Conditioned Probability in LLMs

Recently, CCP has been operationalized as a metric of token-level uncertainty in autoregressive neural LLMs, with direct application to hallucination detection and model calibration (Fadeeva et al., 2024). For a set of tokens associated with an atomic claim, CCP is defined as follows.

At each position AB:=A+BABA\vee B := A+B-A\cdot B6, let AB:=A+BABA\vee B := A+B-A\cdot B7 be the top-AB:=A+BABA\vee B := A+B-A\cdot B8 candidate tokens given context AB:=A+BABA\vee B := A+B-A\cdot B9. For each \leq0, a pretrained NLI (Natural Language Inference) model determines the logical relation (entailment, contradiction, neutral) with the reference output \leq1, distinguishing between shared claim-type and factual value.

The token-level CCP is:

\leq2

where the numerator sums probabilities over alternatives entailing \leq3 (same claim value), and the denominator over alternatives with the same claim type (could be contradictory values). This design factors out uncertainty about claim-type and surface form, isolating factual uncertainty.

Claim-level CCP aggregates as

\leq4

A high \leq5 implicates high uncertainty and probable hallucination.

5. Empirical Evaluation and Implementation Notes

The CCP metric, as applied to LLM outputs, systematically outperforms baselines such as maximum token probability, entropy, and "ask-the-model" scores for factuality discrimination. Experimental setups have used various open and proprietary models (Vicuna 13b, Mistral 7b, Jais 13b, GPT-3.5, GPT-4, Yi 6B), with automatic and human annotation of claims in biography-generation tasks across multiple languages.

Quantitative results on ROC-AUC for discriminating supported vs. false claims show consistent superiority of CCP by \leq6–\leq7 points over baselines. Example cases demonstrate that CCP successfully isolates uncertainty localized at factual content, while other scores respond to irrelevant surface variability or claim-type uncertainty. Implementation involves top-\leq8 beam search, efficient NLI evaluation, and computational tractability with \leq9–TT0; product-based aggregation at claim level maximizes detection accuracy (Fadeeva et al., 2024).

6. Practical Applications and Theoretical Significance

CCP establishes a uniform formalism that:

  • Admits order-theoretic, algebraic (measure-free), and neural uncertainty quantification instantiations
  • Generalizes conditional probability for inference rules, evidence combination, and hypothesis updating even when conditioning on zero-probability or undefined events
  • Provides rigorous, computable frameworks for expert systems and for white-box hallucination detection in generative neural models

Since CCP is underpinned by a minimal extension of classical structures (plausible preorders, coset algebras), its integrity and computational feasibility are maintained across logic-based and data-driven systems (Mečíř, 2019, Goodman, 2013, Fadeeva et al., 2024).

7. Relation to Broader Frameworks and Limitations

The CCP methodology unifies aspects of conditional probability under a common umbrella, admitting both order-theoretic and algebraic perspectives, and linking formal epistemic reasoning with statistical and neural machine learning contexts. It circumvents traditional singularities (e.g., TT1) and supports generalization of inference rules. However, the operational definition of claims, and the practical implementation of NLI-based CCP in language generation, depend on system-specific architectures and assumptions about token-claim alignment and NLI reliability.

A plausible implication is that CCP provides a stable substrate for future research on fact-checking, uncertainty quantification, and reasoning under both symbolic and statistical paradigms. Limitations may arise insofar as the quality of the underlying models (e.g., NLI, atomization of claims) influences CCP’s discriminative power in empirical contexts (Mečíř, 2019, Fadeeva et al., 2024, Goodman, 2013).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (3)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Claim-Conditioned Probability (CCP).