Principal Semantic Vector (PSV)
- Principal Semantic Vector (PSV) is a continuous semantic representation that distills and propagates the core logical essence from an LLM’s internal reasoning trace.
- It extracts critical tokens using Global Causal Contribution and Competitive Persistence Scoring, then applies PCA to determine a dominant semantic direction for watermark guidance.
- Empirical results demonstrate that PSV-guided watermark embedding lowers perplexity and achieves high detection AUC, ensuring robust and contextually coherent answer generation.
The Principal Semantic Vector (PSV) is a continuous semantic representation designed to distill and propagate the logical essence of a reasoning trace in LLMs. Within the ReasonMark watermarking framework, the PSV acts as the dynamic semantic compass that bridges the undisturbed Thinking Phase of model inference with the Answering Phase, enabling robust, contextually adaptive watermark embedding. By extracting the most semantically pivotal tokens from the chain-of-thought and aggregating their embeddings into a principal direction, the PSV ensures that watermark bias aligns with the evolving logic of generation, preserving answer correctness while facilitating high-fidelity, detectable watermarking (Liu et al., 8 Jan 2026).
1. Segmentation of Reasoning and Answer Generation
ReasonMark operationalizes two strictly distinct stages for watermarking in reasoning-intensive LLMs:
- Thinking Phase : The model computes internal token probabilities , progressing the reasoning trajectory with no external emission.
- Answering Phase : The model generates the final output, modulating each token's watermark bias according to its semantic alignment with the evolving PSV.
The PSV, denoted , is initialized by distilling information from :
This function captures the “semantic essence” of the internal chain-of-thought.
2. Criticality Score: Identifying Semantically Pivotal Tokens
To extract tokens that anchor the reasoning trajectory, ReasonMark computes a Criticality Score (CS) for each vocabulary word based on two components:
- Global Causal Contribution (GCC): Quantifies a token’s probabilistic influence across reasoning steps, weighted by Jensen–Shannon divergence and normalized cosine similarity of distributions.
- : Probability of at step .
- : Measures distributional shift.
- : Propagates influence through similarity.
- Competitive Persistence Scoring (CPS): Measures how persistently a token remains salient and competitive.
- : Inverse surprisal of generated token.
- : Logit margin against the strongest competitor.
- : Persistence in the candidate set.
The aggregate score,
selects the top- “Critical Tokens” , encapsulating the most semantically influential elements of the chain-of-thought.
3. Construction and Updating of the Principal Semantic Vector
Upon identifying , ReasonMark forms an embedding matrix :
where maps tokens to their embedding space of dimension . Principal Component Analysis is applied:
yielding the dominant semantic direction. As answer generation proceeds, the PSV evolves:
with modulated by semantic alignment, ensuring that remains anchored to the reasoning trajectory while adapting to the generated content.
4. PSV-Guided Watermark Embedding
During the Answering Phase, watermark bias for each candidate token is determined by its cosine similarity with the current PSV:
The logit of each candidate in the “green list” is augmented:
where is the base strength and controls semantic sensitivity.
Tokens with high alignment receive stronger watermark bias, whereas those less aligned are perturbed minimally—preserving logical integrity and answer quality.
5. Step-by-Step Algorithmic Workflow
The complete mechanism for PSV extraction and guided watermarking can be summarized as:
| Phase | Step | Key Operations |
|---|---|---|
| Thinking | Compute , | Select top- critical tokens |
| Build ; PCA to get | Semantic distillation of chain-of-thought | |
| Answering | Initialize | Partition vocabulary; compute , update logits |
| Sample ; EMA update of | Dynamic semantic compass during generation |
This procedure enables a tightly synchronized watermark signal that is semantically adaptive and computationally efficient.
6. Empirical Performance and Robustness
Extensive benchmarking demonstrates the PSV’s centrality. ReasonMark achieves lowest perplexity (10.31, C4) and highest detection AUC (99.31%) compared to KGW and others. In machine translation (WMT16 DE–EN), a top BLEU score (9.916) and AUC (87.25%) are reported. In mathematical tasks (AIME, GSM8K), accuracy is on par with or better than unwatermarked baselines, while watermark detection remains at AUC .
Ablation studies dissect PSV contributions:
- Replacing CTs (Critical Tokens) with random tokens raises C4 perplexity from 10.31 to 12.88.
- Removing GCC increases perplexity to 11.15.
- Removing CPS reduces AUC from 99.31% to 98.69%.
Attack robustness tests indicate ReasonMark, via PSV guidance, retains high detection under word-level and semantic paraphrasing ( AUC) and under translation attacks ( AUC), outperforming prior methods.
7. Context and Implications
The development of the PSV within ReasonMark addresses central challenges in watermarking reasoning LLMs: avoiding logical corruption, controlling computational latency, and ensuring adversarial robustness. The compact, dynamic nature of the PSV allows for semantic harmony between the reasoning and generation steps, directly modulating watermark strength where the reasoning trajectory is most salient.
A plausible implication is that dynamic, semantically grounded watermark mechanisms such as PSV enable traceable, trustworthy deployment of LLMs in sensitive, real-world environments, promoting accountability without sacrificing solution quality or efficiency (Liu et al., 8 Jan 2026).