Vanishing of higher magnitude homology for the LLM-induced generalized metric space of texts
Prove that for the generalized metric space M whose objects are all strings over a finite token alphabet that begin with the beginning-of-sentence token and have length at most N (optionally ending with the end-of-sentence token), and whose distance function is d(x,y) = -ln π(y|x) where π(y|x) equals 1 if y = x, equals the product of next-token probabilities along the unique right-extension path from x to y when y extends x, and equals 0 otherwise, the magnitude homology groups H_{k,ℓ}(M) vanish for all homological degrees k ≥ 2 and all lengths ℓ ≥ 0.
References
We conjecture that higher homology groups vanish.
— The Magnitude of Categories of Texts Enriched by Language Models
(2501.06662 - Bradley et al., 11 Jan 2025) in Subsection “Magnitude homology” (Section 3)