Fair crosslingual computation of fertility
Develop a fair, crosslinguistically valid procedure for computing fertility—defined as the average number of tokens per word—that can be consistently applied across languages with differing notions of wordhood and orthographic conventions.
References
We thus opt not to use fertility, as it is unclear how to fairly calculate this across different languages.
— Explaining and Mitigating Crosslingual Tokenizer Inequities
(2510.21909 - Arnett et al., 24 Oct 2025) in Section 2.2 (Related Work — Measuring Compression)