Conjecture: Clock algorithm chosen to improve accuracy
Investigate whether transformer-based large language models preferentially use the Clock algorithm to perform addition in order to improve accuracy relative to linear number representations, establishing that helix-based representations confer accuracy benefits analogous to human use of decimal digits.
References
While LLMs could do addition linearly, we conjecture that LLMs use the Clock algorithm to improve accuracy, analogous to humans using decimal digits (which are a generalized helix with $T = [10,100,\dots]$) for addition rather than slide rules.
— Language Models Use Trigonometry to Do Addition
(2502.00873 - Kantamneni et al., 2 Feb 2025) in Conclusion