Understanding the Countably Infinite: Neural Network Models of the Successor Function and its Acquisition
Abstract: As children enter elementary school, their understanding of the ordinal structure of numbers transitions from a memorized count list of the first 50-100 numbers to knowing the successor function and understanding the countably infinite. We investigate this developmental change in two neural network models that learn the successor function on the pairs (N, N+1) for N in (0, 98). The first uses a one-hot encoding of the input and output values and corresponds to children memorizing a count list, while the second model uses a place-value encoding and corresponds to children learning the language rules for naming numbers. The place-value model showed a predicted drop in representational similarity across tens boundaries. Counting across a tens boundary can be understood as a vector operation in 2D space, where the numbers with the same tens place are organized in a linearly separable manner, whereas those with the same ones place are grouped together. A curriculum learning simulation shows that, in the expanding numerical environment of the developing child, representations of smaller numbers continue to be sharpened even as larger numbers begin to be learned. These models set the stage for future work using recurrent architectures to move beyond learning the successor function to simulating the counting process more generally, and point towards a deeper understanding of what it means to understand the countably infinite.
- Ontogenetic origins of human integer representations. Trends in cognitive sciences, 23(10): 823–835.
- To infinity and beyond: Children generalize the successor function to all possible numbers years after learning to count. Cognitive Psychology, 92: 22–36.
- Training Verifiers to Solve Math Word Problems.
- A neural network solves, explains, and generates university math problems by program synthesis and few-shot learning at human level. Proceedings of the National Academy of Sciences, 119(32).
- Can a recurrent neural network learn to count things?
- Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others. In Ranzato, M.; Beygelzimer, A.; Dauphin, Y.; Liang, P.; and Vaughan, J. W., eds., Advances in Neural Information Processing Systems, volume 34, 9963–9976. Curran Associates, Inc.
- Recurrent nets that time and count. In Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium, volume 3, 189–194 vol.3.
- Giaquinto, M. 2001. Knowing numbers. The Journal of Philosophy, 98(1): 5–18.
- Is thirty-two three tens and two ones? The embedded structure of cardinal numbers. Cognition, 203.
- Artificial neural network language models align neurally and behaviorally with humans even after a developmentally realistic amount of training. bioRxiv.
- Ismailov, V. E. 2014. On the approximation by neural networks with bounded number of neurons in hidden layers. Journal of Mathematical Analysis and Applications, 417(2): 963–969.
- Exact Equality and Successor Function: Two Key Concepts on the Path towards Understanding Exact Numbers. Philosophical Psychology, 21(4): 491–505. PMID: 20165569.
- Solving Quantitative Reasoning Problems with Language Models.
- Preschool Origins of Cross-National Differences in Mathematical Competence: The Role of Number-Naming Systems. Psychological Science, 6: 56 – 60.
- Bootstrapping in a language of thought: A formal model of numerical concept learning. Cognition, 123: 199–217.
- The language user as an arithmetician. Cognition, 59: 219–237.
- How counting represents number: What children must learn and when they learn it. Cognition, 108(3): 662–674.
- Do children use language structure to discover the recursive rules of counting? Cognitive Psychology, 117: 101263.
- Spelke, E. S. 2017. Core Knowledge, Language, and Number. Language Learning and Development, 13(2): 147–170.
- Emergence of a ’visual number sense’ in hierarchical generative models. Nature Neuroscience, 15: 194–196.
- T., P. S. 2023. The algorithmic origins of counting. Child development.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.