Scaling behavior of tabular reasoning with increasing input complexity
Characterize how language model performance on tabular reasoning scales as input complexity increases, including the effects of larger tables and longer contexts on reasoning accuracy.
Sponsor
References
Moreover, they often overlook key factors such as table size, leaving open questions about how tabular reasoning scales with increasing input complexity.
— RADAR: Benchmarking Language Models on Imperfect Tabular Data
(2506.08249 - Gu et al., 9 Jun 2025) in Section 2: Background and Related Work