Persistence of Visual Understanding Weakness in Foundation Models
Ascertain whether contemporary foundation models will continue to exhibit relative weakness in visual understanding and spatial reasoning compared to textual capabilities, and determine if this disparity persists as model development progresses.
Sponsor
References
It is unclear if the trend of relative weakness in visual understanding will continue.
— HCAST: Human-Calibrated Autonomy Software Tasks
(2503.17354 - Rein et al., 21 Mar 2025) in Appendix, Section Related Work