Performance ceiling of purely end-to-end OCR architectures
Ascertain the ultimate performance ceiling of purely end-to-end optical character recognition architectures relative to heavily optimized two-stage pipeline OCR systems by determining whether end-to-end models can fully match or surpass pipeline approaches under variations in architecture, training strategies, and data scaling laws.
References
As a pioneering attempt at end-to-end OCR, the ultimate performance ceiling of purely end-to-end architectures remains an open question -- future work should systematically explore architectural innovations, training strategies, and data scaling laws to determine whether end-to-end models can fully match or surpass heavily optimized pipeline systems.
— Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
(2603.13398 - Dong et al., 11 Mar 2026) in Section 7: Limitations and Future Work — Performance Ceiling