Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Document Understanding Dataset and Evaluation (DUDE) (2305.08455v3)

Published 15 May 2023 in cs.CV, cs.CL, and cs.LG

Abstract: We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document layouts based on multi-industry, multi-domain, and multi-page VRDs of various origins, and dates. Moreover, we are pushing the boundaries of current methods by creating multi-task and multi-domain evaluation setups that more accurately simulate real-world situations where powerful generalization and adaptation under low-resource settings are desired. DUDE aims to set a new standard as a more practical, long-standing benchmark for the community, and we hope that it will lead to future extensions and contributions that address real-world challenges. Finally, our work illustrates the importance of finding more efficient ways to model language, images, and layout in DocAI.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (13)
  1. Jordy Van Landeghem (6 papers)
  2. Łukasz Borchmann (17 papers)
  3. Michał Pietruszka (9 papers)
  4. Paweł Józiak (7 papers)
  5. Rafał Powalski (5 papers)
  6. Dawid Jurkiewicz (7 papers)
  7. Mickaël Coustaty (15 papers)
  8. Bertrand Ackaert (1 paper)
  9. Ernest Valveny (28 papers)
  10. Matthew Blaschko (26 papers)
  11. Sien Moens (1 paper)
  12. Tomasz Stanisławek (7 papers)
  13. Rubén Tito (1 paper)
Citations (34)