Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Workshop on Document Intelligence Understanding (2307.16369v1)

Published 31 Jul 2023 in cs.IR and cs.CV

Abstract: Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically. Recently, there has been a rising demand for developing document understanding among different domains, including business, law, and medicine, to boost the efficiency of work that is associated with a large number of documents. This workshop aims to bring together researchers and industry developers in the field of document intelligence and understanding diverse document types to boost automatic document processing and understanding techniques. We also released a data challenge on the recently introduced document-level VQA dataset, PDFVQA. The PDFVQA challenge examines the structural and contextual understandings of proposed models on the natural full document level of multiple consecutive document pages by including questions with a sequence of answers extracted from multi-pages of the full document. This task helps to boost the document understanding step from the single-page level to the full document level understanding.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Soyeon Caren Han (48 papers)
  2. Yihao Ding (16 papers)
  3. Siwen Luo (14 papers)
  4. Josiah Poon (41 papers)
  5. HeeGuen Yoon (1 paper)
  6. Zhe Huang (57 papers)
  7. Paul Duuring (2 papers)
  8. Eun Jung Holden (1 paper)