Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Graph-based Deep Generative Modelling for Document Layout Generation (2107.04357v1)

Published 9 Jul 2021 in cs.CV and cs.LG

Abstract: One of the major prerequisites for any deep learning approach is the availability of large-scale training data. When dealing with scanned document images in real world scenarios, the principal information of its content is stored in the layout itself. In this work, we have proposed an automated deep generative model using Graph Neural Networks (GNNs) to generate synthetic data with highly variable and plausible document layouts that can be used to train document interpretation systems, in this case, specially in digital mailroom applications. It is also the first graph-based approach for document layout generation task experimented on administrative document images, in this case, invoices.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Sanket Biswas (31 papers)
  2. Pau Riba (13 papers)
  3. Umapada Pal (80 papers)
  4. Josep Lladós (40 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.