Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DocSynthv2: A Practical Autoregressive Modeling for Document Generation (2406.08354v1)

Published 12 Jun 2024 in cs.CV, cs.AI, and cs.LG

Abstract: While the generation of document layouts has been extensively explored, comprehensive document generation encompassing both layout and content presents a more complex challenge. This paper delves into this advanced domain, proposing a novel approach called DocSynthv2 through the development of a simple yet effective autoregressive structured model. Our model, distinct in its integration of both layout and textual cues, marks a step beyond existing layout-generation approaches. By focusing on the relationship between the structural elements and the textual content within documents, we aim to generate cohesive and contextually relevant documents without any reliance on visual components. Through experimental studies on our curated benchmark for the new task, we demonstrate the ability of our model combining layout and textual information in enhancing the generation quality and relevance of documents, opening new pathways for research in document creation and automated design. Our findings emphasize the effectiveness of autoregressive models in handling complex document generation tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Sanket Biswas (31 papers)
  2. Rajiv Jain (20 papers)
  3. Vlad I. Morariu (31 papers)
  4. Jiuxiang Gu (73 papers)
  5. Puneet Mathur (22 papers)
  6. Curtis Wigington (13 papers)
  7. Tong Sun (49 papers)
  8. Josep Lladós (40 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com