Document Layout Analysis with Aesthetic-Guided Image Augmentation (2111.13809v1)

Published 27 Nov 2021 in cs.CV

Abstract: Document layout analysis (DLA) plays an important role in information extraction and document understanding. DLA has reached a milestone, but the analysis of non-Manhattan layouts remains a challenge. In this paper, we propose an image layer modeling method to tackle this challenge. To evaluate the proposed method, we introduce FPD, a manually labeled dataset for fine-grained segmentation of non-Manhattan layouts. As far as we know, FPD is the first manually labeled dataset of this kind. To effectively extract fine-grained features of documents, we propose an edge embedding network named L-E3Net. Experimental results show that the proposed image layer modeling method can better handle fine-grained segmentation of documents with non-Manhattan layouts.

Authors (7)
  1. Tianlong Ma (20 papers)
  2. Xingjiao Wu (26 papers)
  3. Xin Li (980 papers)
  4. Xiangcheng Du (11 papers)
  5. Zhao Zhou (9 papers)
  6. Liang Xue (13 papers)
  7. Cheng Jin (76 papers)
Citations (2)