Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Document Layout Analysis via Dynamic Residual Feature Fusion (2104.02874v1)

Published 7 Apr 2021 in cs.CV

Abstract: The document layout analysis (DLA) aims to split the document image into different interest regions and understand the role of each region, which has wide application such as optical character recognition (OCR) systems and document retrieval. However, it is a challenge to build a DLA system because the training data is very limited and lacks an efficient model. In this paper, we propose an end-to-end united network named Dynamic Residual Fusion Network (DRFN) for the DLA task. Specifically, we design a dynamic residual feature fusion module which can fully utilize low-dimensional information and maintain high-dimensional category information. Besides, to deal with the model overfitting problem that is caused by lacking enough data, we propose the dynamic select mechanism for efficient fine-tuning in limited train data. We experiment with two challenging datasets and demonstrate the effectiveness of the proposed module.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Xingjiao Wu (26 papers)
  2. Ziling Hu (1 paper)
  3. Xiangcheng Du (11 papers)
  4. Jing Yang (320 papers)
  5. Liang He (202 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.