Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Cascaded Detail-Preserving Networks for Super-Resolution of Document Images (1911.10714v1)

Published 25 Nov 2019 in cs.CV

Abstract: The accuracy of OCR is usually affected by the quality of the input document image and different kinds of marred document images hamper the OCR results. Among these scenarios, the low-resolution image is a common and challenging case. In this paper, we propose the cascaded networks for document image super-resolution. Our model is composed by the Detail-Preserving Networks with small magnification. The loss function with perceptual terms is designed to simultaneously preserve the original patterns and enhance the edge of the characters. These networks are trained with the same architecture and different parameters and then assembled into a pipeline model with a larger magnification. The low-resolution images can upscale gradually by passing through each Detail-Preserving Network until the final high-resolution images. Through extensive experiments on two scanning document image datasets, we demonstrate that the proposed approach outperforms recent state-of-the-art image super-resolution methods, and combining it with standard OCR system lead to signification improvements on the recognition results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Zhichao Fu (2 papers)
  2. Yu Kong (37 papers)
  3. Yingbin Zheng (18 papers)
  4. Hao Ye (50 papers)
  5. Wenxin Hu (10 papers)
  6. Jing Yang (320 papers)
  7. Liang He (202 papers)
Citations (8)