
Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks (1710.07363v1)

Published 19 Oct 2017 in cs.CV

Abstract: In this paper, we present a novel approach to layer-wise weight initialization of deep neural networks using Linear Discriminant Analysis (LDA). Typically, the weights of a deep neural network are initialized with random values, with greedy layer-wise pre-training (usually as a Deep Belief Network or as an auto-encoder), or by re-using the layers from another network (transfer learning). Hence, either many training epochs are needed before meaningful weights are learned, or a rather similar dataset is required to seed fine-tuning via transfer learning. In this paper, we describe how to turn an LDA into either a neural layer or a classification layer. We analyze the initialization technique on historical documents. First, we show that an LDA-based initialization is quick and leads to a very stable initialization. Furthermore, for the task of layout analysis at pixel level, we investigate the effectiveness of LDA-based initialization and show that it outperforms state-of-the-art random weight initialization methods.
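
To make the idea of "turning an LDA into a classification layer" concrete, the following is a minimal sketch of the simplest case: a two-class Fisher discriminant whose direction and threshold are used as the initial weights and bias of a single linear neuron. It is not the paper's procedure, which the abstract does not spell out; the function names (`lda_init`, `neuron`), the small ridge term, and the pure-Python linear solver are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def mean(rows: Sequence[Sequence[float]]) -> Vector:
    """Column-wise mean of a non-empty list of equal-length rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def within_class_scatter(classes: Sequence[Sequence[Sequence[float]]]) -> Matrix:
    """Sum over classes of (x - mu_c)(x - mu_c)^T."""
    d = len(classes[0][0])
    s = [[0.0] * d for _ in range(d)]
    for rows in classes:
        mu = mean(rows)
        for x in rows:
            diff = [xi - mi for xi, mi in zip(x, mu)]
            for i in range(d):
                for j in range(d):
                    s[i][j] += diff[i] * diff[j]
    return s


def solve(a: Matrix, b: Vector) -> Vector:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [bi] for row, bi in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("scatter matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def lda_init(class0: Sequence[Sequence[float]],
             class1: Sequence[Sequence[float]],
             ridge: float = 1e-6) -> Tuple[Vector, float]:
    """Two-class Fisher LDA: w = Sw^{-1} (mu1 - mu0), bias at the class midpoint.

    The ridge term is an assumption added here to keep Sw invertible.
    """
    mu0, mu1 = mean(class0), mean(class1)
    sw = within_class_scatter([class0, class1])
    for i in range(len(sw)):
        sw[i][i] += ridge
    w = solve(sw, [b - a for a, b in zip(mu0, mu1)])
    midpoint = [(a + b) / 2 for a, b in zip(mu0, mu1)]
    bias = -sum(wi * mi for wi, mi in zip(w, midpoint))
    return w, bias


def neuron(w: Vector, bias: float, x: Sequence[float]) -> float:
    """Pre-activation of a linear neuron initialized from LDA."""
    return sum(wi * xi for wi, xi in zip(w, x)) + bias
```

With weights set this way, the neuron's pre-activation is negative for points nearer the first class mean (measured along the discriminant direction) and positive for points nearer the second, so the layer already separates the classes before any gradient step. A multi-class or multi-neuron layer would instead use several discriminant directions, which this sketch does not cover.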

Authors (5)
  1. Michele Alberti (20 papers)
  2. Mathias Seuret (23 papers)
  3. Vinaychandran Pondenkandath (13 papers)
  4. Rolf Ingold (21 papers)
  5. Marcus Liwicki (86 papers)
Citations (17)
