Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DIVA-DAF: A Deep Learning Framework for Historical Document Image Analysis (2201.08295v3)

Published 20 Jan 2022 in cs.CV

Abstract: Deep learning methods have shown strong performance in solving tasks for historical document image analysis. However, despite current libraries and frameworks, programming an experiment or a set of experiments and executing them can be time-consuming. This is why we propose an open-source deep learning framework, DIVA-DAF, which is based on PyTorch Lightning and specifically designed for historical document analysis. Pre-implemented tasks such as segmentation and classification can be easily used or customized. It is also easy to create one's own tasks with the benefit of powerful modules for loading data, even large data sets, and different forms of ground truth. The applications conducted have demonstrated time savings for the programming of a document analysis task, as well as for different scenarios such as pre-training or changing the architecture. Thanks to its data module, the framework also allows to reduce the time of model training significantly.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Lars Vögtlin (9 papers)
  2. Anna Scius-Bertrand (4 papers)
  3. Paul Maergner (3 papers)
  4. Andreas Fischer (54 papers)
  5. Rolf Ingold (21 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.