DIVA-DAF: a deep learning framework for historical document image analysis

Vögtlin, Lars; Scius-Bertrand, Anna; Maergner, Paul; Fischer, Andreas; Ingold, Rolf

doi:10.1145/3604951.3605511

2023

Abstract

Deep learning methods have shown strong performance in solving tasks for historical document image analysis. However, despite current libraries and frameworks, programming an experiment or a set of experiments and executing them can be time-consuming. This is why we propose an open-source deep learning framework, DIVA-DAF, which is based on PyTorch Lightning and specifically designed for historical document analysis. Pre-implemented tasks such as segmentation and classification can be easily used or customized. It is also easy to create one's own tasks with the benefit of powerful modules for loading data, even large data sets, and different forms of ground truth. The applications conducted demonstrate time savings in programming a document analysis task, as well as in different scenarios such as pre-training or changing the architecture. Thanks to its data module, the framework also significantly reduces model training time.

Details

Title DIVA-DAF: a deep learning framework for historical document image analysis

Author(s) Vögtlin, Lars (University of Fribourg, Fribourg, Switzerland)
Scius-Bertrand, Anna (University of Fribourg, Fribourg, Switzerland)
Maergner, Paul (University of Fribourg, Fribourg, Switzerland)
Fischer, Andreas (University of Fribourg, Fribourg, Switzerland)
Ingold, Rolf (University of Fribourg, Fribourg, Switzerland)

Date 2023-08

Published in Proceedings of the 7th International Workshop on Historical Document Imaging and Processing (HIP'23), 25-26 August 2023, San José, CA, USA

Pages / Article number 61-66

Pagination 6 p.

Presented at 7th International Workshop on Historical Document Imaging and Processing, San José, CA, USA, 2023-08-25, 2023-08-26

ISBN 9798400708411

DOI https://doi.org/10.1145/3604951.3605511

Keywords (free) deep learning framework ; document image analysis ; historical documents ; deep neural networks

Paper type published full paper

Field Engineering and Architecture

School HEIA-FR

Institute iCoSys - Institute of Complex Systems

Note SCIUS-BERTRAND, Anna has been a researcher at HES-SO, HEIA-FR, since 2019. FISCHER, Andreas has been a researcher at HES-SO, HEIA-FR, since 2015.

This document appears in Conference papers
Global