The Bullinger dataset : a writer adaptation challenge

Scius-Bertrand, Anna; Ströbel, Phillip; Volk, Martin; Hodel, Tobias; Fischer, Andreas

doi:10.1007/978-3-031-41676-7_23

Scius-Bertrand, Anna; Ströbel, Phillip; Volk, Martin; Hodel, Tobias; Fischer, Andreas

2023

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

One of the main challenges of automatically transcribing large collections of handwritten letters is to cope with the high variability of writing styles present in the collection. In particular, the writing styles of non-frequent writers, who have contributed only few letters, are often missing in the annotated learning samples used for training handwriting recognition systems. In this paper, we introduce the Bullinger dataset for writer adaptation, which is based on the Heinrich Bullinger letter collection from the 16th century, using a subset of 3,622 annotated letters (about 1.2 million words) from 306 writers. We provide baseline results for handwriting recognition with modern recognizers, before and after the application of standard techniques for supervised adaptation of frequent writers and self-supervised adaptation of non-frequent writers.

Détails

Titre The Bullinger dataset : a writer adaptation challenge

Auteur(s)/ trice(s) Scius-Bertrand, Anna (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences and Arts Western Switzerland ; University of Fribourg, Fribourg, Switzerland)
Ströbel, Phillip (University of Zurich, Zurich, Switzerland)
Volk, Martin (University of Zurich, Zurich, Switzerland)
Hodel, Tobias (University of Bern, Bern, Switzerland)
Fischer, Andreas (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences and Arts Western Switzerland ; University of Fribourg, Fribourg, Switzerland)

Date 2023-08

Publié dans Document analysis and recognition ICDAR 2023 ; Proceedings of the 17th International Conference, 21-26 August 2023, San José, CA, USA

Volume 1

Pages / Numéro d'article 397-410

Pagination 14 p.

Présenté à Document Analysis and Recognition - ICDAR 2023, San José, CA, USA, 2023-08-21, 2023-08-26

ISBN 978-3-031-41675-0

DOI https://doi.org/10.1007/978-3-031-41676-7_23

ISSN 0302-9743

Collection et n° Lecture Notes in Computer Science (LNCS), vol. 14187

Mots-clés (libres) handwriting recognition ; writer adaptation ; historical documents ; handwritten letters

Type de papier published full paper

Domaine Ingénierie et Architecture

Ecole HEIA-FR

Institut iCoSys - Institut des systèmes complexes

Le document apparaît dans Documents de conférences
Global

Résumé

Détails

Actions

PDF