Transcription alignment of historical vietnamese manuscripts without human-annotated learning samples

Scius-Bertrand, Ann; Jungo, Michael; Wolf, Beat; Fischer, Andreas; Bui, Marc

doi:10.3390/app11114894

Scius-Bertrand, Ann; Jungo, Michael; Wolf, Beat; Fischer, Andreas; Bui, Marc

2021

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

The current state of the art for automatic transcription of historical manuscripts is typically limited by the requirement of human-annotated learning samples, which are are necessary to train specific machine learning models for specific languages and scripts. Transcription alignment is a simpler task that aims to find a correspondence between text in the scanned image and its existing Unicode counterpart, a correspondence which can then be used as training data. The alignment task can be approached with heuristic methods dedicated to certain types of manuscripts, or with weakly trained systems reducing the required amount of annotations. In this article, we propose a novel learning-based alignment method based on fully convolutional object detection that does not require any human annotation at all. Instead, the object detection system is initially trained on synthetic printed pages using a font and then adapted to the real manuscripts by means of self-training. On a dataset of historical Vietnamese handwriting, we demonstrate the feasibility of annotation-free alignment as well as the positive impact of self-training on the character detection accuracy, reaching a detection accuracy of 96.4% with a YOLOv5m model without using any human annotation.

Détails

Titre Transcription alignment of historical vietnamese manuscripts without human-annotated learning samples

Auteur(s)/ trice(s) Scius-Bertrand, Ann (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland ; Ecole Pratique des Hautes Études (PSL), Paris, France)
Jungo, Michael (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland)
Wolf, Beat (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland)
Fischer, Andreas (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland ; University of Fribourg, Fribourg, Switzerland)
Bui, Marc (Ecole Pratique des Hautes Etudes (PSL), Paris, France)

Date 2021-05

Publié dans Applied Sciences

Volume 2021, vol. 11, no. 11, article no. 4894

Pagination 18 p.

DOI https://doi.org/10.3390/app11114894

ISSN 2076-3417

Mots-clés (libres) transcription alignment ; object detection ; self-training ; YOLO ; chu nom characters

Type d'article scientifique

Domaine Ingénierie et Architecture

Ecole HEIA-FR

Institut iCoSys - Institut des systèmes complexes

Le document apparaît dans Articles scientifiques
Global

Résumé

Détails

Actions

PDF