Character queries : a transformer-based approach to on-line handwritten character segmentation

Jungo, Michael; Wolf, Beat; Maksai, Andrii; Musat, Claudiu; Fischer, Andreas

doi:10.1007/978-3-031-41676-7_6

Character queries : a transformer-based approach to on-line handwritten character segmentation

Jungo, Michael; Wolf, Beat; Maksai, Andrii; Musat, Claudiu; Fischer, Andreas

2023

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

On-line handwritten character segmentation is often associated with handwriting recognition and even though recognition models include mechanisms to locate relevant positions during the recognition process, it is typically insufficient to produce a precise segmentation. Decoupling the segmentation from the recognition unlocks the potential to further utilize the result of the recognition. We specifically focus on the scenario where the transcription is known beforehand, in which case the character segmentation becomes an assignment problem between sampling points of the stylus trajectory and characters in the text. Inspired by the k-means clustering algorithm, we view it from the perspective of cluster assignment and present a Transformer-based architecture where each cluster is formed based on a learned character query in the Transformer decoder block. In order to assess the quality of our approach, we create character segmentation ground truths for two popular on-line handwriting datasets, IAM-OnDB and HANDS-VNOnDB, and evaluate multiple methods on them, demonstrating that our approach achieves the overall best results.

Détails

Titre Character queries : a transformer-based approach to on-line handwritten character segmentation

Auteur(s)/ trice(s) Jungo, Michael (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences and Arts Western Switzerland ; University of Fribourg, Fribourg, Switzerland)
Wolf, Beat (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences and Arts Western Switzerland)
Maksai, Andrii (Google Research, Zurich, Switzerland)
Musat, Claudiu (Google Research, Zurich, Switzerland)
Fischer, Andreas (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences and Arts Western Switzerland ; University of Fribourg, Fribourg, Switzerland)

Date 2023-08

Publié dans Document analysis and recognition ICDAR 2023 ; Proceedings of the 17th International Conference, 21-26 August 2023, San José, CA, USA

Volume 1

Pages / Numéro d'article 98-114

Pagination 17 p.

Présenté à Document Analysis and Recognition - ICDAR 2023, San José, CA, USA, 2023-08-21, 2023-08-26

ISBN 978-3-031-41675-0

DOI https://doi.org/10.1007/978-3-031-41676-7_6

ISSN 0302-9743

Collection et n° Lecture Notes in Computer Science (LNCS), vol. 14187

Mots-clés (libres) on-line handwriting ; digital ink ; character segmentation ; transformer

Type de papier published full paper

Domaine Ingénierie et Architecture

Ecole HEIA-FR

Institut iCoSys - Institut des systèmes complexes

Le document apparaît dans Documents de conférences
Global

Résumé

Détails

Actions

PDF