Page segmentation of historical document images with convolutional autoencoders

Chen, Kai; Seuret, Mathias; Liwicki, Marcus; Hennebert, Jean; Ingold, Rolf

doi:10.1109/ICDAR.2015.7333914

2015

Abstract

In this paper, we present an unsupervised feature learning method for page segmentation of historical handwritten documents available as color images. We consider page segmentation as a pixel labeling problem, i.e., each pixel is classified as either periphery, background, text block, or decoration. Traditional methods in this area rely on carefully hand-crafted features or large amounts of prior knowledge. In contrast, we apply convolutional autoencoders to learn features directly from pixel intensity values. Then, using these features to train an SVM, we achieve high quality segmentation without any assumption of specific topologies and shapes. Experiments on three public datasets demonstrate the effectiveness and superiority of the proposed approach.
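
As a rough illustration of the pipeline the abstract describes (features learned without labels from raw pixel intensities, then an SVM that assigns one of the four classes to each pixel), here is a minimal Python sketch. It is not the authors' implementation: the synthetic toy page, the 5x5 patch size, the single-hidden-layer autoencoder applied to the patch around every pixel, the RBF kernel, and all function names are assumptions made for illustration. It assumes NumPy and scikit-learn are installed.

```python
import numpy as np
from sklearn.svm import SVC

CLASSES = ("periphery", "background", "text block", "decoration")

def make_toy_page(size=24, seed=0):
    """Hypothetical stand-in for a scanned page: an RGB image plus a label map."""
    rng = np.random.default_rng(seed)
    labels = np.zeros((size, size), dtype=int)   # 0 = periphery
    labels[3:-3, 3:-3] = 1                       # 1 = background (page area)
    labels[6:-6, 6:14] = 2                       # 2 = text block
    labels[6:12, 15:-6] = 3                      # 3 = decoration
    palette = np.array([[0.1, 0.1, 0.1], [0.9, 0.85, 0.7],
                        [0.3, 0.25, 0.2], [0.7, 0.2, 0.2]])
    image = palette[labels] + rng.normal(0, 0.05, (size, size, 3))
    return np.clip(image, 0, 1), labels

def extract_patches(image, centers, size=5):
    """Flattened size x size colour patches centred on the given (row, col) pixels."""
    half = size // 2
    padded = np.pad(image, ((half, half), (half, half), (0, 0)), mode="reflect")
    return np.stack([padded[r:r + size, c:c + size].ravel() for r, c in centers])

def train_autoencoder(X, n_hidden=8, epochs=300, lr=0.1, seed=0):
    """Fit encoder/decoder weights by full-batch gradient descent on the
    mean squared reconstruction error. No class labels are used."""
    rng = np.random.default_rng(seed)
    n_in = X.shape[1]
    W1, b1 = rng.normal(0, 0.1, (n_in, n_hidden)), np.zeros(n_hidden)
    W2, b2 = rng.normal(0, 0.1, (n_hidden, n_in)), np.zeros(n_in)
    losses = []
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)          # encoder
        err = (H @ W2 + b2) - X           # decoder output minus input
        losses.append(float(np.mean(err ** 2)))
        g_R = 2 * err / err.size          # gradient of the loss w.r.t. the reconstruction
        g_Z = (g_R @ W2.T) * (1 - H ** 2) # back through tanh
        W2 -= lr * (H.T @ g_R)
        b2 -= lr * g_R.sum(axis=0)
        W1 -= lr * (X.T @ g_Z)
        b1 -= lr * g_Z.sum(axis=0)
    return (W1, b1), losses

def encode(X, params):
    """Map patches to learned features with the trained encoder."""
    W1, b1 = params
    return np.tanh(X @ W1 + b1)

image, truth = make_toy_page()
rows, cols = np.indices(truth.shape)
all_pixels = np.column_stack([rows.ravel(), cols.ravel()])
train_idx = np.random.default_rng(1).choice(len(all_pixels), size=200, replace=False)

patches = extract_patches(image, all_pixels)
params, losses = train_autoencoder(patches[train_idx])   # unsupervised step
features = encode(patches, params)                       # one feature vector per pixel
svm = SVC(kernel="rbf").fit(features[train_idx], truth.ravel()[train_idx])
predicted = svm.predict(features).reshape(truth.shape)   # per-pixel label map
```

Only the SVM step sees class labels; the autoencoder is fit on the patches alone, which mirrors the split between unsupervised feature learning and supervised pixel classification described above.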

Details

Title Page segmentation of historical document images with convolutional autoencoders

Author(s) Chen, Kai (University of Fribourg, Fribourg, Switzerland)
Seuret, Mathias (University of Fribourg, Fribourg, Switzerland)
Liwicki, Marcus (DFKI - German Research Center for Artificial Intelligence)
Hennebert, Jean (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland)
Ingold, Rolf (University of Fribourg, Fribourg, Switzerland)

Date 2015-08

Published in Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), 23-26 August 2015, Tunis, Tunisia

Volume 2015, pp. 1011-1015

Published by Tunis, Tunisia, 23-26 August 2015

Pagination 5 p.

Presented at 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia, 2015-08-23, 2015-08-26

ISBN 978-1-4799-1805-8

DOI https://doi.org/10.1109/ICDAR.2015.7333914

Keywords (free) support vector machines ; robustness ; image segmentation

Paper type published full paper

Field Engineering and Architecture

School HEIA-FR

Institute iCoSys - Institute of Complex Systems

The document appears in Conference papers
Global
