Page segmentation for historical handwritten document images using conditional random fields

Chen, Kai; Seuret, Mathias; Liwicki, Marcus; Hennebert, Jean; Liu, Cheng-Lin; Ingold, Rolf

doi:10.1109/ICFHR.2016.0029

Chen, Kai; Seuret, Mathias; Liwicki, Marcus; Hennebert, Jean; Liu, Cheng-Lin; Ingold, Rolf

2016

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

In this paper, we present a Conditional Random Field (CRF) model to deal with the problem of segmenting handwritten historical document images into different regions. We consider page segmentation as a pixel-labeling problem, i.e., each pixel is assigned to one of a set of labels. Features are learned from pixel intensity values with stacked convolutional autoencoders in an unsupervised manner. The features are used for the purpose of initial classification with a multilayer perceptron. Then a CRF model is introduced for modeling the local and contextual information jointly in order to improve the segmentation. For the purpose of decreasing the time complexity, we perform labeling at superpixel level. In the CRF model, graph nodes are represented by superpixels. The label of each pixel is determined by the label of the superpixel to which it belongs. Experiments on three public datasets demonstrate that, compared to previous methods, the proposed method achieves more accurate segmentation results and is much faster.

Détails

Titre Page segmentation for historical handwritten document images using conditional random fields

Auteur(s)/ trice(s) Chen, Kai (University of Fribourg, Fribourg, Switzerland)
Seuret, Mathias (University of Fribourg, Fribourg, Switzerland)
Liwicki, Marcus (University of Fribourg, Fribourg, Switzerland)
Hennebert, Jean (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland)
Liu, Cheng-Lin (NLPR, Institute of Automation of Chinese Academy of Sciences, China)
Ingold, Rolf (University of Fribourg, Fribourg, Switzerland)

Date 2016-10

Publié dans Proceedings of the 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), 23-26 October 2016, Shenzhen, China

Volume pp. 90-95

Editeur Shenzhen, China, 23-26 October 2016

Pagination 6 p.

Présenté à 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), Shenzhen, China, 2016-10-23, 2016-10-26

ISBN 978-1-5090-0981-7

DOI https://doi.org/10.1109/ICFHR.2016.0029

ISSN 2167-6445

Mots-clés (libres) image segmentation ; feature extraction ; labeling ; training ; neurons ; handwriting recognition ; electronic mail ; conditional random field ; page segmentation ; historical document image ; autoencoder ; superpixel

Type de papier published full paper

Domaine Ingénierie et Architecture

Ecole HEIA-FR

Institut iCoSys - Institut des systèmes complexes

Le document apparaît dans Documents de conférences
Global

Résumé

Détails

Actions