Open datasets and tools for arabic text detection and recognition in news video frames

Zayene, Oussama; Touj, Sameh Masmoudi; Hennebert, Jean; Ingold, Rolf; Essoukri Ben Amara, Najoua

doi:10.3390/jimaging4020032

Zayene, Oussama; Touj, Sameh Masmoudi; Hennebert, Jean; Ingold, Rolf; Essoukri Ben Amara, Najoua

2018

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

Recognizing texts in video is more complex than in other environments such as scanned documents. Video texts appear in various colors, unknown fonts and sizes, often affected by compression artifacts and low quality. In contrast to Latin texts, there are no publicly available datasets which cover all aspects of the Arabic Video OCR domain. This paper describes a new well-defined and annotated Arabic-Text-in-Video dataset called AcTiV 2.0. The dataset is dedicated especially to building and evaluating Arabic video text detection and recognition systems. AcTiV 2.0 contains 189 video clips serving as a raw material for creating 4063 key frames for the detection task and 10,415 cropped text images for the recognition task. AcTiV 2.0 is also distributed with its annotation and evaluation tools that are made open-source for standardization and validation purposes. This paper also reports on the evaluation of several systems tested under the proposed detection and recognition protocols.

Détails

Titre Open datasets and tools for arabic text detection and recognition in news video frames

Auteur(s)/ trice(s) Zayene, Oussama (LATIS Lab, National engineering School of Sousse (Eniso), University of Sousse, Sousse, Tunisia ; DIVA group, Department of Inforamtics, University of Fribourg, Fribourg, Switzerland)
Touj, Sameh Masmoudi (LATIS Lab, National Engineeering School of Sousse (Eniso), Sousse, Tunisia)
Hennebert, Jean (School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland)
Ingold, Rolf (DIVA group, Departmeent of Informatics, University of Fribourg, Fribourg, Switzerland)
Essoukri Ben Amara, Najoua (LATIS Lab, National Engineering School of Sousse (Eniso), Sousse, Tunisia)

Date 2018-01

Publié dans Journal of Imaging

Volume 2018, vol. 4(2), no. 32

Pagination 19 p.

DOI https://doi.org/10.3390/jimaging4020032

ISSN 2313-433X

Mots-clés (libres) video text detection ; video text recognition ; AcTiV dataset ; Arabic Video OCR

Type d'article scientifique

Domaine Ingénierie et Architecture

Ecole HEIA-FR

Institut iCoSys - Institut des systèmes complexes

Le document apparaît dans Articles scientifiques
Global

Résumé

Détails

Actions