A dataset for Arabic text detection, tracking and recognition in news videos - AcTiV

Zayene, Oussama; Hennebert, Jean; Touj, Sameh Masmoudi; Ingold, Rolf; Ben Amara, Najoua Essoukri

doi:10.1109/ICDAR.2015.7333911

2015

Abstract

Recently, promising results have been reported on video text detection and recognition. Most of the proposed methods are tested on private datasets with non-uniform evaluation metrics. We report here on the development of a publicly accessible annotated video dataset designed to assess the performance of different artificial Arabic text detection, tracking and recognition systems. The dataset includes 80 videos (more than 850,000 frames) collected from 4 different Arabic news channels. An attempt was made to ensure maximum diversity of the textual content in terms of size, position and background. The data are accompanied by detailed annotations for each textbox. We also present a region-based text detection approach, together with a set of evaluation protocols against which the performance of different systems can be measured.

Details

Title A dataset for Arabic text detection, tracking and recognition in news videos - AcTiV

Author(s) Zayene, Oussama (University of Fribourg, Fribourg, Switzerland ; University of Sousse, Sousse, Tunisia)
Hennebert, Jean (University of Fribourg, Fribourg, Switzerland ; School of Engineering and Architecture (HEIA-FR), HES-SO University of Applied Sciences Western Switzerland)
Touj, Sameh Masmoudi (University of Sousse, Sousse, Tunisia)
Ingold, Rolf (University of Fribourg, Fribourg, Switzerland)
Ben Amara, Najoua Essoukri (University of Sousse, Sousse, Tunisia)

Date 2015-08

Published in Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), 23-26 August 2015, Tunis, Tunisia

Volume 2015, pp. 996-1000

Publisher Tunis, Tunisia, 23-26 August 2015

Pagination 5 p.

Presented at 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia, 2015-08-23, 2015-08-26

ISBN 978-1-4799-1805-8

DOI https://doi.org/10.1109/ICDAR.2015.7333911

Keywords (free) manganese ; high definition video ; random access memory ; ferroelectric films ; nonvolatile memory ; protocols ; video OCR ; video database ; benchmark ; Arabic text

Paper type published full paper

Domain Engineering and Architecture

School HEIA-FR

Institute iCoSys - Institute of Complex Systems

This document appears in Conference documents
Global