Triplet CNN-based word spotting of historical Arabic documents - IRT SystemX Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Triplet CNN-based word spotting of historical Arabic documents

Résumé

Word Spotting of Historical Arabic Documents is a challenging task due to the complexity of document layouts. This paper proposes a novel word spotting approach that consists of learning feature representation to describe word images. The objective is to investigate optimal embedding spaces to extract a discriminative word image representation. The proposed approach consists of two steps: i) construct a CNN-based embedding space with triplet-loss and then ii) match embedding representations using the Euclidean distance. For training, the CNN takes as input a set of triplet samples (anchor, positive sample and negative sample). Then, the triplet loss serves to create a novel space by minimizing intra-classes distances and maximizing inter-classes distances. The proposed approach is evaluated on the VML-HD dataset and the experiments show its effectiveness compared to the state of the art.
Fichier non déposé

Dates et versions

hal-02473637 , version 1 (10-02-2020)

Identifiants

  • HAL Id : hal-02473637 , version 1

Citer

Abir Fathallah, Mohamed Ibn Khedher, Mounim El Yacoubi, Najoua Essoukri Ben Amara. Triplet CNN-based word spotting of historical Arabic documents. ICONIP 2019: 26th International Conference on Neural Information Processing of the Asia-Pacific Neural Network Society, Dec 2019, Sydney, Australia. ⟨hal-02473637⟩
134 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More