Transfer Learning for Handwriting Recognition on Historical Documents - LS2N - équipe IPI ( Image Perception Interaction ) Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Transfer Learning for Handwriting Recognition on Historical Documents

Résumé

In this work, we investigate handwriting recognition on new historical handwritten documents using transfer learning. Establishing a manual ground-truth of a new collection of handwritten documents is time consuming but needed to train and to test recognition systems. We want to implement a recognition system without performing this annotation step. Our research deals with transfer learning from heterogeneous datasets with a ground-truth and sharing common properties with a new dataset that has no ground-truth. The main difficulties of transfer learning lie in changes in the writing style, the vocabulary, and the named entities over centuries and datasets. In our experiment, we show how a CNN-BLSTM-CTC neural network behaves, for the task of transcribing handwritten titles of plays of the Italian Comedy, when trained on combinations of various datasets such as RIMES, Georges Washington, and Los Esposalles. We show that the choice of the training datasets and the merging methods are determinant to the results of the transfer learning task.
Fichier principal
Vignette du fichier
ICPRAM_2018_48_CR.pdf (737.59 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01681126 , version 1 (16-12-2020)

Identifiants

  • HAL Id : hal-01681126 , version 1

Citer

Adeline Granet, Emmanuel Morin, Harold Mouchère, Solen Quiniou, Christian Viard-Gaudin. Transfer Learning for Handwriting Recognition on Historical Documents. 7th International Conference on Pattern Recognition Applications and Methods (ICPRAM), Jan 2018, Madeira, Portugal. ⟨hal-01681126⟩
896 Consultations
224 Téléchargements

Partager

Gmail Facebook X LinkedIn More