Construction of a Multilingual Corpus Annotated with Translation Relations - Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Construction of a Multilingual Corpus Annotated with Translation Relations

Résumé

Translation relations, which distinguish literal translation from other translation techniques, constitute an important subject of study for human translators (Chuquet and Paillard, 1989). However , automatic processing techniques based on interlingual relations, such as machine translation or paraphrase generation exploiting translational equivalence, have not made use of these relations explicitly until now. In this work, we present a categorization of translation relations and then we annotate a parallel multilingual (English, French, Chinese) corpus of oral presentations, the TED Talks, with these relations. Our long-term objective will be to automatically detect these relations in order to integrate them as important characteristics for the search of monolingual segments in relation of equivalence (paraphrases) or of entailment. The annotated corpus resulting from our work will be made available to the community.
Fichier principal
Vignette du fichier
Coling_LR4NLP_Yuming.pdf (536.23 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01867896 , version 1 (04-09-2018)

Identifiants

  • HAL Id : hal-01867896 , version 1

Citer

Yuming Zhai, Aurélien Max, Anne Vilnat. Construction of a Multilingual Corpus Annotated with Translation Relations. First Workshop on Linguistic Resources for Natural Language Processing, Aug 2018, Santa Fe, United States. pp.102 - 111. ⟨hal-01867896⟩
79 Consultations
148 Téléchargements

Partager

Gmail Facebook X LinkedIn More