Ambiguity Diagnosis for Terms in Digital Humanities - LINA - Equipe Traitement Automatique du Langage Naturel Access content directly
Conference Papers Year : 2016

Ambiguity Diagnosis for Terms in Digital Humanities

Abstract

Among all researches dedicating to terminology and word sense disambiguation, little attention has been devoted to the ambiguity of term occurrences. If a lexical unit is indeed a term of the domain, it is not true, even in a specialised corpus, that all its occurrences are terminological. Some occurrences are terminological and other are not. Thus, a global decision at the corpus level about the terminological status of all occurrences of a lexical unit would then be erroneous. In this paper, we propose three original methods to characterise the ambiguity of term occurrences in the domain of social sciences for French. These methods differently model the context of the term occurrences: one is relying on text mining, the second is based on textometry, and the last one focuses on text genre properties. The experimental results show the potential of the proposed approaches and give an opportunity to discuss about their hybridisation.
Fichier principal
Vignette du fichier
desamb.pdf (229.3 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-01423650 , version 1 (30-12-2016)

Identifiers

  • HAL Id : hal-01423650 , version 1

Cite

Béatrice Daille, Evelyne Jacquey, Gaël Lejeune, Luis Felipe Melo, Yannick Toussaint. Ambiguity Diagnosis for Terms in Digital Humanities. Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia. ⟨hal-01423650⟩
631 View
273 Download

Share

Gmail Facebook X LinkedIn More