LIA/LINA at the INEX 2012 Tweet Contextualization track - LINA - Equipe Traitement Automatique du Langage Naturel
Conference Papers, Year: 2012

LIA/LINA at the INEX 2012 Tweet Contextualization track

Romain Deveaud
Florian Boudin

Abstract

In this paper we describe our participation in the INEX 2012 Tweet Contextualization track and present our contributions. We combine Information Retrieval, Automatic Summarization and Topic Modeling techniques to provide the context of each tweet. We first formulate a specific query from the hashtags and important words in the tweet to retrieve the most relevant Wikipedia articles. We then segment the articles into sentences and compute several measures for each sentence in order to estimate its contextual relevance to the topics expressed by the tweet. Finally, the best-scoring sentences are used to form the context. Official results suggest that our methods performed very well compared to those of other participants.
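
The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify which measures are computed, so a simple query-term overlap stands in for them, the Wikipedia retrieval step is skipped (article texts are passed in directly), and the stopword list and all function names (`build_query`, `split_sentences`, `score_sentence`, `contextualize`) are hypothetical.

```python
import re

# Tiny illustrative stopword list (an assumption, not the authors' list).
STOPWORDS = {"a", "an", "and", "are", "at", "for", "in", "is", "it",
             "of", "on", "the", "to", "with"}


def build_query(tweet):
    """Return query terms: hashtags first, then other non-stopword tokens."""
    hashtags = [h.lower() for h in re.findall(r"#(\w+)", tweet)]
    words = [w.lower() for w in re.findall(r"\w+", tweet.replace("#", " "))]
    terms = []
    for term in hashtags + words:
        # Drop stopwords and duplicates while preserving order.
        if term not in STOPWORDS and term not in terms:
            terms.append(term)
    return terms


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def score_sentence(sentence, query_terms):
    """Fraction of query terms found in the sentence (stand-in measure)."""
    if not query_terms:
        return 0.0
    tokens = set(re.findall(r"\w+", sentence.lower()))
    return sum(1 for t in query_terms if t in tokens) / len(query_terms)


def contextualize(tweet, articles, k=2):
    """Return the k best-scoring sentences from the given article texts."""
    query = build_query(tweet)
    sentences = [s for article in articles for s in split_sentences(article)]
    ranked = sorted(sentences, key=lambda s: score_sentence(s, query),
                    reverse=True)
    return ranked[:k]
```

Calling `contextualize(tweet, articles, k=1)` with a tweet and a list of already-retrieved article texts returns the single sentence that shares the most terms with the tweet. A system along the lines the abstract describes would replace the overlap score with a combination of several measures and would obtain `articles` from a search over Wikipedia.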
Main file: CLEF2012wn-INEX-DeveaudEt2012.pdf (190.35 KB)
Origin : Files produced by the author(s)

Dates and versions

hal-00755496 , version 1 (21-11-2012)

Identifiers

  • HAL Id : hal-00755496 , version 1

Cite

Romain Deveaud, Florian Boudin. LIA/LINA at the INEX 2012 Tweet Contextualization track. INitiative for the Evaluation of XML Retrieval (INEX), Sep 2012, Rome, Italy. pp.n/a. ⟨hal-00755496⟩