A decentralized and robust approach to estimating a probabilistic mixture model for structuring distributed data - LINA-DUKE Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

A decentralized and robust approach to estimating a probabilistic mixture model for structuring distributed data

Résumé

Data sharing services on the web host huge amounts of resources supplied and accessed by millions of users around the world. While the classical approach is a central control over the data set, even if this data set is distributed, there is growing interesting in decentralized solutions, because of good properties (in particularity, privacy and scaling up). In this paper, we explore a machine learning side of this work direction. We propose a novel technique for decentralized estimation of probabilistic mixture models, which are among the most versatile generative models for understanding data sets. More precisely, we demonstrate how to estimate a global mixture model from a set of local models. Our approach accommodates dynamic topology and data sources and is statistically robust, i.e. resilient to the presence of unreliable local models. Such outlier models may arise from local data which are outliers, compared to the global trend, or poor mixture estimation. We report experiments on synthetic data and real geo-location data from Flickr.
Fichier principal
Vignette du fichier
Ali-El-Attar-Web_Intelligence-WI2011.pdf (446.76 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-00595300 , version 1 (28-08-2012)

Identifiants

  • HAL Id : hal-00595300 , version 1

Citer

Ali El Attar, Antoine Pigeau, Marc Gelgon. A decentralized and robust approach to estimating a probabilistic mixture model for structuring distributed data. IEEE/ACM Int. conf on Web Intelligence, Aug 2011, Lyon, France. pp.372-379. ⟨hal-00595300⟩
163 Consultations
286 Téléchargements

Partager

Gmail Facebook X LinkedIn More