Forests of latent tree models for the detection of genetic associations - LINA-DUKE Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Forests of latent tree models for the detection of genetic associations

Résumé

Together with the population aging concern, increasing health care costs require understanding the causal basis for common genetic diseases. The high dimensionality and complexity of genetic data hamper the detection of genetic associations. To alleviate the core risks (missing of the causal factor, spurious discoveries), machine learning offers an appealing alternative framework to standard statistical approaches. A novel class of probabilistic graphical models has recently been proposed - the forest of latent tree models - , to obtain a trade-off between faithful modeling of data dependences and tractability. In this paper, we evaluate the soundness of this modeling approach in an association genetics context. We have performed intensive tests, in various controlled conditions, on realistic simulated data. We have also tested the model on real data. Beside guaranteeing data dimension reduction through latent variables, the model is empirically proven able to capture indirect genetic associations with the disease, both on simulated and real data. Strong associations are evidenced between the disease and the ancestor nodes of the causal genetic marker node, in the forest. In contrast, very weak associations are obtained for other nodes.

Dates et versions

hal-00637500 , version 1 (02-11-2011)

Identifiants

Citer

Christine Sinoquet, Raphaël Mourad, Philippe Leray. Forests of latent tree models for the detection of genetic associations. International Conference on Bioinformatics Models, Methods and Algorithms (BIOINFORMATICS 2012), Feb 2012, Vilamoura, Portugal. pp.1-10, ⟨10.5220/0003703400050014⟩. ⟨hal-00637500⟩
97 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More