Model selection with BIC and ICL criteria for binned data clustering by bin-EM-CEM algorithms

Abstract : Several clustering approaches are adapted to binned data in order to accelerate the clustering process or to deal with data of limited precision. Bin-EM-CEM algorithms of fourteen parsimonious Gaussian mixture models are developed. Each model performs differently according to its specific feature. Without knowing any information of the data, a criterion is considered to select the best model in order to obtain a good result. In this article, BIC and ICL criteria are adapted to binned data clustering to choose the bin-EM-CEM algorithm of the right model as well as the number of clusters. By different experiments on simulated data and real data, the performance of BIC and ICL criteria in model selection for binned data clustering are studied and compared on different aspects.
Complete list of metadatas

https://hal-supelec.archives-ouvertes.fr/hal-00952379
Contributor : Alexandra Siebert <>
Submitted on : Wednesday, February 26, 2014 - 4:28:22 PM
Last modification on : Thursday, March 29, 2018 - 11:06:05 AM

Identifiers

Collections

Citation

Hani Hamdan, Jingwen Wu. Model selection with BIC and ICL criteria for binned data clustering by bin-EM-CEM algorithms. 2013 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2013), Oct 2013, Manchester, United Kingdom. pp.3133 - 3138, ⟨10.1109/SMC.2013.534⟩. ⟨hal-00952379⟩

Share

Metrics

Record views

165