Selecting an optimal clustering solution is a longstanding problem. In model-based clustering this amounts to choose the architecture of the model mixture distribution. Decisions to be made pertain to: cluster prototype distribution; number of mixture components; (optionally) restrictions on the clusters’ geometry. Classical pro- posals address this issue via penalized model selection criteria based on the observed likelihood function. In this study, we compare these techniques with the less explored cross-validation alternative, which is rather popular for many other data-driven opti- mized methods. We analyze both classical methods such as BIC, AIC, AIC3 and ICL, and several cross-validation schemes where the risk is defined in terms of minus the log-likelihood function. Selection methods are compared by using the Iris dataset.

Likelihood-type methods for comparing clustering solutions / Coraggio, Luca; Coretto, Pietro. - (2019), pp. 120-123. (Intervento presentato al convegno CLADAG 2019 tenutosi a Cassino (FR) nel 11-13 settembre 2019).

Likelihood-type methods for comparing clustering solutions

Coraggio, Luca;
2019

Abstract

Selecting an optimal clustering solution is a longstanding problem. In model-based clustering this amounts to choose the architecture of the model mixture distribution. Decisions to be made pertain to: cluster prototype distribution; number of mixture components; (optionally) restrictions on the clusters’ geometry. Classical pro- posals address this issue via penalized model selection criteria based on the observed likelihood function. In this study, we compare these techniques with the less explored cross-validation alternative, which is rather popular for many other data-driven opti- mized methods. We analyze both classical methods such as BIC, AIC, AIC3 and ICL, and several cross-validation schemes where the risk is defined in terms of minus the log-likelihood function. Selection methods are compared by using the Iris dataset.
2019
9788883171086
Likelihood-type methods for comparing clustering solutions / Coraggio, Luca; Coretto, Pietro. - (2019), pp. 120-123. (Intervento presentato al convegno CLADAG 2019 tenutosi a Cassino (FR) nel 11-13 settembre 2019).
File in questo prodotto:
File Dimensione Formato  
Coraggio2019 - Likelihood Type Methods for Comparing Clustering Solutions.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 15.03 MB
Formato Adobe PDF
15.03 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/909464
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact