In cluster analysis one often finds several partitions of a data set using different clustering methods and algorithms set with a variety of hyperparameters and tunings. The number of clusters K is one of the most relevant of such hyperparameters. Cluster selection is the task of choosing the desired partitions. The Bootstrap Quadratic Scoring is a recently introduced method where the cluster selection is performed by optimizing a score attached to a partition that is based on the quadratic discriminant function. Previously, we proposed the estimation of this cluster score via bootstrap resampling and investigated the proposed estimator based on numerical experiments and real data applications. However, that earlier work did not provide theoretical guarantees. In this paper, we fill that gap. We study the asymptotic behavior of the scoring method and show that the proposed estimator converges to well-defined population counterparts.

Asymptotic Results for the Estimation of the Quadratic Score of a Clustering / Coraggio, Luca; Coretto, Pietro. - In: MATHEMATICS. - ISSN 2227-7390. - 12:21(2024). [10.3390/math12213417]

Asymptotic Results for the Estimation of the Quadratic Score of a Clustering

Coraggio, Luca
;
Coretto, Pietro
2024

Abstract

In cluster analysis one often finds several partitions of a data set using different clustering methods and algorithms set with a variety of hyperparameters and tunings. The number of clusters K is one of the most relevant of such hyperparameters. Cluster selection is the task of choosing the desired partitions. The Bootstrap Quadratic Scoring is a recently introduced method where the cluster selection is performed by optimizing a score attached to a partition that is based on the quadratic discriminant function. Previously, we proposed the estimation of this cluster score via bootstrap resampling and investigated the proposed estimator based on numerical experiments and real data applications. However, that earlier work did not provide theoretical guarantees. In this paper, we fill that gap. We study the asymptotic behavior of the scoring method and show that the proposed estimator converges to well-defined population counterparts.
2024
Asymptotic Results for the Estimation of the Quadratic Score of a Clustering / Coraggio, Luca; Coretto, Pietro. - In: MATHEMATICS. - ISSN 2227-7390. - 12:21(2024). [10.3390/math12213417]
File in questo prodotto:
File Dimensione Formato  
CoraggioCoretto - 2024 - M - Asymptotic Results for the Estimation of the Quadratic Score of a Clustering.pdf

accesso aperto

Descrizione: PDF articolo
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 345.96 kB
Formato Adobe PDF
345.96 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/987250
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact