Motivated by the analysis of rating data concerning perceived health status, a crucial variable in biomedical, economic and life insurance models, the paper deals with diagnostic procedures for identifying anomalous and/or infuential observations in ordinal response models with challenging data structures. Deviations due to some respondents’ atypical behavior, outlying covariates and gross errors may afect the reliability of likelihood based inference, especially when non robust link functions are adopted. The present paper investigates and exploits the properties of the generalized residuals. They appear in the estimating equations of the regression coeffcients and hold the remarkable characteristic of interacting with the covariates in the same fashion as the linear regression residuals. Identifcation of statistical units incoherent with the model can be achieved by the analysis of the residuals produced by maximum likelihood or robust M-estimation, while the inspection of the weights generated by M-estimation allows to identify infuential data. Simple guidelines are proposed to this end, which disclose information on the data structure. The purpose is twofold: recognizing statistical units that deserve specifc attention for their peculiar features, and being aware of the sensitivity of the ftted model to small changes in the sample. In the analysis of the self-perceived health status, extreme design points associated with incoherent responses produce highly infuential observations. The diagnostic procedures identify the outliers and assess their infuence.
Generalized residuals and outlier detection for ordinal data with challenging data structures / Iannario, M.; Monti, A. C.. - In: STATISTICAL METHODS & APPLICATIONS. - ISSN 1618-2510. - 32:4(2023), pp. 1197-1216. [10.1007/s10260-023-00686-1]
Generalized residuals and outlier detection for ordinal data with challenging data structures
Iannario M.
;Monti A. C.
2023
Abstract
Motivated by the analysis of rating data concerning perceived health status, a crucial variable in biomedical, economic and life insurance models, the paper deals with diagnostic procedures for identifying anomalous and/or infuential observations in ordinal response models with challenging data structures. Deviations due to some respondents’ atypical behavior, outlying covariates and gross errors may afect the reliability of likelihood based inference, especially when non robust link functions are adopted. The present paper investigates and exploits the properties of the generalized residuals. They appear in the estimating equations of the regression coeffcients and hold the remarkable characteristic of interacting with the covariates in the same fashion as the linear regression residuals. Identifcation of statistical units incoherent with the model can be achieved by the analysis of the residuals produced by maximum likelihood or robust M-estimation, while the inspection of the weights generated by M-estimation allows to identify infuential data. Simple guidelines are proposed to this end, which disclose information on the data structure. The purpose is twofold: recognizing statistical units that deserve specifc attention for their peculiar features, and being aware of the sensitivity of the ftted model to small changes in the sample. In the analysis of the self-perceived health status, extreme design points associated with incoherent responses produce highly infuential observations. The diagnostic procedures identify the outliers and assess their infuence.File | Dimensione | Formato | |
---|---|---|---|
iannario monti 2023.pdf
accesso aperto
Licenza:
Dominio pubblico
Dimensione
2.39 MB
Formato
Adobe PDF
|
2.39 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.