The framework of this paper is supervised statistical learning in data mining. A typical data-mining problem is to deal with large sets of within-groups correlated inputs compared to the number of observed objects. In case of complex relationships standard tree-based procedures offer unstable and not ever interpretable solutions. For that multiple splits defined upon a suitable combination of inputs are required. This paper provides a solution to build up a tree-based method whose nodes splitting is due to factorial multiple splitting categorical variables. A recursive partitioning algorithm is introduced considering a two-stage splitting criterion based on object scores from Nonlinear canonical correlation analysis.
Optimal scaling trees / Tutore, VALERIO ANIELLO; Mooijaart, A.. - STAMPA. - (2007), pp. 359-362. (Intervento presentato al convegno Meeting of Classification and Data Analysis Group 2007 tenutosi a Macerata nel 12-14 settembre).
Optimal scaling trees
TUTORE, VALERIO ANIELLO;
2007
Abstract
The framework of this paper is supervised statistical learning in data mining. A typical data-mining problem is to deal with large sets of within-groups correlated inputs compared to the number of observed objects. In case of complex relationships standard tree-based procedures offer unstable and not ever interpretable solutions. For that multiple splits defined upon a suitable combination of inputs are required. This paper provides a solution to build up a tree-based method whose nodes splitting is due to factorial multiple splitting categorical variables. A recursive partitioning algorithm is introduced considering a two-stage splitting criterion based on object scores from Nonlinear canonical correlation analysis.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.