This paper constitutes Part I of the contribution to the analysis of web visit histories through a new methodological framework. Firstly, web usage and web structure mining are considered as an unique mining process to detect the latent structure of the web navigation across the web sections of a single portal. We extend association rules theory to web data defining new concepts of web (patterns) association and preference matrices, as well as of (indirect and direct) sequence rules. We identify the most significant rules, according to a multiple testing procedure. In the literature, web usage patterns can be visualized in no-distance-based graphs describing the navigation behavior across web pages with sequential arrows. In the following, we introduce a geometrical visualization of sequence rules at any click of the web navigation. In particular, we provide two distance-based visualization methods for the static analysis of all data tout court and the dynamic analysis to discover the most significant web paths click by click. A real world case study is considered throughout the methodological description.
Analysis of web visit histories, part I: Distance-based visualization of sequence rules / Siciliano, Roberta; D'Ambrosio, Antonio; Aria, Massimo; Amodio, S.. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - 33:(2016), pp. 298-324. [10.1007/s00357-016-9204-8]
Analysis of web visit histories, part I: Distance-based visualization of sequence rules
SICILIANO, ROBERTA;D'AMBROSIO, ANTONIO;ARIA, MASSIMO;
2016
Abstract
This paper constitutes Part I of the contribution to the analysis of web visit histories through a new methodological framework. Firstly, web usage and web structure mining are considered as an unique mining process to detect the latent structure of the web navigation across the web sections of a single portal. We extend association rules theory to web data defining new concepts of web (patterns) association and preference matrices, as well as of (indirect and direct) sequence rules. We identify the most significant rules, according to a multiple testing procedure. In the literature, web usage patterns can be visualized in no-distance-based graphs describing the navigation behavior across web pages with sequential arrows. In the following, we introduce a geometrical visualization of sequence rules at any click of the web navigation. In particular, we provide two distance-based visualization methods for the static analysis of all data tout court and the dynamic analysis to discover the most significant web paths click by click. A real world case study is considered throughout the methodological description.File | Dimensione | Formato | |
---|---|---|---|
Siciliano2016_Article_AnalysisOfWebVisitHistoriesPar.pdf
Open Access dal 02/02/2021
Tipologia:
Versione Editoriale (PDF)
Licenza:
Accesso privato/ristretto
Dimensione
456.9 kB
Formato
Adobe PDF
|
456.9 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.