This paper is Part I of a contribution to the analysis of web visit histories through a new methodological framework. First, web usage mining and web structure mining are treated as a single mining process to detect the latent structure of navigation across the sections of a single portal. We extend association rules theory to web data by defining new concepts of web (pattern) association and preference matrices, as well as (indirect and direct) sequence rules. We identify the most significant rules by means of a multiple testing procedure. In the literature, web usage patterns can be visualized in non-distance-based graphs that describe navigation behavior across web pages with sequential arrows. Here, we introduce a geometrical visualization of sequence rules at any click of the web navigation. In particular, we provide two distance-based visualization methods: one for the static analysis of all the data taken together, and one for the dynamic analysis that discovers the most significant web paths click by click. A real-world case study is used throughout the methodological description.

Analysis of web visit histories, part I: Distance-based visualization of sequence rules / Siciliano, Roberta; D'Ambrosio, Antonio; Aria, Massimo; Amodio, S. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - 33:(2016), pp. 298-324. [10.1007/s00357-016-9204-8]

Analysis of web visit histories, part I: Distance-based visualization of sequence rules

SICILIANO, ROBERTA; D'AMBROSIO, ANTONIO; ARIA, MASSIMO
2016

Files in this record:
Siciliano2016_Article_AnalysisOfWebVisitHistoriesPar.pdf

Open access since 02/02/2021

Type: Publisher's version (PDF)
License: Private/restricted access
Size: 456.9 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/605619
Citations
  • PMC: not available
  • Scopus: 5
  • ISI: 3