The extraction of metadata from dynamic data sources represents an extremely challenging task of the data profiling research area, since it requires to handle the update of the inferred metadata without processing the whole dataset from scratch upon modifications. This discussion paper presents IndiBits, an approach for discovering relaxed functional dependencies (rfds for short), which represent data relationships relying on approximate matching paradigms. It exploits a binary representation of data similarities, a new validation method, and specific search methods, to dynamically update the set of rfds, based on previously holding rfds and the type of modifications performed over data. Experimental results demonstrate the effectiveness of IndiBits on real-world datasets, even in comparison with fd and rfd discovery algorithms in both static and dynamic scenarios.

Relaxed Functional Dependency Discovery in Incremental Scenarios / Breve, B.; Caruccio, L.; Cirillo, S.; Deufemia, V.; Polese, G.. - 3478:(2023), pp. 614-622. (Intervento presentato al convegno 31st Symposium of Advanced Database Systems, SEBD 2023 tenutosi a ita nel Galzingano Terme).

Relaxed Functional Dependency Discovery in Incremental Scenarios

Breve B.;
2023

Abstract

The extraction of metadata from dynamic data sources represents an extremely challenging task of the data profiling research area, since it requires to handle the update of the inferred metadata without processing the whole dataset from scratch upon modifications. This discussion paper presents IndiBits, an approach for discovering relaxed functional dependencies (rfds for short), which represent data relationships relying on approximate matching paradigms. It exploits a binary representation of data similarities, a new validation method, and specific search methods, to dynamically update the set of rfds, based on previously holding rfds and the type of modifications performed over data. Experimental results demonstrate the effectiveness of IndiBits on real-world datasets, even in comparison with fd and rfd discovery algorithms in both static and dynamic scenarios.
2023
Relaxed Functional Dependency Discovery in Incremental Scenarios / Breve, B.; Caruccio, L.; Cirillo, S.; Deufemia, V.; Polese, G.. - 3478:(2023), pp. 614-622. (Intervento presentato al convegno 31st Symposium of Advanced Database Systems, SEBD 2023 tenutosi a ita nel Galzingano Terme).
File in questo prodotto:
File Dimensione Formato  
SEBD-2023.pdf

non disponibili

Licenza: Non specificato
Dimensione 1.83 MB
Formato Adobe PDF
1.83 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/977631
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact