Attribute dependency data analysis for massive datasets by fuzzy transforms

Di Martino, Ferdinando; Sessa, Salvatore

doi:10.1007/s00500-021-05760-y

We present a numerical attribute dependency method for massive datasets based on the concepts of direct and inverse fuzzy transform. In a previous work, we used these concepts for numerical attribute dependency in data analysis: Therein, the multi-dimensional inverse fuzzy transform was useful for approximating a regression function. Here we give an extension of this method in massive datasets because the previous method could not be applied due to the high memory size. Our method is proved on a large dataset formed from 402,678 census sections of the Italian regions provided by the Italian National Statistical Institute (ISTAT) in 2011. The results of comparative tests with the well-known methods of regression, called support vector regression and multilayer perceptron, show that the proposed algorithm has comparable performance with those obtained using these two methods. Moreover, the number of parameters requested in our method is minor with respect to those of the cited in the above two algorithms.

Attribute dependency data analysis for massive datasets by fuzzy transforms / DI MARTINO, Ferdinando; Sessa, Salvatore. - In: SOFT COMPUTING. - ISSN 1432-7643. - 25:13(2021), pp. 8731-8746. [10.1007/s00500-021-05760-y]