Background: breast cancer (BC) is the world's most prevalent cancer in the female population, with 2.3 million new cases diagnosed worldwide in 2020. The great efforts made to set screening campaigns, early detection programs, and increasingly targeted treatments led to significant improvement in patients' survival. The Full-Field Digital Mammograph (FFDM) is considered the gold standard method for the early diagnosis of BC. From several previous studies, it has emerged that breast density (BD) is a risk factor in the development of BC, affecting the periodicity of screening plans present today at an international level. Objective: in this study, the focus is the development of mammographic image processing techniques that allow the extraction of indicators derived from textural patterns of the mammary parenchyma indicative of BD risk factors. Methods: a total of 168 patients were enrolled in the internal training and test set while a total of 51 patients were enrolled to compose the external validation cohort. Different Machine Learning (ML) techniques have been employed to classify breasts based on the values of the tissue density. Textural features were extracted only from breast parenchyma with which to train classifiers, thanks to the aid of ML algorithms. Results: the accuracy of different tested classifiers varied between 74.15% and 93.55%. The best results were reached by a Support Vector Machine (accuracy of 93.55% and a percentage of true positives and negatives equal to TPP = 94.44% and TNP = 92.31%). The best accuracy was not influenced by the choice of the features selection approach. Considering the external validation cohort, the SVM, as the best classifier with the 7 features selected by a wrapper method, showed an accuracy of 0.95, a sensitivity of 0.96, and a specificity of 0.90. Conclusions: our preliminary results showed that the Radiomics analysis and ML approach allow us to objectively identify BD.
Machine Learning Approaches with Textural Features to Calculate Breast Density on Mammography / Sansone, Mario; Fusco, Roberta; Grassi, Francesca; Gatta, Gianluca; Belfiore, Maria Paola; Angelone, Francesca; Ricciardi, Carlo; Ponsiglione, Alfonso Maria; Amato, Francesco; Galdiero, Roberta; Grassi, Roberta; Granata, Vincenza; Grassi, Roberto. - In: CURRENT ONCOLOGY. - ISSN 1718-7729. - 30:1(2023), pp. 839-853. [10.3390/curroncol30010064]
Machine Learning Approaches with Textural Features to Calculate Breast Density on Mammography
Sansone, MarioMethodology
;Angelone, FrancescaData Curation
;Ricciardi, CarloMembro del Collaboration Group
;Ponsiglione, Alfonso MariaValidation
;Amato, FrancescoSupervision
;
2023
Abstract
Background: breast cancer (BC) is the world's most prevalent cancer in the female population, with 2.3 million new cases diagnosed worldwide in 2020. The great efforts made to set screening campaigns, early detection programs, and increasingly targeted treatments led to significant improvement in patients' survival. The Full-Field Digital Mammograph (FFDM) is considered the gold standard method for the early diagnosis of BC. From several previous studies, it has emerged that breast density (BD) is a risk factor in the development of BC, affecting the periodicity of screening plans present today at an international level. Objective: in this study, the focus is the development of mammographic image processing techniques that allow the extraction of indicators derived from textural patterns of the mammary parenchyma indicative of BD risk factors. Methods: a total of 168 patients were enrolled in the internal training and test set while a total of 51 patients were enrolled to compose the external validation cohort. Different Machine Learning (ML) techniques have been employed to classify breasts based on the values of the tissue density. Textural features were extracted only from breast parenchyma with which to train classifiers, thanks to the aid of ML algorithms. Results: the accuracy of different tested classifiers varied between 74.15% and 93.55%. The best results were reached by a Support Vector Machine (accuracy of 93.55% and a percentage of true positives and negatives equal to TPP = 94.44% and TNP = 92.31%). The best accuracy was not influenced by the choice of the features selection approach. Considering the external validation cohort, the SVM, as the best classifier with the 7 features selected by a wrapper method, showed an accuracy of 0.95, a sensitivity of 0.96, and a specificity of 0.90. Conclusions: our preliminary results showed that the Radiomics analysis and ML approach allow us to objectively identify BD.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.