A new approach for histological classification of breast cancer using deep hybrid heterogenous ensemble
Data Technologies and Applications
ISSN: 2514-9288
Article publication date: 18 October 2022
Issue publication date: 25 April 2023
Abstract
Purpose
Hundreds of thousands of deaths each year in the world are caused by breast cancer (BC). An early-stage diagnosis of this disease can positively reduce the morbidity and mortality rate by helping to select the most appropriate treatment options, especially by using histological BC images for the diagnosis.
Design/methodology/approach
The present study proposes and evaluates a novel approach which consists of 24 deep hybrid heterogenous ensembles that combine the strength of seven deep learning techniques (DenseNet 201, Inception V3, VGG16, VGG19, Inception-ResNet-V3, MobileNet V2 and ResNet 50) for feature extraction and four well-known classifiers (multi-layer perceptron, support vector machines, K-nearest neighbors and decision tree) by means of hard and weighted voting combination methods for histological classification of BC medical image. Furthermore, the best deep hybrid heterogenous ensembles were compared to the deep stacked ensembles to determine the best strategy to design the deep ensemble methods. The empirical evaluations used four classification performance criteria (accuracy, sensitivity, precision and F1-score), fivefold cross-validation, Scott–Knott (SK) statistical test and Borda count voting method. All empirical evaluations were assessed using four performance measures, including accuracy, precision, recall and F1-score, and were over the histological BreakHis public dataset with four magnification factors (40×, 100×, 200× and 400×). SK statistical test and Borda count were also used to cluster the designed techniques and rank the techniques belonging to the best SK cluster, respectively.
Findings
Results showed that the deep hybrid heterogenous ensembles outperformed both their singles and the deep stacked ensembles and reached the accuracy values of 96.3, 95.6, 96.3 and 94 per cent across the four magnification factors 40×, 100×, 200× and 400×, respectively.
Originality/value
The proposed deep hybrid heterogenous ensembles can be applied for the BC diagnosis to assist pathologists in reducing the missed diagnoses and proposing adequate treatments for the patients.
Keywords
Acknowledgements
This work was conducted under the research project “Machine Learning based Breast Cancer Diagnosis and Treatment”, 2020–2023. The authors would like to thank the Moroccan Ministry of Higher Education and Scientific Research, Digital Development Agency (ADD), CNRST and UM6P for their support.
Funding: This study was funded by Mohammed VI polytechnic university at Ben Guerir Morocco.
Conflicts of interest/competing interests: The authors report no conflicts of interest.
Citation
Zerouaoui, H., Idri, A. and El Alaoui, O. (2023), "A new approach for histological classification of breast cancer using deep hybrid heterogenous ensemble", Data Technologies and Applications, Vol. 57 No. 2, pp. 245-278. https://doi.org/10.1108/DTA-05-2022-0210
Publisher
:Emerald Publishing Limited
Copyright © 2022, Emerald Publishing Limited