International Journal on Advanced Science, Engineering and Information Technology, Vol. 11 (2021) No. 2, pages: 508-515, DOI:10.18517/ijaseit.11.2.12955

Gradient Boosting Machine Based on PSO for prediction of Leukemia after a Breast Cancer Diagnosis

Mohanad A. Deif, Rania E. Hammam, Ahmed A. A. Solyman


The purpose of this study is to develop an accurate risk prediction model for Chronic Myeloid Leukemia (CML) following an early diagnosis of Breast Cancer (BC). A Gradient Boosting Machine (GBM) classification algorithm was applied to the SEER breast cancer dataset for females diagnosed with BC from 2010 to 2016. A Particle Swarm Optimizer (PSO) was used to optimize the GBM algorithm's hyperparameters and to find the SEER dataset's best attributes. Nine attributes were selected to study the development of CML after a lag time of 6 months following the BC diagnosis. The results showed that the predictive model could distinguish patients with breast cancer only from patients with breast cancer and leukemia, achieving accuracy, sensitivity, and specificity of 98.5%, 99%, and 97.85%, respectively. To verify the performance of the proposed algorithm, the accuracy of the GBM classifier was compared with that of other state-of-the-art classifiers commonly applied in the existing literature: KNN (k-Nearest Neighbor), SVM (Support Vector Machine), and RF (Random Forest). In this comparison, the GBM classifier performed better than the other classifiers both in classifying breast cancer and in predicting which patients develop leukemia after breast cancer. These results are promising: they indicate that the GBM classifier can classify and predict the tumor with high accuracy and efficiency, which may in turn support better cancer diagnosis and treatment.
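As a rough illustration of how a particle swarm can search a hyperparameter space, the following minimal sketch implements a basic global-best PSO in plain Python. It is not the authors' implementation: the abstract does not specify the PSO variant, the tuned hyperparameters, or the fitness function, so the inertia and acceleration coefficients (`w`, `c1`, `c2`), the two hypothetical hyperparameters (`learning_rate`, `subsample`), and the toy objective standing in for a cross-validated GBM error are all assumptions made for this example.

```python
import random


def pso_minimize(objective, bounds, n_particles=20, n_iters=100,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize `objective` over a box using a basic global-best PSO."""
    rng = random.Random(seed)
    dim = len(bounds)

    # Random initial positions inside the box, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]

    # Each particle remembers its own best position; the swarm shares one.
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move the particle and clamp it to the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


def toy_validation_error(params):
    # Hypothetical stand-in for the cross-validated error of a GBM;
    # its minimum at (0.1, 0.8) is chosen arbitrarily for illustration.
    learning_rate, subsample = params
    return (learning_rate - 0.1) ** 2 + (subsample - 0.8) ** 2


# Assumed search ranges for the two hypothetical hyperparameters.
search_bounds = [(0.01, 1.0), (0.5, 1.0)]
best_params, best_error = pso_minimize(toy_validation_error, search_bounds)
```

In an actual tuning run, `toy_validation_error` would be replaced by a function that trains the classifier with the candidate hyperparameters and returns its validation error, which makes each fitness evaluation far more expensive than in this toy setting.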


Risk predictive model; chronic myeloid leukemia; breast cancer; gradient boosting machine; classification algorithm.

