Classification of Acute Myeloid Leukemia Subtypes M1, M2 and M3 Using K-Nearest Neighbor

Nurcahya Pradana Taufik Prakisya (1), Febri Liantoni (2), Yusfia Hafid Aristyagama (3), Puspanda Hatta (4)
(1) Department of Computer and Informatics Education, Universitas Sebelas Maret, Surakarta, Indonesia
(2) Department of Computer and Informatics Education, Universitas Sebelas Maret, Surakarta, Indonesia
(3) Department of Computer and Informatics Education, Universitas Sebelas Maret, Surakarta, Indonesia
(4) Department of Computer and Informatics Education, Universitas Sebelas Maret, Surakarta, Indonesia
Fulltext View | Download
How to cite (IJASEIT) :
Prakisya, Nurcahya Pradana Taufik, et al. “Classification of Acute Myeloid Leukemia Subtypes M1, M2 and M3 Using K-Nearest Neighbor”. International Journal on Advanced Science, Engineering and Information Technology, vol. 11, no. 5, Oct. 2021, pp. 1847-53, doi:10.18517/ijaseit.11.5.9585.
Leukemia is a malignant disease caused by the massive and rapid development of white blood cells in the bone marrow. These excessive white blood cells begin to interfere with the body’s mechanism rather than fighting infection. Acute Myeloid Leukemia (AML) is one of the four main types of leukemia with eight subtypes, M0 to M7. AML M1, M2, and M3 have similarities, making them more difficult to distinguish from the other types. Furthermore, they are usually identified by calculating the ratio of myeloblast, promyelocyte, and monoblastic. This research aims to apply the k-Nearest Neighbor (k-NN) in classifying these cell types. k-NN is an algorithm used for classification based on a similarity measure. In cases of finding the best number of neighborhoods, trial and error were conducted. The features needed for classification are cell area, perimeter, roundness, nucleus ratio, mean and standard deviation. Four distance metrics such as Euclidean, Manhattan, Minkowski, and Chebyshev were used in this research. The results show that the Euclidean, Manhattan, Chebyshev, and Minkowski distance successfully identified 207 out of 300 objects at K=18, 197 out of 300 objects at K=13,  209 out of 300 correct objects at K=9, and 210 out of 300 objects at K=7.  In conclusion, Minkowski was chosen as the best distance metric for KNN in classifying leukemia-forming blood cells. Furthermore, the accuracy, recall, and precision values of KNN with Minkowski distance obtained from 5-fold cross-validation were 80.552%, 44.145%, and 42.592%, respectively.

A. Setiawan, A. Harjoko, T. Ratnaningsih, E. Suryani, Wiharto, and S. Palgunadi, “Classification of cell types in Acute Myeloid Leukemia (AML) of M4, M5 and M7 subtypes with support vector machine classifier,” 2018 Int. Conf. Inf. Commun. Technol. ICOIACT 2018, vol. 2018-Janua, no. Cml, pp. 45-49, 2018.

P. Sachin and R. Y. Kumar, “Detection and Classification of Blood Cancer from Microscopic Cell Images Using SVM KNN and NN Classifier,” Int. J. Adv. Res., vol. 3, no. 6, pp. 315-324, 2017.

E. Suryani, Wiharto, S. Palgunadi, and N. P. T. Prakisya, “Classification of Acute Myelogenous Leukemia (AML M2 and AML M3) using Momentum Back Propagation from Watershed Distance Transform Segmented Images,” in Journal of Physics: Conference Series, 2017, vol. 801, no. 1.

A. Harjoko, T. Ratnaningsih, E. Suryani, Wiharto, S. Palgunadi, and N. P. T. Prakisya, “Classification of acute myeloid leukemia subtypes M1, M2 and M3 using active contour without edge segmentation and momentum backpropagation artificial neural network,” in MATEC Web of Conferences, 2018, vol. 154.

S. Rajpurohit, S. Patil, N. Choudhary, S. Gavasane, and P. Kosamkar, “Identification of Acute Lymphoblastic Leukemia in Microscopic Blood Image Using Image Processing and Machine Learning Algorithms,” 2018 Int. Conf. Adv. Comput. Commun. Informatics, ICACCI 2018, no. Cll, pp. 2359-2363, 2018.

M. H. Waseem et al., “On the Feature Selection Methods and Reject Option Classifiers for Robust Cancer Prediction,” IEEE Access, vol. 7, pp. 141072-141082, 2019.

S. S. Devi, A. Roy, M. Sharma, and R. H. Laskar, “kNN Classification Based Erythrocyte Separation in Microscopic Images of Thin Blood Smear,” Proc. - Int. Conf. Comput. Intell. Networks, vol. 2016-Janua, pp. 69-72, 2016.

M. P. Vaishnnave, K. Suganya Devi, P. Srinivasan, and G. Arutperumjothi, “Detection and classification of groundnut leaf diseases using KNN classifier,” 2019 IEEE Int. Conf. Syst. Comput. Autom. Networking, ICSCAN 2019, pp. 1-5, 2019.

N. Zhang, W. Karimoune, L. Thompson, and H. Dang, “A between-class overlapping coherence-based algorithm in KNN classification,” 2017 IEEE Int. Conf. Syst. Man, Cybern. SMC 2017, vol. 2017-Janua, pp. 572-577, 2017.

B. Harijanto, E. L. Amalia, and M. Mentari, “Recognition of the character on the map captured by the camera using k-nearest neighbor,” IOP Conf. Ser. Mater. Sci. Eng., vol. 732, p. 012043, 2020.

H. Wisnu, M. Afif, and Y. Ruldevyani, “Sentiment analysis on customer satisfaction of digital payment in Indonesia: A comparative study using KNN and Naí¯ve Bayes Sentiment analysis on customer satisfaction of digital payment in Indonesia: A comparative study using KNN and Naí¯ve Bayes,” 2020.

N. Krithika and A. Grace Selvarani, “An individual grape leaf disease identification using leaf skeletons and KNN classification,” Proc. 2017 Int. Conf. Innov. Information, Embed. Commun. Syst. ICIIECS 2017, vol. 2018-Janua, pp. 1-5, 2018.

A. Singh and B. Pandey, “An Euclidean Distance based KNN Computational Method for Assessing Degree of Liver Damage,” Int. Conf. Inven. Comput. Technol., 2016.

J. Williams and Y. Li, “Comparative Study of Distance Functions for Nearest Neighbors,” Adv. Tech. Comput. Sci. Softw. Eng., no. January, 2010.

M. Klimo, O. Å kvarek, P. Tarabek, O. Å uch, and J. Hrabovsky, “Nearest neighbor classification in minkowski quasi-metric space,” DISA 2018 - IEEE World Symp. Digit. Intell. Syst. Mach. Proc., pp. 227-232, 2018.

B. Khaldi, F. Harrou, F. Cherif, and Y. Sun, “Improving robots swarm aggregation performance through the Minkowski distance function,” 2020 6th Int. Conf. Mechatronics Robot. Eng. ICMRE 2020, pp. 87-91, 2020.

F. H. K. Zaman, I. M. Yassin, and A. A. Shafie, “Ensembles of large margin nearest neighbour with grouped lateral patch arrangement for face classification,” IRIS 2016 - 2016 IEEE 4th Int. Symp. Robot. Intell. Sensors Empower. Robot. with Smart Sensors, no. December, pp. 6-12, 2017.

S. Yadav and S. Shukla, “Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification,” Proc. - 6th Int. Adv. Comput. Conf. IACC 2016, no. Cv, pp. 78-83, 2016.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).