Designing Hybrid CNN-SVM Model for COVID-19 Classification Based on X-ray Images Using LGBM Feature Selection

Sri Hartini (1), Zuherman Rustam (2), Rahmat Hidayat (3)
(1) Department of Mathematics, Universitas Indonesia, Depok, 16424, Indonesia
(2) Department of Mathematics, Universitas Indonesia, Depok, 16424, Indonesia
(3) Department of Information Technology, Politeknik Negeri Padang, 25164, Indonesia
Fulltext View | Download
How to cite (IJASEIT) :
Hartini, Sri, et al. “Designing Hybrid CNN-SVM Model for COVID-19 Classification Based on X-Ray Images Using LGBM Feature Selection”. International Journal on Advanced Science, Engineering and Information Technology, vol. 12, no. 5, Sept. 2022, pp. 1895-06, doi:10.18517/ijaseit.12.5.16875.
COVID-19 still exists at an alarming level; hence, early diagnosis is important for treating and controlling this disease due to its rapid spread. The use of X-rays in medical image analysis can play an essential role in fast and affordable diagnosis. This study used a two-level feature selection in hybrid deep convolutional features obtained from the extraction of X-ray images. The transfer learning-based approach was implemented using five convolutional neural networks (CNNs) named VGG16, VGG19, ResNet50, InceptionV3, and Xception. The combination of two or three CNNs' performance as a feature extractor was then carefully analyzed. We selected the features obtained from multiple CNNs in a particular layer with a specified percentage of features in the first level for getting relevant features from various models. Then, we combined those features and did the second level of feature selection to select the most informative features. Both levels of feature selection were carried out using the light gradient boosting machine (LightGBM) algorithm. The final feature set has been used to classify COVID-19 and non-COVID-19 chest X-ray images using the support vector machines (SVM) classifier. The proposed model's performance was evaluated and analyzed on the open-access dataset. The highest accuracy was 99.80% using only 5% of the features extracted from ResNet50 and Xception. The other way of combining the ensemble of deep features and a few recent works for the classification of COVID-19 were also compared with the proposed model. As a result, our proposed model has achieved the best success rate for this dataset and may be deployed to support decision systems for radiologists.

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).