Performances Analysis of Heart Disease Dataset using Different Data Mining Classifications

Wan Hajarul Asikin Wan Zunaidi (1), RD Rohmat Saedudin (2), Zuraini Ali Shah (3), Shahreen Kasim (4), Choon Sen Seah (5), Maman Abdurohman (6)
(1) Faculty of Computing, Universiti Teknologi Malaysia, Johor, Malaysia
(2) School of Industrial Engineering, Telkom University, 40257 Bandung, West Java, Indonesia
(3) Faculty of Computing, Universiti Teknologi Malaysia, Johor, Malaysia
(4) Universiti Tun Hussein Onn Malaysia
(5) 3Faculty of Computer Sciences and Information Technology, Universiti Tun Hussein Onn Malaysia, Batu Pahat, Malaysia
(6) School of Computing, Telkom University, 40257 Bandung, West Java, Indonesia
Fulltext View | Download
How to cite (IJASEIT) :
Wan Zunaidi, Wan Hajarul Asikin, et al. “Performances Analysis of Heart Disease Dataset Using Different Data Mining Classifications”. International Journal on Advanced Science, Engineering and Information Technology, vol. 8, no. 6, Dec. 2018, pp. 2677-82, doi:10.18517/ijaseit.8.6.5042.
nowadays, heart disease is one of the major diseases that cause death. It is a matter for us to concern in today’s highly chaotic life style that leads to various diseases. Early prediction of identification to heart-related diseases has been investigated by many researchers. The death rate can be further brought down if we can predict or identify the heart disease earlier. There are many studies that explore the different classification algorithms for classification and prediction of heart disease. This research studied the prediction of heart disease by using five different techniques in WEKA tools by using the input attributes of the dataset. This research used 13 attributes, such as sex, blood pressure, cholesterol and other medical terms to detect the likelihood of a patient getting heart disease. The classification techniques, namely J48, Decision Stump, Random Forest, Sequential Minimal Optimization (SMO), and Multilayer Perceptron used to analyze the heart disease. Performance measurement for this study are the accuracy of correct classification, mean absolute error and kappa statistics of the classifier. The result shows that Multilayer Perceptron Neural Networks is the most suited for early prediction of heart diseases.

Dangare, C. S., & Apte, S. S. (2012). Improved study of heart disease prediction system using data mining classification techniques. International Journal of Computer Applications, 47(10), 44-48.

Bhatla, N., & Jyoti, K. (2012). An analysis of heart disease prediction using different data mining techniques. International Journal of Engineering, 1(8), 1-4.

Seah, C. S., Kasim, S., Fudzee, M. F., Ping, J. M., Mohamad, M. S., Saedudin, R. R., & Ismail, M. A. (2017). An enhanced topologically significant directed random walk in cancer classification using gene expression datasets. Saudi Journal of Biological Sciences, 24(8), 1828-1841.

Khemphila, A., & Boonjing, V. (2011). Heart disease classification using the neural network and feature selection. In Systems Engineering (ICSEng), 2011 21st International Conference on (pp. 406-409). IEEE.

Palaniappan, S., Awang, R., Intelligent Disease Prediction System Using Data Mining Techniques, IJCSNS International Journal of Computer Science and Network Security. 8(8): 343-350 (2008).

Chaurasia, V., & Pal, S. (2014). Data mining approach to detect heart diseases.

Symbology of the Logical Decision Tree. (2017). Decision-Making Management, 99-100. doi:10.1016/b978-0-12-811540-4.09979-8 Available from: http://en.wikipedia.org.[Last accessed on May 11].

Leo Breiman (2001). Random Forests. Machine Learning. 45(1), pp.5-32.

Palaniappan, S., Awang, R., Intelligent Disease Prediction System Using Data Mining Techniques, IJCSNS International Journal of Computer Science and Network Security. 8(8): 343-350 (2008).

Capilla, C. (2014). Multilayer perceptron and regression modelling to forecast hourly nitrogen dioxide concentrations. Air Pollution XXII. doi:10.2495/air140041

GainRatioAttributeEval. (2017, December 22). Retrieved from http://weka.sourceforge.net/doc.dev/weka/attributeSelection/GainRatioAttributeEval.html

CorrelationAttributeEval. (2017, December 22). Retrieved from http://weka.sourceforge.net/doc.dev/weka/attributeSelection/CorrelationAttributeEval.html

OneRAttributeEval. (2017, December 22). Retrieved from http:// http://weka.sourceforge.net/doc.stable-3-8/index.html?weka/attributeSelection/OneRAttributeEval.html

CfsSubsetEval. (2017, December 22). Retrieved from http://weka.sourceforge.net/doc.dev/weka/attributeSelection/CfsSubsetEval.html

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).