Comparative Analysis of Data Mining Techniques for Malaysian Rainfall Prediction

Suhaila Zainudin (1), Dalia Sami Jasim (2), Azuraliza Abu Bakar (3)
(1) Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia
(2) Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia
(3) Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia
Fulltext View | Download
How to cite (IJASEIT) :
Zainudin, Suhaila, et al. “Comparative Analysis of Data Mining Techniques for Malaysian Rainfall Prediction”. International Journal on Advanced Science, Engineering and Information Technology, vol. 6, no. 6, Dec. 2016, pp. 1148-53, doi:10.18517/ijaseit.6.6.1487.
Climate change prediction analyses the behaviours of weather for a specific time. Rainfall forecasting is a climate change task where specific features such as humidity and wind will be used to predict rainfall in specific locations. Rainfall prediction can be achieved using classification task under Data Mining. Different techniques lead to different performances depending on rainfall data representation including representation for long term (months) patterns and short-term (daily) patterns. Selecting an appropriate technique for a specific duration of rainfall is a challenging task. This study analyses multiple classifiers such as Naí¯ve Bayes, Support Vector Machine, Decision Tree, Neural Network and Random Forest for rainfall prediction using Malaysian data. The dataset has been collected from multiple stations in Selangor, Malaysia. Several pre-processing tasks have been applied in order to resolve missing values and eliminating noise. The experimental results show that with small training data (10%) from 1581 instances Random Forest correctly classified 1043 instances. This is the strength of an ensemble of trees in Random Forest where a group of classifiers can jointly beat a single classifier.

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).