Rate Movie App: Implementation of K-Nearest Neighbors Algorithm in the Development of Decision Support System for Philippine Movie Rating and Classification

Ian Dexter M. Siñel (1), Benilda Eleonor V. Comendador (2)
(1) Polytechnic University of the Philippines Graduate School
(2) Polytechnic University of the Philippines Graduate School
Fulltext View | Download
How to cite (IJASEIT) :
Siñel, Ian Dexter M., and Benilda Eleonor V. Comendador. “Rate Movie App: Implementation of K-Nearest Neighbors Algorithm in the Development of Decision Support System for Philippine Movie Rating and Classification”. International Journal on Advanced Science, Engineering and Information Technology, vol. 9, no. 1, Jan. 2019, pp. 92-99, doi:10.18517/ijaseit.9.1.7579.
Movies that are publicly exhibited in the Philippine Cinema, regardless if produced locally (local films) and/or outside the country (foreign films) undergo a thorough evaluation before public exhibition to properly identify suited audiences. There are many factors that contribute to the classification and rating of a specific movie. Movies play a vital role for Filipino culture as for some people; these serve as their leisure activity, for other people, these are not just a leisure activity instead a form of visual art that may send important messages to the audiences and/or may re-enact human personal experiences. It is very important that movie(s) will be classified accordingly without any form of biases. This paper promotes a Decision Support System that can be used in predicting movie classification and rating using historically evaluated movies from 2010 to 2017. The study considers the user ratings on the following attributes: Sex & Nudity, Violence & Gore, Profanity, Alcohol, Drugs & Smoking and Frightening and Intense Scenes scrapped from a public movie database. Along with these considerations are the genre(s) associated with a movie. The study conducted revealed that K-Nearest Neighbors Algorithm outperforms Naive Bayes and J48/C4.5 Algorithm in classifying Philippine Movie rating with 92.80% accuracy as compared to 68.70% and 56.79% for Naive Bayes and J48/C4.5 algorithm respectively. The developed decision support system implements the K-Nearest Neighbors algorithm to satisfy the objectives mentioned. With this, Review Committees who evaluate movies may have guides in making critical decisions in the domain of movie evaluation.

B. Dalisay. (2014) The Wealth Within Us. [Online]. Available: http://www.philstar.com/arts-and-culture

L. Desiderio. (2015) DTI Seeks Investments in Creative Industries. [Online]. Available: http://www.philstar.com:8080/business/2012/07/15/827808/dti-seeks-investments-creative-industries

I. E. Valera. (2015) “Perceived Status of the Filipino Film Industry: Implications for Media Education” International Conference in Language Learning and Teaching at HCT Men’s College UAE, vol. 8(1), pp. 85-89.

(2018) The MTRCB Official Website. [Online]. Available: http://www.mtrcb.gov.ph

S. Rasheedudin. (2013) “The Theoretical Framework of Data Mining and Its Techniques” International Journal of Social Science, vol. 2(1), pp. 81-85. [Online]. Available: http://www.indianresearchjournals.com/pdf/IJSSIR/2013/January/9.pdf

(2018) Internet Movie Database Official Website. [Online]. Available: http://www.imdb.com

(2018) Waikato Environment for Knowledge Analysis Official Website. [Online]. Available: http://www.cs.waikato.ac.nz/weka/

F. Eibe. (2016) Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kau). [Online]. Available: ftp://ftp.ingv.it/pub/manuela.sbarra/Data Mining Practical Machine Learning Tools and Techniques - WEKA.pdf

(2018) Visual Studio Express Official Website. [Online]. Available: http://www.visualstudio.microsoft.com

L. Garcia and C. Masigan. (2001). An In-depth Study on the Film Industry in the Philippines. [Online]. Available: https://www.dirp4.pids.gov.ph/ris/taps/tapspp0103.pdf

S. Deshpande and V. Thakare. (2010) “Data Mining System and Applications: A Review” International Journal of Distributed and Parallel Systems, vol. 1(1), pp. 32-44. [Online]. Available: https://doi.org/10.5121/ijdps.2010.1103

L. Marlina, M. Lim and A. P. Utama Siahaan. (2016) “Data Mining Classification Comparison (Naí¯ve Bayes and C4.5 Algorithms) International Journal of Engineering Trends and Technology, vol. 38(7), pp. 380-383. [Online]. Available: https:doi.org/10.14445/22315381/IJETT-V38P268

M. Komorowski, D. Marshall, J. Salciccioli and Y. Crutain. (2016) “Exploratory Data Analysis” Reseach Gate DOI: 10.1007/978-3-319-43742-2_15. pp. 185-203

D. Madigan. (2010) Descriptive Modeling. [Online]. Available: http://www.stat.columbia.edu/~madigan/DM08/descriptive.ppt.pdf

F. Halili and A. Rustemi. (2016) “Predictive Modeling: Data Mining Regression Technique Applied in Prototype” Internatonal Journal of Computer Science and Mobile Computing, vol. 5(8), pp. 207-215

T. Silwattananusarn and K. Tuamsuk. (2012) “Data Mining and Its Application for Knowledge Management: A Literature Review from 2007 to 2012” International Journal of Data Mining and Knowledge Management Process (IJDKP), vol. 2(5), pp. 13-24. [Online]. Available: https://doi.org/10.5121/ijdkp.2012.2502

M. Ramageri. (2010) “Data Mining Techniques and Its Application” Indian Journal of Computer Science and Engineering, vol. 1(4), pp. 301-305

F. Solomon, D. Abebe and S. Bhabani. (2014) “A Comparative Study on Performance Evaluation of Eager versus Lazy Learning Methods” International Journal of Computer Science and Mobile Computing, vol. 3(3), pp. 562-568

A. Kulkarni and R. Maclin. (n.d.) A Nearest Neighbor Approach using Clustering in Netflix Prize Data. [Online]. Available: https://dirp4.pids.gov.ph/ris/taps/tapspp0103.pdf

K. Teknomo. (2010) K-Nearest Neighbors Tutorial. [Online]. Available: https://people.revoluedu.com/kardi/tutorial/KNN

B. Tay, J. K. Hyun and S. Oh. (2014) “A Machine Learning Approach for Specification of Spinal Cord Injuries using Fractional Anisotrophy Values Obtained from Diffusion Tensor Images” Computational and Mathematica Methods in Medicne, Hindawi Publishing Corporation, vol. 2014. [Online]. Available: https://doi.org/10.1155/2014/276589

D. Sontag. (n.d.) Nearest Neighbor Methods. [Online]. Available: http://people.csail.mit.edu/dsontag/courses/ml13/slides/lecture11.pdf

M. Marovic, M. Mihokovic, M. Miksa, S. Pribil and A. Tus. (2011) “Automatic Movie Ratings Prediction using Machine Learning” in 2011 Proceedings of the 34th International Convention MIPRO, pp. 1640-1645

S. P. Sahu. (2017) “Machine Learning Algorithms for Recommender System - A Comparative Analysis” International Journal of Computer Applications Technology and Research, vol. 6(2), pp. 97-100

Y. Peng, Y. Duan and Z. Zou. (n.d.) Movielens: Several Approaches to Rating Prediction. [Online]. Available: https:// https://dirp4.pids.gov.ph/ris/taps/tapspp0103.pdf

(2018) MS Excel Official Website. [Online]. Available: https://www.office.live.com/start/Excel.aspx

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).