Comparison between the Stemmer Porter Effect and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism

Alam Rahmatulloh (1), Neng Ika Kurniati (2), Irfan Darmawan (3), Adi Zaenal Asyikin (4), Deden Witarsyah J (5)
(1) Siliwangi University
(2) Siliwangi University
(3) Telkom University
(4) Siliwangi University
(5) Telkom University
How to cite (IJASEIT):
Rahmatulloh, Alam, et al. “Comparison Between the Stemmer Porter Effect and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism”. International Journal on Advanced Science, Engineering and Information Technology, vol. 9, no. 4, Aug. 2019, pp. 1124-8, doi:10.18517/ijaseit.9.4.8844.
Current technological developments are shifting documents from physical paper to digital form, and this shift has a very high impact. The positive impact is that paper waste is reduced; on the other hand, the rampant copying of digital data is increasing the amount of plagiarism. Experts have made many efforts to overcome the problem of plagiarism, one of which is to use the winnowing algorithm as a tool for detecting plagiarized data. Many efforts to optimize the winnowing algorithm have applied stemming techniques, and the most widely used stemming algorithms include the Porter stemmer and the Nazief-Adriani stemmer. However, the effect of these stemmers on the performance of the winnowing algorithm in measuring plagiarism has not yet been compared. This study therefore examines the effect of stemming algorithms on the winnowing algorithm so that plagiarism detection results can be improved. The results indicate that the Nazief-Adriani stemmer has the more favorable effect on detection, decreasing the similarity value by only 0.28%, while the Porter stemmer is superior in processing time, at 69% faster.
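As a rough illustration of the pipeline the abstract describes (stemming as a preprocessing step, followed by winnowing fingerprints and a similarity score), the following is a minimal sketch and not the authors' implementation. The k-gram length `k`, window size `w`, the MD5-based hash, and the Jaccard-style similarity are all assumptions made for the example. `toy_stem` is a hypothetical placeholder suffix stripper, not the Porter or Nazief-Adriani algorithm; a real stemmer would be passed in its place.

```python
import hashlib
import re


def toy_stem(word):
    """Hypothetical placeholder stemmer: strips a few English suffixes.

    This is NOT Porter or Nazief-Adriani; it only marks where a real
    stemmer would plug into the pipeline.
    """
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def kgram_hashes(text, k):
    """Hash every k-character window of the normalized text."""
    return [
        int(hashlib.md5(text[i:i + k].encode("utf-8")).hexdigest()[:8], 16)
        for i in range(len(text) - k + 1)
    ]


def winnow(hashes, w):
    """Keep the minimum hash of each window of w hashes (rightmost on ties)."""
    selected = set()
    for i in range(max(len(hashes) - w + 1, 1)):
        window = hashes[i:i + w]
        if not window:
            break
        smallest = min(window)
        # Position of the rightmost occurrence of the minimum in this window.
        offset = len(window) - 1 - window[::-1].index(smallest)
        selected.add((smallest, i + offset))
    return selected


def fingerprint(text, stem=lambda word: word, k=5, w=4):
    """Lowercase, keep letters only, stem each word, then winnow."""
    words = re.findall(r"[a-z]+", text.lower())
    normalized = "".join(stem(word) for word in words)
    return {h for h, _ in winnow(kgram_hashes(normalized, k), w)}


def similarity(doc_a, doc_b, **options):
    """Jaccard similarity of the two fingerprint sets, as a percentage."""
    fa = fingerprint(doc_a, **options)
    fb = fingerprint(doc_b, **options)
    if not fa and not fb:
        return 0.0
    return 100 * len(fa & fb) / len(fa | fb)


if __name__ == "__main__":
    a = "detecting copied documents"
    b = "detect copied document"
    print(similarity(a, b))                 # no stemming
    print(similarity(a, b, stem=toy_stem))  # with the placeholder stemmer
```

With `toy_stem`, both example strings normalize to the same character sequence, so their fingerprints coincide. Swapping in a different stemmer changes only the `stem` argument, which is the kind of substitution the study compares in terms of similarity value and processing time.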
