The COVID-19 Tweets Classification Based on Recurrent Neural Network

Arif Dwi Laksito; Nuruddin Wiranda; Shofiyati Nur Karimah; Mardhiya Hayaty

doi:10.18517/ijaseit.14.1.18832

DOI : https://doi.org/10.18517/ijaseit.14.1.18832

The COVID-19 Tweets Classification Based on Recurrent Neural Network

Arif Dwi Laksito ⁽¹⁾, Nuruddin Wiranda ⁽²⁾, Shofiyati Nur Karimah ⁽³⁾, Mardhiya Hayaty ⁽⁴⁾

(1) Faculty of Computer Science, Universitas Amikom Yogyakarta, Yogyakarta, Indonesia

(2) Department of Computer Education, Lambung Mangkurat University, Banjarmasin, Indonesia

(3) Graduate School of Advanced Science, Japan Advanced Institute of Science and Technology (JAIST), Nomi, Ishikawa, Japan

(4) Faculty of Computer Science, Universitas Amikom Yogyakarta, Yogyakarta, Indonesia

Fulltext View | Download

How to cite (IJASEIT) :

[1]

A. D. Laksito, N. Wiranda, S. N. Karimah, and M. Hayaty, “The COVID-19 Tweets Classification Based on Recurrent Neural Network”, Int. J. Adv. Sci. Eng. Inf. Technol., vol. 14, no. 1, pp. 358–364, Feb. 2024.

Citation Format :

Due to its extensive use in both public and commercial contexts, sentiment analysis on Twitter has recently received much attention, particularly concerning tweets about COVID-19. Information about COVID-19 has been widely spread over social media, resulting in various views, opinions, and emotions about this pandemic, significantly impacting people's health. It is exceedingly challenging for the authorities to find these rumors on these public platforms manually. This paper proposes a framework for text classification using the RNN model and its updates, such as LSTM, BiLSTM, and GRU. This study aims to determine the best recurrent network model for handling cases of Twitter data classification. We utilized Twitter data relevant to COVID-19 and the lockdown with four classification classes (sad, joy, fear, and anger). In addition, this study aims to prove whether GloVe pre-trained word embedding can increase the accuracy of model predictions. The training and testing datasets were split into 80% and 20%, respectively. Therefore, in this experiment an early stopping technique was used with a limit of 15 epochs and a minimum delta of 0.01, meaning that training will be stopped if there is no improvement of 0.1% accuracy after 15 epochs. We used the f1-score average to measure the accuracy of the classification task results. The test results show that the BiLSTM model with GloVe word embedding yields the best f1-score compared to other models. Moreover, in all model testing, the f1-score value of the 'fear' class displays the highest accuracy compared to other classes.

H. Kaur, S. U. Ahsaan, B. Alankar, and V. Chang, “A Proposed Sentiment Analysis Deep Learning Algorithm for Analyzing COVID-19 Tweets,” Information Systems Frontiers, vol. 23, no. 6, pp. 1417–1429, 2021, doi: 10.1007/s10796-021-10135-7.

N. Chintalapudi, G. Battineni, and F. Amenta, “Sentimental analysis of COVID-19 tweets using deep learning models,” Infect Dis Rep, vol. 13, no. 2, pp. 329–339, 2021, doi: 10.3390/IDR13020032.

N. Yeasmin et al., “Analysis and Prediction of User Sentiment on COVID-19 Pandemic Using Tweets,” Big Data and Cognitive Computing, vol. 6, no. 2, 2022, doi: 10.3390/bdcc6020065.

C. Singh, T. Imam, S. Wibowo, and S. Grandhi, “A Deep Learning Approach for Sentiment Analysis of COVID-19 Reviews,” Applied Science, vol. 12, no. 8, 2022, doi: 10.3390/app12083709.

E. Alabdulkreem, “Prediction of depressed Arab women using their tweets,” J Decis Syst, vol. 30, no. 2–3, pp. 102–117, 2021, doi:10.1080/12460125.2020.1859745.

M. Edalati, A. S. Imran, Z. Kastrati, and S. M. Daudpota, “The Potential of Machine Learning Algorithms for Sentiment Classification of Students’ Feedback on MOOC,” Lecture Notes in Networks and Systems, vol. 296, pp. 11–22, 2022, doi: 10.1007/978-3-030-82199-9_2.

W. H. Bangyal et al., “Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches,” Comput Math Methods Med, vol. 2021, 2021, doi: 10.1155/2021/5514220.

D. S. Abdelminaam, F. H. Ismail, M. Taha, A. Taha, E. H. Houssein, and A. Nabil, “CoAID-DEEP: An Optimized Intelligent Framework for Automated Detecting COVID-19 Misleading Information on Twitter,” IEEE Access, vol. 9, no. December 2019, pp. 27840–27867, 2021, doi: 10.1109/access.2021.3058066.

A. S. Raamkumar, S. G. Tan, and H. L. Wee, “Use of health belief model–based deep learning classifiers for COVID-19 social media content to examine public perceptions of physical distancing: Model development and case study,” JMIR Public Health Surveill, vol. 6, no. 3, pp. 1–8, 2020, doi: 10.2196/20493.

P. Pathwar and S. Gill, “Tackling COVID-19 Infodemic Using Deep Learning,” Lecture Notes on Data Engineering and Communications Technologies, vol. 99, pp. 319–335, 2022, doi: 10.1007/978-981-16-7182-1_26.

S. H. Hamed, H. Elbakry, H. Elghareeb, and S. Elhishi, “Using XAI Techniques to Persuade Text Classifier Results: A Case Study of Covid-19 Tweets,” Indian J Sci Technol, vol. 15, no. 30, pp. 1484–1494, 2022, doi: 10.17485/ijst/v15i30.397.

M. N. Alenezi and Z. M. Alqenaei, “Machine learning in detecting covid-19 misinformation on twitter,” Future Internet, vol. 13, no. 10, pp. 1–20, 2021, doi: 10.3390/fi13100244.

R. Chandra and A. Krishna, “COVID-19 sentiment analysis via deep learning during the rise of novel cases,” PLoS One, vol. 16, no. 8 August, pp. 1–26, 2021, doi: 10.1371/journal.pone.0255615.

K. N. Alam et al., “Deep Learning-Based Sentiment Analysis of COVID-19 Vaccination Responses from Twitter Data,” Comput Math Methods Med, vol. 2021, 2021, doi: 10.1155/2021/4321131.

L. Miao, M. Last, and M. Litvak, “Tracking social media during the COVID-19 pandemic: The case study of lockdown in New York State,” Expert Syst Appl, vol. 187, p. 115797, 2022, doi:10.1016/j.eswa.2021.115797.

M. Al-Sarem, A. Alsaeedi, F. Saeed, W. Boulila, and O. Ameerbakhsh, “A novel hybrid deep learning model for detecting covid-19-related rumors on social media based on lstm and concatenated parallel cnns,” Applied Sciences (Switzerland), vol. 11, no. 17, 2021, doi:10.3390/APP11177940.

M. Arbane, R. Benlamri, Y. Brik, and A. D. Alahmar, “Social media-based COVID-19 sentiment classification model using Bi-LSTM,” Expert Syst Appl, vol. 212, no. November 2021, p. 118710, 2023, doi: 10.1016/j.eswa.2022.118710.

Q. G. To et al., “Applying machine learning to identify anti‐vaccination tweets during the covid‐19 pandemic,” Int J Environ Res Public Health, vol. 18, no. 8, 2021, doi: 10.3390/ijerph18084069.

M. Y. Kabir and S. Madria, “EMOCOV: Machine learning for emotion detection, analysis and visualization using COVID-19 tweets,” Online Soc Netw Media, vol. 23, no. September 2020, p. 100135, 2021, doi: 10.1016/j.osnem.2021.100135.

T. T. Mengistie and D. Kumar, “Deep Learning Based Sentiment Analysis on COVID-19 Public Reviews,” 3rd International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2021, no. April, pp. 444–449, 2021, doi: 10.1109/ICAIIC51459.2021.9415191.

H. Salehinejad, S. Sankar, J. Barfett, E. Colak, and S. Valaee, “Recent Advances in Recurrent Neural Networks,” Dec. 2017, doi:10.48550/arxiv.1801.01078.

S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” Neural Comput, vol. 9, no. 8, pp. 1735–1780, 1997, doi: 10.1162/neco.1997.9.8.1735.

Y. Imrana, Y. Xiang, L. Ali, and Z. Abdul-Rauf, “A bidirectional LSTM deep learning approach for intrusion detection,” Expert Syst Appl, vol. 185, p. 115524, Dec. 2021, doi:10.1016/j.eswa.2021.115524.

K. Cho et al., “Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation,” EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp. 1724–1734, Jun. 2014, doi:10.48550/arxiv.1406.1078.

J. Pennington, R. Socher, and C. D. Manning, “GloVe: Global Vectors for Word Representation,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 1532–1543. doi: 10.3115/v1/D14-1162.

K. S. Jones, “A statistical interpretation of term specificity and its application in retrieval,” Journal of Documentation, vol. 28, no. 1, pp. 11–21, 1972, doi: 10.1108/EB026526/full/pdf.

S. Robertson, “Understanding inverse document frequency: On theoretical arguments for IDF,” Journal of Documentation, vol. 60, no. 5, pp. 503–520, 2004, doi: 10.1108/00220410410560582/FULL/PDF.

Scott Deerwester, Susan T. Dumais, George W. Furnas, and Thomas K. Landauer, “Indexing by latent semantic analysis,” Journal of the American Society for Information Science , vol. 41, no. 6, pp. 391–407, 1990, doi: 10.1002/aris.1440380105.

S. Kumar, “Covid 19 Indian Sentiments on covid19 and lockdown,” Dataste of Twiiter sentiment of indians on covid 19. Accessed: January 07, 2023. [Online]. Available: https://www.kaggle.com/surajkum1198/twitterdata

A. Shewalkar, D. nyavanandi, and S. A. Ludwig, “Performance Evaluation of Deep neural networks Applied to Speech Recognition: Rnn, LSTM and GRU,” Journal of Artificial Intelligence and Soft Computing Research, vol. 9, no. 4, pp. 235–245, 2019, doi:10.2478/jaiscr-2019-0006.

R. Ni and H. Cao, “Sentiment Analysis based on GloVe and LSTM-GRU,” Chinese Control Conference, CCC, vol. 2020-July, pp. 7492–7497, Jul. 2020, doi: 10.23919/CCC50068.2020.9188578.

S. Kumar, “Covid 19 Indian Sentiments on covid19 and lockdown,” Dataste of Twiiter sentiment of indians on covid 19. Accessed: January 07, 2023. [Online]. Available: https://www.kaggle.com/surajkum1198/twitterdata

A. Shewalkar, D. nyavanandi, and S. A. Ludwig, “Performance Evaluation of Deep neural networks Applied to Speech Recognition: Rnn, LSTM and GRU,” Journal of Artificial Intelligence and Soft Computing Research, vol. 9, no. 4, pp. 235–245, 2019, doi:10.2478/jaiscr-2019-0006.

R. Ni and H. Cao, “Sentiment Analysis based on GloVe and LSTM-GRU,” Chinese Control Conference, CCC, vol. 2020-July, pp. 7492–7497, Jul. 2020, doi: 10.23919/CCC50068.2020.9188578.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution LicenseÂ that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (SeeÂ The Effect of Open Access).