Exploring a Hybrid Deep Learning Framework to Automatically Discover Topic and Sentiment in COVID-19 Tweets (2312.01178v1)
Abstract: COVID-19 has created a major public health problem worldwide and other problems such as economic crisis, unemployment, mental distress, etc. The pandemic is deadly in the world and involves many people not only with infection but also with problems, stress, wonder, fear, resentment, and hatred. Twitter is a highly influential social media platform and a significant source of health-related information, news, opinion and public sentiment where information is shared by both citizens and government sources. Therefore an effective analysis of COVID-19 tweets is essential for policymakers to make wise decisions. However, it is challenging to identify interesting and useful content from major streams of text to understand people's feelings about the important topics of the COVID-19 tweets. In this paper, we propose a new \textit{framework} for analyzing topic-based sentiments by extracting key topics with significant labels and classifying positive, negative, or neutral tweets on each topic to quickly find common topics of public opinion and COVID-19-related attitudes. While building our model, we take into account hybridization of BiLSTM and GRU structures for sentiment analysis to achieve our goal. The experimental results show that our topic identification method extracts better topic labels and the sentiment analysis approach using our proposed hybrid deep learning model achieves the highest accuracy compared to traditional models.
- Blei, David M., Andrew Y. Ng, and Michael I. Jordan. “Latent dirichlet allocation.” Journal of machine Learning research 3, no. Jan (2003): 993-1022.
- Jahanbin, Kia, and Vahid Rahmanian. “Using twitter and web news mining to predict COVID-19 outbreak.” Asian Pacific journal of tropical medicine 13, no. 8 (2020): 378.
- Malta, Monica, Anne W. Rimoin, and Steffanie A. Strathdee. “The coronavirus 2019-nCoV epidemic: Is hindsight 20/20?.” EClinicalMedicine 20 (2020).
- Mehta, Pooja, and Sharnil Pandya. “A review on sentiment analysis methodologies, practices and applications.” International Journal of Scientific and Technology Research 9, no. 2 (2020): 601-609.
- El Rahman, Sahar A., Feddah Alhumaidi AlOtaibi, and Wejdan Abdullah AlShehri. “Sentiment analysis of twitter data.” In 2019 international conference on computer and information sciences (ICCIS), pp. 1-4. IEEE, 2019.
- O’brien, Michael, Kathleen Moore, and Fiona McNicholas. “Social media spread during Covid-19: the pros and cons of likes and shares.” Ir Med J 113, no. 4 (2020): 52.
- Prabha, M. Indhraom, and G. Umarani Srikanth. “Survey of sentiment analysis using deep learning techniques.” In 2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT), pp. 1-9. IEEE, 2019.
- Performing Sentiment Analysis Using Twitter Data!,https://www.analyticsvidhya.com/blog/2021/07/performing-sentiment-analysis-using-twitter-data/, Accessed: 2022-09-28.
- Ezen-Can, Aysu. “A Comparison of LSTM and BERT for Small Corpus.” arXiv preprint arXiv:2009.05451 (2020).
- Zhang, Lei, Shuai Wang, and Bing Liu. “Deep learning for sentiment analysis: A survey.” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 8, no. 4 (2018): e1253.
- Guo, Lei, Chris J. Vargo, Zixuan Pan, Weicong Ding, and Prakash Ishwar. “Big social data analytics in journalism and mass communication: Comparing dictionary-based text analysis and unsupervised topic modeling.” Journalism & Mass Communication Quarterly 93, no. 2 (2016): 332-359.
- Wang, Bo, Maria Liakata, Arkaitz Zubiaga, and Rob Procter. “A hierarchical topic modelling approach for tweet clustering.” In International Conference on Social Informatics, pp. 378-390. Springer, Cham, 2017.
- Asmussen, Claus Boye, and Charles Møller. “Smart literature review: a practical topic modelling approach to exploratory literature review.” Journal of Big Data 6, no. 1 (2019): 1-18.
- Hourani, Asseel S. “Arabic Topic Labeling using Naïve Bayes (NB).” In 2021 12th International Conference on Information and Communication Systems (ICICS), pp. 478-479. IEEE, 2021.
- Hingmire, Swapnil, Sandeep Chougule, Girish K. Palshikar, and Sutanu Chakraborti. “Document classification by topic labeling.” In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, pp. 877-880. 2013.
- Abd-Alrazaq, Alaa, Dari Alhuwail, Mowafa Househ, Mounir Hamdi, and Zubair Shah. “Top concerns of tweeters during the COVID-19 pandemic: infoveillance study.” Journal of medical Internet research 22, no. 4 (2020): e19016.
- Sharfuddin, Abdullah Aziz, Md Nafis Tihami, and Md Saiful Islam. “A deep recurrent neural network with bilstm model for sentiment classification.” In 2018 International conference on Bangla speech and language processing (ICBSLP), pp. 1-4. IEEE, 2018.
- Zhang, Yuteng, Wenpeng Lu, Weihua Ou, Guoqiang Zhang, Xu Zhang, Jinyong Cheng, and Weiyu Zhang. “Chinese medical question answer selection via hybrid models based on CNN and GRU.” Multimedia tools and applications 79, no. 21 (2020): 14751-14776.
- Mikolov, Tomas, Kai Chen, Greg Corrado, and Jeffrey Dean. “Efficient estimation of word representations in vector space.” arXiv preprint arXiv:1301.3781 (2013).
- Sarker, Iqbal H., A. S. M. Kayes, and Paul Watters. “Effectiveness analysis of machine learning classification models for predicting personalized context-aware smartphone usage.” Journal of Big Data 6, no. 1 (2019): 1-28.
- Habibabadi, Sedigheh Khademi, and Pari Delir Haghighi. “Topic modelling for identification of vaccine reactions in twitter.” In Proceedings of the Australasian Computer Science Week Multiconference, pp. 1-10. 2019.
- Jang, Beakcheol, Inhwan Kim, and Jong Wook Kim. “Word2vec convolutional neural networks for classification of news articles and tweets.” PloS one 14, no. 8 (2019): e0220976.
- Rustam, Furqan, Arif Mehmood, Muhammad Ahmad, Saleem Ullah, Dost Muhammad Khan, and Gyu Sang Choi. “Classification of shopify app user reviews using novel multi text features.” IEEE Access 8 (2020): 30234-30244.
- Kamila, Sabyasachi, Mohammad Hasanuzzaman, Asif Ekbal, and Pushpak Bhattacharyya. “Resolution of grammatical tense into actual time, and its application in Time Perspective study in the tweet space.” PloS one 14, no. 2 (2019): e0211872.
- Li, Jianjun, Yu Han, Ming Zhang, Gang Li, and Baohua Zhang. “Multi-scale residual network model combined with Global Average Pooling for action recognition.” Multimedia Tools and Applications 81, no. 1 (2022): 1375-1393.
- Sarker, Iqbal H. “Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions.” SN Computer Science 2, no. 6 (2021): 1-20.
- Siami-Namini, Sima, Neda Tavakoli, and Akbar Siami Namin. “The performance of LSTM and BiLSTM in forecasting time series.” In 2019 IEEE International Conference on Big Data (Big Data), pp. 3285-3292. IEEE, 2019.
- Hochreiter, Sepp, and Jürgen Schmidhuber. “Long short-term memory.” Neural computation 9, no. 8 (1997): 1735-1780.
- Sidorov, Grigori, Alexander Gelbukh, Helena Gómez-Adorno, and David Pinto. “Soft similarity and soft cosine measure: Similarity of features in vector space model.” Computación y Sistemas 18, no. 3 (2014): 491-504.
- Boussaadi, Smail, Hassina Aliane, and Pr Ouahabi Abdeldjalil. “The researchers profile with topic modeling.” In 2020 IEEE 2nd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), pp. 1-6. IEEE, 2020.
- Sarker, Iqbal H. “Machine learning: Algorithms, real-world applications and research directions.” SN Computer Science 2, no. 3 (2021): 1-21.
- Khurana, Himja, and Sanjib Kumar Sahu. “Bat inspired sentiment analysis of Twitter data.” In Progress in Advanced Computing and Intelligent Engineering, pp. 639-650. Springer, Singapore, 2018.
- Ordun, Catherine, Sanjay Purushotham, and Edward Raff. “Exploratory analysis of covid-19 tweets using topic modeling, umap, and digraphs.” arXiv preprint arXiv:2005.03082 (2020).
- Chandrasekaran, Ranganathan, Vikalp Mehta, Tejali Valkunde, and Evangelos Moustakas. “Topics, trends, and sentiments of tweets about the COVID-19 pandemic: Temporal infoveillance study.” Journal of medical Internet research 22, no. 10 (2020): e22624.
- Pennington, Jeffrey, Richard Socher, and Christopher D. Manning. “Glove: Global vectors for word representation.” In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532-1543. 2014.
- Bojanowski, Piotr, Edouard Grave, Armand Joulin, and Tomas Mikolov. “Enriching word vectors with subword information.” Transactions of the association for computational linguistics 5 (2017): 135-146.
- Naseem, Usman, Imran Razzak, Matloob Khushi, Peter W. Eklund, and Jinman Kim. “COVIDSenti: A large-scale benchmark Twitter data set for COVID-19 sentiment analysis.” IEEE Transactions on Computational Social Systems 8, no. 4 (2021): 1003-1015.
- Satu, Md Shahriare, Md Imran Khan, Mufti Mahmud, Shahadat Uddin, Matthew A. Summers, Julian MW Quinn, and Mohammad Ali Moni. “TClustVID: A novel machine learning classification model to investigate topics and sentiment in COVID-19 tweets.” Knowledge-Based Systems 226 (2021): 107126.
- Xiang, Xiaoling, Xuan Lu, Alex Halavanau, Jia Xue, Yihang Sun, Patrick Ho Lam Lai, and Zhenke Wu. “Modern senicide in the face of a pandemic: An examination of public discourse and sentiment about older adults and COVID-19 using machine learning.” The Journals of Gerontology: Series B 76, no. 4 (2021): e190-e200.
- Medford, Richard J., Sameh N. Saleh, Andrew Sumarsono, Trish M. Perl, and Christoph U. Lehmann. “An “infodemic”: leveraging high-volume Twitter data to understand early public sentiment for the coronavirus disease 2019 outbreak.” In Open forum infectious diseases, vol. 7, no. 7, p. ofaa258. US: Oxford University Press, 2020.
- Rustam, Furqan, Madiha Khalid, Waqar Aslam, Vaibhav Rupapara, Arif Mehmood, and Gyu Sang Choi. “A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis.” Plos one 16, no. 2 (2021): e0245909.
- Xue, Jia, Junxiang Chen, Chen Chen, Chengda Zheng, Sijia Li, and Tingshao Zhu. “Public discourse and sentiment during the COVID 19 pandemic: Using Latent Dirichlet Allocation for topic modeling on Twitter.” PloS one 15, no. 9 (2020): e0239441.
- Ahmed, Md Shoaib, Tanjim Taharat Aurpa, and Md Musfique Anwar. “Detecting sentiment dynamics and clusters of Twitter users for trending topics in COVID-19 pandemic.” Plos one 16, no. 8 (2021): e0253300.
- Stappen, Lukas, Alice Baird, Erik Cambria, and Björn W. Schuller. “Sentiment analysis and topic recognition in video transcriptions.” IEEE Intelligent Systems 36, no. 2 (2021): 88-95.
- Jelodar, Hamed, Yongli Wang, Rita Orji, and Shucheng Huang. “Deep sentiment classification and topic discovery on novel coronavirus or COVID-19 online discussions: NLP using LSTM recurrent neural network approach.” IEEE Journal of Biomedical and Health Informatics 24, no. 10 (2020): 2733-2742.
- Barkur, Gopalkrishna, and Giridhar B. Kamath. “Sentiment analysis of nationwide lockdown due to COVID 19 outbreak: Evidence from India.” Asian journal of psychiatry 51 (2020): 102089.
- Imran, Ali Shariq, Sher Muhammad Daudpota, Zenun Kastrati, and Rakhi Batra. “Cross-cultural polarity and emotion detection using sentiment analysis and deep learning on COVID-19 related tweets.” Ieee Access 8 (2020): 181074-181090.
- Nemes, László, and Attila Kiss. “Social media sentiment analysis based on COVID-19.” Journal of Information and Telecommunication 5, no. 1 (2021): 1-15.
- Jiang, Qingnan, Lei Chen, Wei Zhao, and Min Yang. “Toward aspect-level sentiment modification without parallel data.” IEEE Intelligent Systems 36, no. 1 (2021): 75-81.
- Kaur, Simranpreet, Pallavi Kaul, and Pooya Moradian Zadeh. “Monitoring the dynamics of emotions during COVID-19 using Twitter data.” Procedia Computer Science 177 (2020): 423-430.
- Poria, Soujanya, Navonil Majumder, Devamanyu Hazarika, Erik Cambria, Alexander Gelbukh, and Amir Hussain. “Multimodal sentiment analysis: Addressing key issues and setting up the baselines.” IEEE Intelligent Systems 33, no. 6 (2018): 17-25.
- Behera, Ranjan Kumar, Monalisa Jena, Santanu Kumar Rath, and Sanjay Misra. “Co-LSTM: Convolutional LSTM model for sentiment analysis in social big data.” Information Processing & Management 58, no. 1 (2021): 102435.
- Zeng, Daojian, Yuan Dai, Feng Li, R. Simon Sherratt, and Jin Wang. “Adversarial learning for distant supervised relation extraction.” Computers, Materials & Continua 55, no. 1 (2018): 121-136.
- Wang, Wenya, Sinno Jialin Pan, Daniel Dahlmeier, and Xiaokui Xiao. “Coupled multi-layer attentions for co-extraction of aspect and opinion terms.” In Proceedings of the AAAI conference on artificial intelligence, vol. 31, no. 1. 2017.
- Sarker, Iqbal H. “Data science and analytics: an overview from data-driven smart computing, decision-making and applications perspective.” SN Computer Science 2, no. 5 (2021): 1-22.
- Sohrabi, Catrin, Zaid Alsafi, Niamh O’neill, Mehdi Khan, Ahmed Kerwan, Ahmed Al-Jabir, Christos Iosifidis, and Riaz Agha. “World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19).” International journal of surgery 76 (2020): 71-76.
- Khandaker Tayef Shahriar (2 papers)
- Iqbal H. Sarker (36 papers)