Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Arabic Sentiment Analysis with Noisy Deep Explainable Model (2309.13731v2)

Published 24 Sep 2023 in cs.CL and cs.AI

Abstract: Sentiment Analysis (SA) is an indispensable task for many real-world applications. Compared to limited resourced languages (i.e., Arabic, Bengali), most of the research on SA are conducted for high resourced languages (i.e., English, Chinese). Moreover, the reasons behind any prediction of the Arabic sentiment analysis methods exploiting advanced AI-based approaches are like black-box - quite difficult to understand. This paper proposes an explainable sentiment classification framework for the Arabic language by introducing a noise layer on Bi-Directional Long Short-Term Memory (BiLSTM) and Convolutional Neural Networks (CNN)-BiLSTM models that overcome over-fitting problem. The proposed framework can explain specific predictions by training a local surrogate explainable model to understand why a particular sentiment (positive or negative) is being predicted. We carried out experiments on public benchmark Arabic SA datasets. The results concluded that adding noise layers improves the performance in sentiment analysis for the Arabic language by reducing overfitting and our method outperformed some known state-of-the-art methods. In addition, the introduced explainability with noise layer could make the model more transparent and accountable and hence help adopting AI-enabled system in practice.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. Sentimental analysis on social media comments with recurring models and pretrained word embeddings in portuguese. In Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval, pages 205–209, 2022.
  2. Bing Liu. Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, 5(1):1–167, 2012.
  3. A comprehensive survey of arabic sentiment analysis. Information processing & management, 56(2):320–342, 2019.
  4. A large scale arabic sentiment lexicon for arabic opinion mining. In Proceedings of the EMNLP 2014 workshop on arabic natural language processing (ANLP), pages 165–173, 2014.
  5. Deep learning models for sentiment analysis in arabic. In Proceedings of the second workshop on Arabic natural language processing, pages 9–17, 2015.
  6. Aroma: A recursive deep learning model for opinion mining in arabic as a low resource language. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 16(4):1–20, 2017.
  7. Arsentd-lev: A multi-topic corpus for target-based sentiment analysis in arabic levantine tweets. arXiv preprint arXiv:1906.01830, 2019.
  8. Arabic sentiment classification using convolutional neural network and differential evolution algorithm. Computational intelligence and neuroscience, 2019, 2019a.
  9. Mazajak: An online arabic sentiment analyser. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, pages 192–198, 2019.
  10. Arabert: Transformer-based model for arabic language understanding. arXiv preprint arXiv:2003.00104, 2020.
  11. " why should i trust you?" explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pages 1135–1144, 2016.
  12. Sherin Mary Mathews. Explainable artificial intelligence applications in nlp, biomedical, and malware classification: a literature review. In Intelligent computing-proceedings of the computing conference, pages 1269–1292. Springer, 2019.
  13. Word embeddings for arabic sentiment analysis. In 2016 IEEE International Conference on Big Data (Big Data), pages 3820–3825. IEEE, 2016.
  14. Labr: A large scale arabic book reviews dataset. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 494–498, 2013.
  15. Building large arabic multi-domain resources for sentiment analysis. In International conference on intelligent text processing and computational linguistics, pages 23–34. Springer, 2015.
  16. Sentence-level and document-level sentiment mining for arabic texts. In 2010 IEEE international conference on data mining workshops, pages 1114–1119. IEEE, 2010.
  17. Subjectivity and sentiment analysis of modern standard arabic. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 587–591, 2011.
  18. A comparative study of effective approaches for arabic sentiment analysis. Information Processing & Management, 58(2):102438, 2021.
  19. Sentence-level arabic sentiment analysis. In 2012 international conference on collaboration technologies and systems (CTS), pages 546–550. IEEE, 2012.
  20. A hybrid approach for sentiment classification of egyptian dialect tweets. In 2015 First International Conference on Arabic Computational Linguistics (ACLing), pages 78–85. IEEE, 2015.
  21. Sentiment analysis in arabic tweets. In 2014 5th international conference on information and communication systems (ICICS), pages 1–6. IEEE, 2014.
  22. Machine learning-based model for sentiment and sarcasm detection. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, pages 386–389, 2021.
  23. Hybrid sentiment analyser for arabic tweets using r. In 2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K), volume 1, pages 417–424. IEEE, 2015.
  24. A review of sentiment analysis research in arabic language. Future Generation Computer Systems, 112:408–430, 2020.
  25. Sentiment analysis of arabic tweets using deep learning. Procedia Computer Science, 142:114–122, 2018.
  26. Proceedings of the sixth arabic natural language processing workshop. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021.
  27. Combining context-free and contextualized representations for arabic sarcasm detection and sentiment identification. arXiv preprint arXiv:2103.05683, 2021.
  28. Word representations in vector space and their applications for arabic. In International Conference on Intelligent Text Processing and Computational Linguistics, pages 430–443. Springer, 2015.
  29. Leveraging grammatical roles for measuring semantic similarity between texts. IEEE Access, 9:62972–62983, 2021.
  30. Md Shajalal and Masaki Aono. Sentence-level semantic textual similarity using word-level semantics.
  31. Query subtopic diversification based on cluster ranking and semantic features. In 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA), pages 1–6. IEEE, 2016.
  32. Md Shajalal and Masaki Aono. Coverage-based query subtopic diversification leveraging semantic relevance. Knowledge and Information Systems, 62:2873–2891, 2020.
  33. hulmona: The universal language model in arabic. In Proceedings of the fourth arabic natural language processing workshop, pages 68–77, 2019.
  34. Bert-cnn for offensive speech identification in social media. In Proceedings of the Fourteenth Workshop on Semantic Evaluation", Barcelona (online)", International Committee for Computational Linguistics, 2020.
  35. Anshul Wadhawan. Arabert and farasa segmentation based approach for sarcasm and sentiment detection in arabic tweets. arXiv preprint arXiv:2103.01679, 2021.
  36. Leveraging offensive language for sarcasm and sentiment detection in arabic. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, pages 364–369, 2021.
  37. A survey of the state of explainable ai for natural language processing. arXiv preprint arXiv:2010.00711, 2020.
  38. Adversarial noise layer: Regularize neural network by adding noise. In 2019 IEEE International Conference on Image Processing (ICIP), pages 909–913. IEEE, 2019.
  39. Adding gradient noise improves learning for very deep networks. arXiv preprint arXiv:1511.06807, 2015.
  40. Attention mechanism architecture for arabic sentiment analysis. ACM Transactions on Asian and Low-Resource Language Information Processing, 22(4):1–26, 2023.
  41. Sentiment analysis for arabic language using attention-based simple recurrent unit. In 2019 2nd International Conference on new Trends in Computing Sciences (ICTCS), pages 1–6. IEEE, 2019.
  42. Multi-channel embedding convolutional neural network model for arabic sentiment classification. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 18(4):1–23, 2019b.
  43. Hanane Elfaik et al. Deep bidirectional lstm network learning-based sentiment analysis for arabic text. Journal of Intelligent Systems, 30(1):395–412, 2021.

Summary

We haven't generated a summary for this paper yet.