Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Semantic Approach to Negation Detection and Word Disambiguation with Natural Language Processing (2302.02291v3)

Published 5 Feb 2023 in cs.CL, cs.AI, and cs.IR

Abstract: This study aims to demonstrate the methods for detecting negations in a sentence by uniquely evaluating the lexical structure of the text via word-sense disambiguation. The proposed framework examines all the unique features in the various expressions within a text to resolve the contextual usage of all tokens and decipher the effect of negation on sentiment analysis. The application of popular expression detectors skips this important step, thereby neglecting the root words caught in the web of negation and making text classification difficult for machine learning and sentiment analysis. This study adopts the NLP approach to discover and antonimize words that were negated for better accuracy in text classification using a knowledge base provided by an NLP library called WordHoard. Early results show that our initial analysis improved on traditional sentiment analysis, which sometimes neglects negations or assigns an inverse polarity score. The SentiWordNet analyzer was improved by 35%, the Vader analyzer by 20% and the TextBlob by 6%.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. AM Abirami and V Gayathri. 2017. A survey on sentiment analysis methods and approach. In 2016 Eighth International Conference on Advanced Computing (ICoAC). IEEE, 72–76.
  2. Shashank Agarwal and Hong Yu. 2010. Biomedical negation scope detection with conditional random fields. Journal of the American medical informatics association 17, 6 (2010), 696–701.
  3. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10).
  4. C Leroy Baker. 1969. Double negatives. Research on Language & Social Interaction 1, 1 (1969), 16–40.
  5. A simple algorithm for identifying negated findings and diseases in discharge summaries. Journal of biomedical informatics 34, 5 (2001), 301–310.
  6. W Timothy Coombs. 2015. The value of communication during a crisis: Insights from strategic communication research. Business horizons 58, 2 (2015), 141–148.
  7. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
  8. Allyson Ettinger. 2020. What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models. Transactions of the Association for Computational Linguistics 8 (2020), 34–48.
  9. Lexicon-enhanced LSTM with attention for general sentiment analysis. IEEE Access 6 (2018), 71884–71891.
  10. Allennlp: A deep semantic natural language processing platform. arXiv preprint arXiv:1803.07640 (2018).
  11. Laurence R. Horn and Heinrich Wansing. 2020. Negation. In The Stanford Encyclopedia of Philosophy (Spring 2020 ed.), Edward N. Zalta (Ed.). Metaphysics Research Lab, Stanford University.
  12. Clayton Hutto and Eric Gilbert. 2014. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the international AAAI conference on web and social media, Vol. 8. 216–225.
  13. Text–To–Speech Synthesis (TTS). International Journal of Research in Information Technology (IJRIT) 2, 5 (2014), 154–163.
  14. Perception Analysis: Pro- and Anti- Vaccine Classification with NLP and Machine Learning. Proceedings of the 55th Hawaii International Conference on System Sciences (2022), 2981–2990.
  15. Yiru Jiao and Qing-Xing Qu. 2019. A proposal for Kansei knowledge extraction method based on natural language processing technology and online product reviews. Computers in Industry 108 (2019), 1–11.
  16. SHEN Jiaxuan. 2010. Division of negatives and noun/verb division in English and Chinese [J]. Studies of the Chinese Language 5 (2010).
  17. johnbumgarner. 2021. wordhoard. https://github.com/johnbumgarner/wordhoard [Online; accessed 30. Nov. 2021].
  18. Improved lexicon-based sentiment analysis for social media analytics. Security Informatics 4, 1 (2015), 1–13.
  19. Aditya Khandelwal and Suraj Sawant. 2019. Negbert: A transfer learning approach for negation detection and scope resolution. arXiv preprint arXiv:1911.04211 (2019).
  20. Joki Kimberly. 2021. Double Negatives: 3 Rules You Must Know. https://www.grammarly.com/blog/3-things-you-must-know-about-double-negatives [Online; accessed 23. Jul. 2021].
  21. BERT Busters: Outlier Dimensions that Disrupt Transformers. arXiv preprint arXiv:2105.06990 (2021).
  22. Xu Liang. 2020. Treat Negation Stopwords Differently According to Your NLP Task. Medium (May 2020). https://towardsdatascience.com/treat-negation-stopwords-differently-according-to-your-nlp-task-e5a59ab7c91f
  23. Fake news detection using machine learning approaches: A systematic review. In 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI). IEEE, 230–234.
  24. Andreas C Müller and Sarah Guido. 2016. Introduction to machine learning with Python: a guide for data scientists. ” O’Reilly Media, Inc.”.
  25. Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS. Journal of the American Medical Informatics Association 8, 6 (2001), 598–609.
  26. Luis Alonso Ovalle and Elena Guerzoni. 2004. Double negatives, negative concord and metalinguistic negation. In CLS 38.1: The main session. Proceedings from the main session of the 38th meeting of the Chicago Linguistic Society. Citeseer, 15–31.
  27. Madhura Mandar Phadke and Satish R Devane. 2017. Multilingual Machine translation: An analytical study. In 2017 International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE, 881–884.
  28. Ellen Quilty. 2019. University Library: AMA Referencing (Vancouver): Internet & Social Media. (2019).
  29. Vanesa del Río Zamora et al. 2015. Comparative Study of the Use of Double Negatives by Native English Speakers and Spanish Learners of English. (2015).
  30. Negation recognition in medical narrative reports. Information Retrieval 11, 6 (2008), 499–538.
  31. Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports. Automation in Construction 62 (2016), 45–56.
  32. Exploring transformers in natural language generation: Gpt, bert, and xlnet. arXiv preprint arXiv:2102.08036 (2021).
  33. Analysis of Evaluated Sentiments; a Pseudo-Linguistic Approach and Online Acceptability Index for Decision-Making with Data: Nigerian Election in View. Computing 7, 2 (2019), 39–44.
  34. Natural language processing to the rescue? extracting” situational awareness” tweets during mass emergency. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 5.
  35. Jonathan J Webster and Chunyu Kit. 1992. Tokenization as the initial phase in NLP. In COLING 1992 Volume 4: The 14th International Conference on Computational Linguistics.
  36. Dominic Widdows. 2003. Orthogonal negation in vector spaces for modelling word-meanings and document retrieval. In Proceedings of the 41st annual meeting of the association for computational linguistics. 136–143.
  37. Sam Wiseman and Karl Stratos. 2019. Label-agnostic sequence labeling by copying nearest neighbors. arXiv preprint arXiv:1906.04225 (2019).
  38. Refining word embeddings using intensity scores for sentiment analysis. IEEE/ACM Transactions on Audio, Speech, and Language Processing 26, 3 (2017), 671–681.
Citations (2)

Summary

We haven't generated a summary for this paper yet.