Targeted aspect-based emotion analysis to detect opportunities and precaution in financial Twitter messages (2404.08665v1)
Abstract: Microblogging platforms, of which Twitter is a representative example, are valuable information sources for market screening and financial models. In them, users voluntarily provide relevant information, including educated knowledge on investments, reacting to the state of the stock markets in real-time and, often, influencing this state. We are interested in the user forecasts in financial, social media messages expressing opportunities and precautions about assets. We propose a novel Targeted Aspect-Based Emotion Analysis (TABEA) system that can individually discern the financial emotions (positive and negative forecasts) on the different stock market assets in the same tweet (instead of making an overall guess about that whole tweet). It is based on NLP techniques and Machine Learning streaming algorithms. The system comprises a constituency parsing module for parsing the tweets and splitting them into simpler declarative clauses; an offline data processing module to engineer textual, numerical and categorical features and analyse and select them based on their relevance; and a stream classification module to continuously process tweets on-the-fly. Experimental results on a labelled data set endorse our solution. It achieves over 90% precision for the target emotions, financial opportunity, and precaution on Twitter. To the best of our knowledge, no prior work in the literature has addressed this problem despite its practical interest in decision-making, and we are not aware of any previous NLP nor online Machine Learning approaches to TABEA.
- Emotional trading in the cryptocurrency market. Finance Research Letters, 42, 101912. doi:10.1016/j.frl.2020.101912.
- Predictors of investment intention in Indian stock markets. International Journal of Bank Marketing, 37, 97–119. doi:10.1108/IJBM-08-2017-0167.
- How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble. IEEE Computational Intelligence Magazine, 15, 64–75. doi:10.1109/MCI.2019.2954667.
- Enhancing Aspect-Based Sentiment Analysis of Arabic hotels’ reviews using morphological, syntactic and semantic features. Information Processing & Management, 56, 308–319. doi:10.1016/j.ipm.2018.01.006.
- Sentiment classification and aspect-based sentiment analysis on yelp reviews using deep learning and word embeddings. Journal of Decision Systems, 30, 259–281. doi:10.1080/12460125.2020.1864106.
- Feature engineering for sentiment analysis in e-health forums. PLOS ONE, 13, e0207996. doi:10.1371/journal.pone.0207996.
- Aspect-Based Sentiment Analysis With Heterogeneous Graph Neural Network. IEEE Transactions on Computational Social Systems, (pp. 1–10). doi:10.1109/TCSS.2022.3148866.
- Comparing pre-trained language models for Spanish hate speech detection. Expert Systems with Applications, 166, 114120. doi:10.1016/j.eswa.2020.114120.
- Classification of sentence level sentiment analysis using cloud machine learning techniques. Cluster Computing, 22, 1199–1209. doi:10.1007/s10586-017-1200-1.
- Handling Concept Drift for Predictions in Business Process Mining. In Procceedings of the IEEE Conference on Business Informatics (pp. 76–83). IEEE volume 1. doi:10.1109/CBI49978.2020.00016.
- Opinion mining and emotion recognition applied to learning environments. Expert Systems with Applications, 150, 113265. doi:10.1016/j.eswa.2020.113265.
- Estimating large losses in insurance analytics and operational risk using the g-and-h distribution. Quantitative Finance, 21, 1207–1221. doi:10.1080/14697688.2020.1849778.
- Enhanced Hoeffding Anytime Tree: A Real-time Algorithm for Early Prediction of Heart Disease. International Journal on Artificial Intelligence Tools, 30, 2150010. doi:10.1142/S021821302150010X.
- Berrar, D. (2019). Bayes’ Theorem and Naive Bayes Classifier. In Encyclopedia of Bioinformatics and Computational Biology (pp. 403–412). Elsevier volume 1-3. doi:10.1016/B978-0-12-809633-8.20473-1.
- Senticnet 7: a commonsense-based neurosymbolic AI framework for explainable sentiment analysis. In Proceedings of The International Conference on Language Resources and Evaluation. European Language Resources Association.
- Forecasting stock market crisis events using Deep and statistical Machine Learning techniques. Expert Systems with Applications, 112, 353–371. doi:10.1016/j.eswa.2018.06.032.
- Tracking and recognizing emotions in short text messages from online chatting services. Information Processing & Management, 54, 1325–1344. doi:10.1016/j.ipm.2018.05.008.
- Using Deep Learning to Develop a Stock Price Prediction Model Based on Individual Investor Emotions. Journal of Behavioral Finance, 22, 480–489. doi:10.1080/15427560.2020.1821686.
- Dissecting emotion and user influence in social media communities: An interaction modeling approach. Information & Management, 57, 103108. doi:10.1016/j.im.2018.09.008.
- Recommendation system exploiting aspect-based opinion mining with deep learning method. Information Sciences, 512, 1279–1292. doi:10.1016/j.ins.2019.10.038.
- Detection of Financial Opportunities in Micro-Blogging Data With a Stacked Classification System. IEEE Access, 8, 215679–215690. doi:10.1109/ACCESS.2020.3041084.
- Analyzing DistilBERT for Sentiment Classification of Banking Financial News. Lecture Notes in Networks and Systems, 248, 501–510. doi:10.1007/978-981-16-3153-5_53.
- FineNews: fine-grained semantic sentiment analysis on financial microblogs and news. International Journal of Machine Learning and Cybernetics, 10, 2199–2207. doi:10.1007/s13042-018-0805-x.
- How emotions influence behavior in financial markets: a conceptual analysis and emotion-based account of buy-sell preferences. The European Journal of Finance, 26, 1417–1438. doi:10.1080/1351847X.2020.1742758.
- Hotel Arabic-Reviews Dataset Construction for Sentiment Analysis Applications. In Studies in Computational Intelligence (pp. 35–52). Springer. doi:10.1007/978-3-319-67056-0_3.
- Creating Emoji Lexica from Unsupervised Sentiment Analysis of their Descriptions. Expert Systems with Applications, 103, 74–91. doi:10.1016/j.eswa.2018.02.043.
- Gao, B. (2021). The Use of Machine Learning Combined with Data Mining Technology in Financial Risk Prevention. Computational Economics, 1, 1–21. doi:10.1007/s10614-021-10101-0.
- Automatic Natural Language Generation Applied to Alternative and Augmentative Communication for Online Video Content Services using SimpleNLG for Spanish. In Proceedings of the Internet of Accessible Things (pp. 1–4). ACM. doi:10.1145/3192714.3192837.
- A library for automatic Natural Language Generation of Spanish texts. Expert Systems with Applications, 120, 372–386. doi:10.1016/j.eswa.2018.11.036.
- Semantics of the Black-Box: Can Knowledge Graphs Help Make Deep Learning Systems More Interpretable and Explainable? IEEE Internet Computing, 25, 51–59. doi:10.1109/MIC.2020.3031769.
- Beyond negative and positive: Exploring the effects of emotions in social media during the stock market crash. Information Processing & Management, 57, 102218. doi:10.1016/j.ipm.2020.102218.
- Guo, J. (2022). Deep learning approach to text analysis for human emotion detection from big data. Journal of Intelligent Systems, 31, 113–126. doi:10.1515/jisys-2022-0001.
- A survey on classification techniques for opinion mining and sentiment analysis. Artificial Intelligence Review, 52, 1495–1545. doi:10.1007/s10462-017-9599-6.
- What predicts student satisfaction with MOOCs: A gradient boosting trees supervised machine learning and sentiment analysis approach. Computers & Education, 145, 103724. doi:10.1016/j.compedu.2019.103724.
- Leveraging Social Media to Predict Continuation and Reversal in Asset Prices. Computational Economics, 57, 433–453. doi:10.1007/s10614-019-09932-9.
- word2set: WordNet-Based Word Representation Rivaling Neural Word Embedding for Lexical Similarity and Sentiment Analysis. IEEE Computational Intelligence Magazine, 14, 41–53. doi:10.1109/MCI.2019.2901085.
- Opinion mining using ensemble text hidden Markov models for text classification. Expert Systems with Applications, 94, 218–227. doi:10.1016/j.eswa.2017.07.019.
- Kaur, C. (2020). Sentiment Analysis of Tweets on Social Issues using Machine Learning Approach. International Journal of Advanced Trends in Computer Science and Engineering, 9, 6303–6311. doi:10.30534/ijatcse/2020/310942020.
- Stock market prediction using machine learning classifiers and social media news. Journal of Ambient Intelligence and Humanized Computing, 1, 1–24. doi:10.1007/s12652-020-01839-w.
- Toward assessing clinical trial publications for reporting transparency. Journal of Biomedical Informatics, 116, 103717–103727. doi:10.1016/j.jbi.2021.103717.
- Predicting Ethereum prices with machine learning based on blockchain information. Expert Systems with Applications, 184, 115480. doi:10.1016/j.eswa.2021.115480.
- Online Anomaly Detection Using Statistical Leverage for Streaming Business Process Events. Lecture Notes in Business Information Processing, 406, 193–205. doi:10.1007/978-3-030-72693-5_15.
- Keeping our rivers clean: Information-theoretic online anomaly detection for streaming business process events. Information Systems, 104, 101894. doi:10.1016/j.is.2021.101894.
- Krippendorff, K. (2018). Content Analysis: An Introduction to its Methodology. SAGE Publications. doi:10.1177/1094428108324513.
- Predicting Business Risks of Commercial Banks Based on BP-GA Optimized Model. Computational Economics, 1, 1–19. doi:10.1007/s10614-020-10088-0.
- More than just Noise? Examining the Information Content of Stock Microblogs on Financial Markets. Journal of Information Technology, 33, 50–69. doi:10.1057/s41265-016-0034-2.
- DWWP: Domain-specific new words detection and word propagation system for sentiment analysis in the tourism domain. Knowledge-Based Systems, 146, 203–214. doi:10.1016/j.knosys.2018.02.004.
- Stock index prediction based on wavelet transform and FCD‐MLGRU. Journal of Forecasting, 39, 1229–1237. doi:10.1002/for.2682.
- Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks. Knowledge-Based Systems, 235, 107643. doi:10.1016/j.knosys.2021.107643.
- An improved aspect-category sentiment analysis model for text sentiment analysis based on RoBERTa. Applied Intelligence, 51, 3522–3533. doi:10.1007/s10489-020-01964-1.
- GSMNet: Global Semantic Memory Network for Aspect-Level Sentiment Classification. IEEE Intelligent Systems, 36, 122–130. doi:10.1109/MIS.2020.3042253.
- The concept of neutrality: a new approach. Journal of Documentation, 76, 333–353. doi:10.1108/JD-05-2019-0102.
- A Hybrid Multilingual Fuzzy-Based Approach to the Sentiment Analysis Problem Using SentiWordNet. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 28, 361–390. doi:10.1142/S0218488520500154.
- Madhu, S. (2018). An approach to analyze suicidal tendency in blogs and tweets using Sentiment Analysis. International Journal of Scientific Research in Computer Science and Engineering, 6, 34–36. doi:10.26438/ijsrcse/v6i4.3436.
- How Does Social Media Impact Bitcoin Value? A Test of the Silent Majority Hypothesis. Journal of Management Information Systems, 35, 19–52. doi:10.1080/07421222.2018.1440774.
- Native language identification with classifier stacking and ensembles. Computational Linguistics, 44, 403–446. doi:10.1162/coli_a_00323.
- Emotion Analysis and Dialogue Breakdown Detection in Dialogue of Chat Systems Based on Deep Neural Networks. Electronics, 11, 695. doi:10.3390/electronics11050695.
- Mehlig, B. (2021). Stochastic Gradient Descent. In Machine Learning with Neural Networks (pp. 96–113). Cambridge University Press volume 1. doi:10.1017/9781108860604.006.
- Spam comments prediction using stacking with ensemble learning. Journal of Physics, 933, 012012. doi:10.1088/1742-6596/933/1/012012.
- Analysis of concept drift in fake reviews detection. Expert Systems with Applications, 169, 114318. doi:10.1016/j.eswa.2020.114318.
- Scikit-multiflow: A multi-output streaming framework. Journal of Machine Learning Research, 19, 1–5. doi:10.5555/3291125.3309634.
- Aspect-based sentiment analysis using adaptive aspect-based lexicons. Expert Systems with Applications, 148, 113234. doi:10.1016/j.eswa.2020.113234.
- A Lexical and Machine Learning-Based Hybrid System for Sentiment Analysis. In Studies in Computational Intelligence (pp. 165–175). Springer. doi:10.1007/978-981-10-4555-4_11.
- A Comparative Analysis of Machine Learning Techniques for Cyberbullying Detection on Twitter. Future Internet, 12, 187. doi:10.3390/fi12110187.
- Classifying streaming of Twitter data based on sentiment analysis using hybridization. Neural Computing and Applications, 31, 1425–1433. doi:10.1007/s00521-018-3476-3.
- A review on sentiment analysis and emotion detection from text. Social Network Analysis and Mining, 11, 81. doi:10.1007/s13278-021-00776-6.
- Word2Vec for Indonesian Sentiment Analysis towards Hotel Reviews: An Evaluation Study. Procedia Computer Science, 157, 360–366. doi:10.1016/j.procs.2019.08.178.
- An analytical approach for big social data analysis for customer decision-making in eco-friendly hotels. Expert Systems with Applications, 186, 115722. doi:10.1016/j.eswa.2021.115722.
- Aspect Based Sentiment Analysis for Restaurant Reviews Using Hybrid ELMoWikipedia and Hybrid Expanded Opinion Lexicon-SentiCircle. International Journal of Intelligent Engineering and Systems, 12, 47–58. doi:10.22266/ijies2019.1231.05.
- Ortiz-Martínez, D. (2016). Online learning for statistical machine translation. Computational Linguistics, 42, 121–161. doi:10.1162/COLI_a_00244.
- A new topic modeling based approach for aspect extraction in aspect based sentiment analysis: SS-LDA. Expert Systems with Applications, 168, 114231. doi:10.1016/j.eswa.2020.114231.
- Predicting Vehicle Sales by Sentiment Analysis of Twitter Data and Stock Market Values. IEEE Access, 6, 57655–57662. doi:10.1109/ACCESS.2018.2873730.
- A Review on Random Forest: An Ensemble Classifier. Lecture Notes on Data Engineering and Communications Technologies, 26, 758–763. doi:10.1007/978-3-030-03146-6_86.
- Cooperative gating network based on a single BERT encoder for aspect term sentiment analysis. Applied Intelligence, 52, 5867–5879. doi:10.1007/s10489-021-02724-5.
- Convolutional attention neural network over graph structures for improving the performance of aspect-level sentiment analysis. Information Sciences, 589, 416–439. doi:10.1016/j.ins.2021.12.127.
- Plutchik, R. (2004). The circumplex as a general model of the structure of emotions and personality. American Psychological Association. doi:10.1037/10261-001.
- Towards Emotion-aware Recommender Systems: an Affective Coherence Model based on Emotion-driven Behaviors. Expert Systems with Applications, 170, 114382. doi:10.1016/j.eswa.2020.114382.
- Comparative Evaluation of the Supervised Machine Learning Classification Methods and the Concept Drift Detection Methods in the Financial Business Problems. Lecture Notes in Business Information Processing, 417, 268–292. doi:10.1007/978-3-030-75418-1_13.
- Text Mining: Use of TF-IDF to Examine the Relevance of Words to Documents. International Journal of Computer Applications, 181, 25–29. doi:10.5120/ijca2018917395.
- Assessing the efficacy of a manual‐based intervention for improving the detection of facial pain expression. European Journal of Pain, 23, 1006–1019. doi:10.1002/ejp.1369.
- Investment Decisions Based on EEG Emotion Recognition. Advanced Science Letters, 23, 11345–11349. doi:10.1166/asl.2017.10280.
- A CNN-BiLSTM Model for Document-Level Sentiment Analysis. Machine Learning and Knowledge Extraction, 1, 832–847. doi:10.3390/make1030048.
- COVID-19 adopted hybrid and parallel deep information fusion framework for stock price movement prediction. Expert Systems with Applications, 187, 115879. doi:10.1016/j.eswa.2021.115879.
- A model for sentiment and emotion analysis of unstructured social media text. Electronic Commerce Research, 18, 181–199. doi:10.1007/s10660-017-9257-8.
- Online Hate Ratings Vary by Extremes. In Proceedings of the Conference on Human Information Interaction and Retrieval (pp. 213–217). Association for Computational Linguistics. doi:10.1145/3295750.3298954.
- Development and accuracy of an artificial intelligence algorithm for acne grading from smartphone photographs. Experimental Dermatology, 28, 1252–1257. doi:10.1111/exd.14022.
- Drift-based approach for evolving data stream classification in intrusion detection system. In Proceedings of the Workshop on Computer Networks & Communications (pp. 23–30). CEUR volume 2889.
- Novel dual-channel long short-term memory compressed capsule networks for emotion recognition. Expert Systems with Applications, 188, 116080. doi:10.1016/j.eswa.2021.116080.
- Targeting Prospective Customers: Robustness of Machine-Learning Methods to Typical Data Challenges. Management Science, 66, 1–43. doi:10.1287/mnsc.2019.3308.
- Big Data: Deep Learning for financial sentiment analysis. Journal of Big Data, 5, 3. doi:10.1186/s40537-017-0111-6.
- Taffler, R. (2018). Emotional finance: investment and the unconscious. The European Journal of Finance, 24, 630–653. doi:10.1080/1351847X.2017.1369445.
- An AI-Enabled Stock Prediction Platform Combining News and Social Sensing with Financial Statements. Future Internet, 13, 138. doi:10.3390/fi13060138.
- Decision tree classifiers for evidential attribute values and class labels. Fuzzy Sets and Systems, 366, 46–62. doi:10.1016/j.fss.2018.11.006.
- Pachinko Prediction: A Bayesian method for event prediction from social media data. Information Processing & Management, 57, 102147. doi:10.1016/j.ipm.2019.102147. arXiv:1809.08427.
- Continuous learning from human post-edits for neural machine translation. The Prague Bulletin of Mathematical Linguistics, 108, 233--244. doi:10.1515/pralin-2017-0023.
- Consensus vote models for detecting and filtering neutrality in sentiment analysis. Information Fusion, 44, 126--135. doi:10.1016/j.inffus.2018.03.007.
- Does Twitter Affect Stock Market Decisions? Financial Sentiment Analysis During Pandemics: A Comparative Study of the H1N1 and the COVID-19 Periods. Cognitive Computation, 14, 372--387. doi:10.1007/S12559-021-09819-8/TABLES/17.
- Harshness-aware sentiment mining framework for product review. Expert Systems with Applications, 187, 115887. doi:10.1016/j.eswa.2021.115887.
- Wang, Y. (2017). Stock Market Forecasting with Financial Micro-blog Based on Sentiment and Time Series Analysis. Journal of Shanghai Jiaotong University, 22, 173--179. doi:10.1007/s12204-017-1818-4.
- Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection. Future Internet, 11, 155. doi:10.3390/fi11070155.
- Multi-Level Fine-Scaled Sentiment Sensing with Ambivalence Handling. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 28, 683--697. doi:10.1142/S0218488520500294.
- Aspect-Based Extraction and Analysis of Affective Knowledge from Social Media Streams. IEEE Intelligent Systems, 32, 80--88. doi:10.1109/MIS.2017.57.
- Exploring fine-grained syntactic information for aspect-based sentiment classification with dual graph neural networks. Neurocomputing, 471, 48--59. doi:10.1016/j.neucom.2021.10.091.
- Mining Emotions of the Public from Social Media for Enhancing Corporate Credit Rating. In Proceedings of the IEEE International Conference on e-Business Engineering (pp. 25--30). IEEE. doi:10.1109/ICEBE.2018.00015.
- Topic sentiment mining for sales performance prediction in e-commerce. Annals of Operations Research, 270, 553--576. doi:10.1007/s10479-017-2421-7.
- A survey of sentiment analysis in social media. Knowledge and Information Systems, 60, 617--663. doi:10.1007/s10115-018-1236-4.
- Aggregated graph convolutional networks for aspect-based sentiment classification. Information Sciences, 600, 73--93. doi:10.1016/j.ins.2022.03.082.
- Tales of emotion and stock in China: volatility, causality and prediction. World Wide Web, 21, 1093--1116. doi:10.1007/s11280-017-0495-4.