Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model (2403.02647v1)

Published 5 Mar 2024 in cs.CL and cs.AI

Abstract: The task of stock earnings forecasting has received considerable attention due to the demand investors in real-world scenarios. However, compared with financial institutions, it is not easy for ordinary investors to mine factors and analyze news. On the other hand, although LLMs in the financial field can serve users in the form of dialogue robots, it still requires users to have financial knowledge to ask reasonable questions. To serve the user experience, we aim to build an automatic system, FinReport, for ordinary investors to collect information, analyze it, and generate reports after summarizing. Specifically, our FinReport is based on financial news announcements and a multi-factor model to ensure the professionalism of the report. The FinReport consists of three modules: news factorization module, return forecasting module, risk assessment module. The news factorization module involves understanding news information and combining it with stock factors, the return forecasting module aim to analysis the impact of news on market sentiment, and the risk assessment module is adopted to control investment risk. Extensive experiments on real-world datasets have well verified the effectiveness and explainability of our proposed FinReport. Our codes and datasets are available at https://github.com/frinkleko/FinReport.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Semantic dependency graph parsing using tree approximations. In Proceedings of the 11th International Conference on Computational Semantics. 217–227.
  2. Mariana S. C. Almeida and André F. T. Martins. 2015. Lisbon: Evaluating TurboSemanticParser on Multiple Languages and Out-of-Domain Data. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). Association for Computational Linguistics. https://doi.org/10.18653/v1/s15-2162
  3. Financial News Quantization and Stock Market Forecast Research Based on CNN and LSTM. In Lecture Notes in Computer Science. Springer International Publishing, 366–375. https://doi.org/10.1007/978-3-030-05755-8_36
  4. Kinjal Chaudhari and Ankit Thakkar. 2023. Data fusion with factored quantization for stock trend prediction using neural networks. Information Processing & Management 60, 3 (2023), 103293. https://doi.org/10.1016/j.ipm.2023.103293
  5. A. Colin Cameron and Frank A.G. Windmeijer. 1997. An R-squared measure of goodness of fit for some common nonlinear regression models. Journal of Econometrics 77, 2 (1997), 329–342. https://doi.org/10.1016/S0304-4076(96)01818-0
  6. Revisiting Pre-Trained Models for Chinese Natural Language Processing. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.findings-emnlp.58
  7. LERT: A Linguistically-motivated Pre-trained Language Model. (2022). arXiv:2211.05344 [cs.CL]
  8. PERT: Pre-training BERT with Permuted Language Model. (2022). arXiv:2203.06906 [cs.CL]
  9. In Proceedings of the 2019 Conference of the North. Association for Computational Linguistics. https://doi.org/10.18653/v1/n19-1423
  10. Timothy Dozat and Christopher D. Manning. 2018. Simpler but More Accurate Semantic Dependency Parsing. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics. https://doi.org/10.18653/v1/p18-2077
  11. Eugene F. Fama and Kenneth R. French. 2015. A five-factor asset pricing model. Journal of Financial Economics 116, 1 (2015), 1–22. https://doi.org/10.1016/j.jfineco.2014.10.010
  12. Deep learning volatility: a deep neural network perspective on pricing and calibration in (rough) volatility models. Quantitative Finance 21, 1 (2021), 11–27.
  13. Listening to Chaotic Whispers. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. ACM. https://doi.org/10.1145/3159652.3159690
  14. Pricing analysis of wind power derivatives for renewable energy risk management. Applied Energy 304 (2021), 117827.
  15. Machine learning methods for systemic risk analysis in financial sectors. (2019).
  16. Semantic role labeling: an introduction to the special issue. , 145–159 pages.
  17. Harald A Mieg. 2022. Volatility as a transmitter of systemic risk: Is there a structural risk in finance? Risk Analysis 42, 9 (2022), 1952–1964.
  18. MRP 2020: The Second Shared Task on Cross-Framework and Cross-Lingual Meaning Representation Parsing. In Proceedings of the CoNLL 2020 Shared Task: Cross-Framework Meaning Representation Parsing. Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.conll-shared.1
  19. SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). Association for Computational Linguistics. https://doi.org/10.18653/v1/s15-2153
  20. Keyu Pan and Yawen Zeng. 2023. Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models. arXiv:2307.16180 [cs.CL]
  21. Learning Joint Semantic Parsers from Disjoint Data. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics. https://doi.org/10.18653/v1/n18-1135
  22. Energy-based Automated Model Evaluation. arXiv:2401.12689 [cs.LG]
  23. Deep Attentive Learning for Stock Movement Prediction From Social Media Text and Company Correlations. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.emnlp-main.676
  24. Domingo Tavella. 2003. Quantitative methods in derivatives pricing: an introduction to computational finance. John Wiley & Sons.
  25. SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis. https://doi.org/10.48550/ARXIV.2005.05635
  26. Using EGARCH models to predict volatility in unconsolidated financial markets: the case of European carbon allowances. Journal of Environmental Studies and Sciences 13, 3 (May 2023), 500–509. https://doi.org/10.1007/s13412-023-00838-5
  27. Weak Supervision for Fake News Detection via Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence 34, 01 (April 2020), 516–523. https://doi.org/10.1609/aaai.v34i01.5389
  28. Yumo Xu and Shay B. Cohen. 2018. Stock Movement Prediction from Tweets and Historical Prices. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics. https://doi.org/10.18653/v1/p18-1183
  29. Yawen Zeng. 2022. Point Prompt Tuning for Temporally Language Grounding. In SIGIR. 2003–2007.
  30. Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval. In Proceedings of the CVPR. IEEE, 2215–2224.
  31. Keyword-Based Diverse Image Retrieval with Variational Multiple Instance Graph. IEEE Trans. Neural Networks Learn. Syst. (2022).
  32. Contrastive topic-enhanced network for video captioning. Expert Systems with Applications 237 (2024), 121601.
  33. Transition-Based Parsing for Deep Dependency Structures. Computational Linguistics 42, 3 (Sept. 2016), 353–389. https://doi.org/10.1162/coli_a_00252
  34. Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model. In Proceedings of the Fourth Workshop on Financial Technology and Natural Language Processing (FinNLP). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Hybrid), 178–186. https://aclanthology.org/2022.finnlp-1.24
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Xiangyu Li (52 papers)
  2. Xinjie Shen (7 papers)
  3. Yawen Zeng (11 papers)
  4. Xiaofen Xing (29 papers)
  5. Jin Xu (131 papers)
Citations (3)