Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Integrating Stock Features and Global Information via Large Language Models for Enhanced Stock Return Prediction (2310.05627v1)

Published 9 Oct 2023 in cs.CL, cs.LG, and q-fin.ST

Abstract: The remarkable achievements and rapid advancements of LLMs such as ChatGPT and GPT-4 have showcased their immense potential in quantitative investment. Traders can effectively leverage these LLMs to analyze financial news and predict stock returns accurately. However, integrating LLMs into existing quantitative models presents two primary challenges: the insufficient utilization of semantic information embedded within LLMs and the difficulties in aligning the latent information within LLMs with pre-existing quantitative stock features. We propose a novel framework consisting of two components to surmount these challenges. The first component, the Local-Global (LG) model, introduces three distinct strategies for modeling global information. These approaches are grounded respectively on stock features, the capabilities of LLMs, and a hybrid method combining the two paradigms. The second component, Self-Correlated Reinforcement Learning (SCRL), focuses on aligning the embeddings of financial news generated by LLMs with stock features within the same semantic space. By implementing our framework, we have demonstrated superior performance in Rank Information Coefficient and returns, particularly compared to models relying only on stock features in the China A-share market.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. Cross-sectional stock price prediction using deep learning for actual investment management. In Proceedings of the 2020 Asia Service Sciences and Software Engineering Conference, pages 9–15, 2020.
  2. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020.
  3. Leveraging social media news to predict stock index movement using rnn-boost. Data & Knowledge Engineering, 118:14–24, 2018.
  4. Deep learning in asset pricing. Management Science, 2023.
  5. Efficient and effective text encoding for chinese llama and alpaca. arXiv preprint arXiv:2304.08177, 2023.
  6. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  7. Factorvae: A probabilistic dynamic factor model based on variational autoencoder for predicting cross-sectional stock returns. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 4468–4476, 2022.
  8. Common risk factors in the returns on stocks and bonds. Journal of financial economics, 33(1):3–56, 1993.
  9. A five-factor asset pricing model. Journal of financial economics, 116(1):1–22, 2015.
  10. Factor models, machine learning, and asset pricing. Annual Review of Financial Economics, 14:337–368, 2022.
  11. Empirical asset pricing via machine learning. The Review of Financial Studies, 33(5):2223–2273, 2020.
  12. Hisa-smfm: historical and sentiment analysis based stock market forecasting model. arXiv preprint arXiv:2203.08143, 2022.
  13. Listening to chaotic whispers: A deep learning framework for news-oriented stock trend prediction. In Proceedings of the eleventh ACM international conference on web search and data mining, pages 261–269, 2018.
  14. The capital asset pricing model: Some empirical tests. 1972.
  15. Can chatgpt improve investment decision? from a portfolio management perspective. From a Portfolio Management Perspective, 2023.
  16. CSL: A large-scale Chinese scientific literature dataset. In Proceedings of the 29th International Conference on Computational Linguistics, pages 3917–3923, Gyeongju, Republic of Korea, October 2022. International Committee on Computational Linguistics.
  17. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692, 2019.
  18. Can chatgpt forecast stock price movements? return predictability and large language models. arXiv preprint arXiv:2304.07619, 2023.
  19. Ric-nn: A robust transferable deep learning framework for cross-sectional investment strategy. In 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pages 370–379. IEEE, 2020.
  20. et al. Schulman J. Proximal policy optimization algorithms. ArXiv, 1707.06347, 2017.
  21. Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971, 2023.
  22. S_i_lstm: stock price prediction based on multiple data sources and sentiment analysis. Connection Science, 34(1):44–62, 2022.
  23. The wall street neophyte: A zero-shot analysis of chatgpt over multimodal stock movement prediction challenges. arXiv preprint arXiv:2304.05351, 2023.
  24. Cluecorpus2020: A large-scale chinese corpus for pre-training language model. ArXiv, abs/2003.01355, 2020.
  25. Motohiro Yogo. A consumption-based explanation of expected stock returns. The Journal of Finance, 61(2):539–580, 2006.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Yujie Ding (4 papers)
  2. Shuai Jia (11 papers)
  3. Tianyi Ma (22 papers)
  4. Bingcheng Mao (1 paper)
  5. Xiuze Zhou (5 papers)
  6. Liuliu Li (1 paper)
  7. Dongming Han (5 papers)
Citations (3)