Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
175 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Can Large Language Models Beat Wall Street? Unveiling the Potential of AI in Stock Selection (2401.03737v2)

Published 8 Jan 2024 in q-fin.CP, cs.AI, cs.CE, cs.CL, and cs.LG

Abstract: This paper introduces MarketSenseAI, an innovative framework leveraging GPT-4's advanced reasoning for selecting stocks in financial markets. By integrating Chain of Thought and In-Context Learning, MarketSenseAI analyzes diverse data sources, including market trends, news, fundamentals, and macroeconomic factors, to emulate expert investment decision-making. The development, implementation, and validation of the framework are elaborately discussed, underscoring its capability to generate actionable and interpretable investment signals. A notable feature of this work is employing GPT-4 both as a predictive mechanism and signal evaluator, revealing the significant impact of the AI-generated explanations on signal accuracy, reliability and acceptance. Through empirical testing on the competitive S&P 100 stocks over a 15-month period, MarketSenseAI demonstrated exceptional performance, delivering excess alpha of 10% to 30% and achieving a cumulative return of up to 72% over the period, while maintaining a risk profile comparable to the broader market. Our findings highlight the transformative potential of LLMs in financial decision-making, marking a significant leap in integrating generative AI into financial analytics and investment strategies.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. Cognitive network science reveals bias in gpt-3, gpt-3.5 turbo, and gpt-4 mirroring math anxiety in high-school students. Big Data and Cognitive Computing 7, 124.
  2. Impact of news-based equity market volatility on international stock markets. Journal of Applied Economics 23, 224–234.
  3. Harnessing the power of chatgpt for automating systematic review process: Methodology, case study, limitations, and future directions. Systems 11, 351.
  4. The role of reddit in the gamestop short squeeze. Economics Letters 211, 110249.
  5. Finbert: Financial sentiment analysis with pre-trained language models. Preprint at https://arxiv.org/abs/1908.10063.
  6. Cognitive biases in natural language: Automatically detecting, differentiating, and measuring bias in text. doi:10.13140/RG.2.2.14044.56967.
  7. BIS, 2022. Market dysfunction and central bank tools. https://www.bis.org/publ/mc_insights.pdf.AccessedSeptember28,2023.
  8. Bloomberg, 2019. What’s an “algo wheel?” and why should you care? — bloomberg professional services. https://www.bloomberg.com/professional/blog/whats-algo-wheel-care/. Accessed September 24, 2023.
  9. Fluctuations and response in financial markets: the subtle nature ofrandom’price changes. Quantitative finance 4, 176.
  10. How does zero-day-to-expiry options trading affect the volatility of underlying assets? Available at SSRN: https://ssrn.com/abstract=4426358 or http://dx.doi.org/10.2139/ssrn.4426358.
  11. Can gpt models be financial analysts? an evaluation of chatgpt and gpt-4 on mock cfa exams. Preprint at https://arxiv.org/abs/2310.08678.
  12. Langchain. https://github.com/langchain-ai/langchain. Accessed December 29, 2023.
  13. The Economic Potential of Generative AI: The Next Productivity Frontier. Technical Report. McKinsey & Company. https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier. Accessed September 24, 2023.
  14. CNBC, 2023. Jpmorgan ai investment advisor. https://www.cnbc.com/2023/05/25/jpmorgan-develops-ai-investment-advisor.html. Accessed September 24, 2023.
  15. A survey for in-context learning. Preprint at https://arxiv.org/abs/2301.00234.
  16. Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy. Statistical science 1, 54–75.
  17. Deepvar: a framework for portfolio risk assessment leveraging probabilistic deep neural networks. Digital finance 5, 29–56.
  18. Transforming sentiment analysis in the financial domain with chatgpt. Machine Learning with Applications 14, 100508.
  19. Can anybody beat the market? https://www.investopedia.com/ask/answers/12/beating-the-market.asp, Accessed January 01, 2024.
  20. Passive investing and market liquidity. The Review of Financial Studies 28, 2167–2203.
  21. Value investing: from Graham to Buffett and beyond. John Wiley & Sons.
  22. How close is chatgpt to human experts? comparison corpus, evaluation, and detection. Preprint at https://arxiv.org/abs/2301.07597.
  23. Financial institutions, markets, and money. John Wiley & Sons.
  24. Bloated disclosures: Can chatgpt help investors process information? Available at SSRN: https://ssrn.com/abstract=4425527 or http://dx.doi.org/10.2139/ssrn.4425527.
  25. Drawdown measures: Are they all the same? The Journal of Portfolio Management 48, 104–120.
  26. Deep learning enhancing banking services: a hybrid transaction classification and cash flow prediction approach. Journal of big Data 9, 100.
  27. Momentum and autocorrelation in stock returns. The Review of Financial Studies 15, 533–564.
  28. Are chatgpt and gpt-4 general-purpose solvers for financial text analytics? an examination on several typical tasks. Preprint at https://arxiv.org/abs/2305.05862.
  29. Gpteval: Nlg evaluation using gpt-4 with better human alignment. Preprint at https://arxiv.org/abs/2303.16634.
  30. Can chatgpt forecast stock price movements? return predictability and large language models. Preprint at https://arxiv.org/abs/2304.07619.
  31. Estimating the impact of good news on stock market volatility. Applied Financial Economics 21, 545–554.
  32. The efficient market hypothesis and its critics. Journal of economic perspectives 17, 59–82.
  33. Marketdigest. https://www.km3am.com/2023/03/13/marketdigest-new-ai-powered-tool-for-wealth-management-insights/. Accessed September 24, 2023.
  34. Experimental evidence on the productivity effects of generative artificial intelligence. Science 381, 187–192. URL: https://www.science.org/doi/abs/10.1126/science.adh2586, doi:10.1126/science.adh2586.
  35. OECD, 2021. Artificial intelligence, machine learning and big data in finance: Opportunities, challenges, and implications for policy makers. https://www.oecd.org/finance/financial-markets/Artificial-intelligence-machine-learning-big-data-in-finance.pdf. Accessed September 24, 2023.
  36. OpenAI, 2023. Gpt-4 technical report. arXiv:2303.08774. preprint at https://arxiv.org/abs/2303.08774.
  37. Fusion-eval: Integrating evaluators with llms. Preprint at https://arxiv.org/abs/2311.09204.
  38. Mpnet: Masked and permuted pre-training for language understanding. Advances in Neural Information Processing Systems 33, 16857–16867.
  39. More than words: Quantifying language to measure firms’ fundamentals. The journal of finance 63, 1437–1467.
  40. Magnificent 7 stocks: What you need to know. https://www.investopedia.com/magnificent-seven-stocks-8402262, Accessed January 01, 2024.
  41. Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems 35, 24824–24837.
  42. Behavioral biases in pension fund trustees’ decision making. Review of Behavioral Finance 11, 128–143.
  43. Bloomberggpt: A large language model for finance. Preprint at https://arxiv.org/abs/2303.17564.
  44. Temporal data meets llm–explainable financial time series forecasting. Preprint at https://arxiv.org/abs/2306.11025.
  45. Chatgpt: Unlocking the future of nlp in finance. Available at SSRN: https://ssrn.com/abstract=4323643 or http://dx.doi.org/10.2139/ssrn.4323643.
Citations (11)

Summary

  • The paper introduces MarketSenseAI, a framework that integrates GPT-4 reasoning with multi-modal financial data to emulate expert stock selection.
  • It employs Chain of Thought and zero-shot prompting to analyze news, fundamentals, price dynamics, and macroeconomic trends for generating investment insights.
  • Empirical tests on S&P 100 data yielded up to 72% returns and an excess alpha of 10-30%, highlighting LLMs' potential in risk-adjusted performance.

Can LLMs Beat Wall Street? Unveiling the Potential of AI in Stock Selection

The research paper under consideration, titled "Can LLMs Beat Wall Street? Unveiling the Potential of AI in Stock Selection," presents MarketSenseAI, a framework that leverages GPT-4's reasoning capabilities to offer a sophisticated approach to stock selection in financial markets. This work provides a comprehensive integration of LLMs with multi-modal financial data sources, illustrating the application of generative AI in financial analytics.

Overview and Methodology

MarketSenseAI aims to emulate expert investment decision-making by integrating techniques such as Chain of Thought and In-Context Learning. It processes various data points like market trends, news, company fundamentals, and macroeconomic indicators to generate interpretable investment signals. A distinctive aspect of this framework is the dual role of GPT-4, serving as both a predictive mechanism and a signal evaluator. This dual functionality is supported with zero-shot prompting, allowing MarketSenseAI to extract relevant insights without specific fine-tuning on financial data.

The framework consists of four core components, each providing key inputs for the final investment recommendation:

  1. Progressive News Summarizer: Analyzes and synthesizes stock-specific news, highlighting important developments over time.
  2. Fundamentals Summarizer: Compares a company's recent quarterly financial reports to assess its financial health.
  3. Stock Price Dynamics Summarizer: Evaluates a stock's performance relative to its peers and the broader market.
  4. Macroeconomic Environment Summary: Provides insights into macroeconomic trends, using multiple expert reports to synthesize a collective viewpoint.

Empirical Evaluation and Findings

MarketSenseAI's empirical evaluation applied the model to the S&P 100 index over a 15-month period. Significantly, this LLM-driven strategy yielded an excess alpha of 10% to 30%, achieving cumulative returns of up to 72%. Such results underscore the potential of LLMs in achieving risk-adjusted performance improvements comparable to established market indices.

The research introduces robust quantitative metrics in its evaluation, like the Sharpe and Sortino ratios, facilitating a nuanced understanding of risk-adjusted returns. MarketSenseAI's ability to outperform traditional equally-weighted and market cap-weighted portfolios underscores the effectiveness of integrating qualitative inputs with quantitative assessments in investment strategies.

Implications and Future Directions

This research highlights the transformative role of LLMs in financial markets, showcasing the practical and theoretical potential of AI in stock selection. MarketSenseAI demonstrates that the strategic use of LLMs can extend beyond mere data parsing to involve complex reasoning akin to human financial analysts. This development can significantly democratize access to sophisticated financial insights, offering potential benefits to retail investors and asset managers who lack extensive resources.

The theoretical implications suggest that integrating domain-specific expertise into LLMs can yield tangible advancements in financial decision-making processes. Practically, the modular architecture of MarketSenseAI indicates its versatility, allowing components to be adapted for various applications within the financial sector.

As AI capabilities continue to evolve, future directions for similar research include refining sentiment analysis methodologies, exploring the effects of higher-frequency trading strategies, and understanding the evolving behavioral dynamics introduced by AI participation in financial markets. The research opens avenues for integrating AI's cognitive strengths with the domain-specific intricacies of financial analysis, marking a pivotal step toward the enhanced capability of LLMs in complex decision-making tasks.