A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist (2402.18485v3)

Published 28 Feb 2024 in q-fin.TR and cs.AI

Abstract: Financial trading is a crucial component of the markets. It is informed by a multimodal information landscape of news, prices, and Kline charts, and spans diverse tasks such as quantitative trading and high-frequency trading across various assets. While advanced AI techniques like deep learning and reinforcement learning are extensively utilized in finance, their application to financial trading tasks often faces challenges due to inadequate handling of multimodal data and limited generalizability across tasks. To address these challenges, we present FinAgent, a multimodal foundational agent with tool augmentation for financial trading. FinAgent's market intelligence module processes a diverse range of data (numerical, textual, and visual) to accurately analyze the financial market. Its dual-level reflection module not only enables rapid adaptation to market dynamics but also incorporates a diversified memory retrieval system, enhancing the agent's ability to learn from historical data and improve its decision-making. The agent's emphasis on reasoning for actions fosters trust in its financial decisions. Moreover, FinAgent integrates established trading strategies and expert insights, ensuring that its trading approaches are both data-driven and rooted in sound financial principles. In comprehensive experiments on 6 financial datasets, including stocks and crypto, FinAgent significantly outperforms 9 state-of-the-art baselines on 6 financial metrics, with over 36% average improvement in profit. Specifically, a 92.27% return (an 84.39% relative improvement) is achieved on one dataset. Notably, FinAgent is the first advanced multimodal foundation agent designed for financial trading tasks.
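
The relative-improvement figure in the abstract can be unpacked with a little arithmetic. The sketch below assumes that "relative improvement" means (agent return − baseline return) / baseline return, which the abstract does not state explicitly. The function name `implied_baseline` is hypothetical, and the baseline value it produces is inferred under that assumption, not a number reported in the abstract.

```python
def implied_baseline(agent_return_pct: float, relative_improvement_pct: float) -> float:
    """Back out the baseline return implied by a relative improvement.

    Assumes: improvement = (agent - baseline) / baseline,
    so baseline = agent / (1 + improvement).
    """
    return agent_return_pct / (1.0 + relative_improvement_pct / 100.0)


# Figures quoted in the abstract: 92.27% return, 84.39% relative improvement.
baseline = implied_baseline(92.27, 84.39)
print(f"Implied baseline return: {baseline:.2f}%")
```

Under this reading, the comparison point would be a baseline return of roughly 50% on the same dataset; a different definition of relative improvement would give a different figure.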

Authors (13)
  1. Wentao Zhang
  2. Lingxuan Zhao
  3. Haochong Xia
  4. Shuo Sun
  5. Jiaze Sun
  6. Molei Qin
  7. Xinyi Li
  8. Yuqing Zhao
  9. Yilei Zhao
  10. Xinyu Cai
  11. Longtao Zheng
  12. Xinrun Wang
  13. Bo An