Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Pre-trained Large Language Models for Financial Sentiment Analysis (2401.05215v1)

Published 10 Jan 2024 in cs.CL and cs.AI

Abstract: Financial sentiment analysis refers to classifying financial text contents into sentiment categories (e.g. positive, negative, and neutral). In this paper, we focus on the classification of financial news title, which is a challenging task due to a lack of large amount of training samples. To overcome this difficulty, we propose to adapt the pretrained LLMs [1, 2, 3] to solve this problem. The LLMs, which are trained from huge amount of text corpora,have an advantage in text understanding and can be effectively adapted to domain-specific task while requiring very few amount of training samples. In particular, we adapt the open-source Llama2-7B model (2023) with the supervised fine-tuning (SFT) technique [4]. Experimental evaluation shows that even with the 7B model (which is relatively small for LLMs), our approach significantly outperforms the previous state-of-the-art algorithms.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. Improving language understanding by generative pre-training. 2018.
  2. Language models are unsupervised multitask learners. OpenAI Blog, 1(8):9, 2019.
  3. Bert: Pre-training of deep bidirectional transformers for language understanding, 2019.
  4. Llama: Open and efficient foundation language models, 2023.
  5. Bing Liu. Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. Studies in Natural Language Processing. Cambridge University Press, 2 edition, 2020.
  6. Bo Pang and Lillian Lee. Opinion Mining and Sentiment Analysis, volume 2. 2008.
  7. Sentiment analysis of twitter data. In Proceedings of the Workshop on Languages in Social Media, LSM ’11, page 30–38, USA, 2011. Association for Computational Linguistics.
  8. Introduction, pages 1–21. Springer International Publishing, Cham, 2015.
  9. Good debt or bad debt: Detecting semantic orientations in economic texts. Journal of the Association for Information Science and Technology, 65(4):782–796, 2014.
  10. Decision support from financial disclosures with deep neural networks and transfer learning. Decision Support Systems, 104:38–48, December 2017.
  11. Big data: Deep learning for financial sentiment analysis. Journal of Big Data, 5, 2018.
  12. Sentence-level sentiment analysis of financial news using distributed text representations and multi-instance learning, 2018.
  13. Finsslx: A sentiment analysis model for the financial domain using text simplification. In 2018 IEEE 12th International Conference on Semantic Computing (ICSC), pages 318–319, 2018.
  14. Dogu Araci. Finbert: Financial sentiment analysis with pre-trained language models, 2019.
  15. Finbert: A pretrained language model for financial communications, 2020.
  16. Recent progress on text summarisation based on bert and gpt. In Zhi Jin, Yuncheng Jiang, Robert Andrei Buchmann, Yaxin Bi, Ana-Maria Ghiran, and Wenjun Ma, editors, Knowledge Science, Engineering and Management, pages 225–241, Cham, 2023. Springer Nature Switzerland.
  17. Sentimentgpt: Exploiting gpt for advanced sentiment analysis and its departure from current machine learning, 2023.
  18. Long short-term memory. Neural Comput, 9(8):1735–1780, 1997.
  19. Deep contextualized word representations. In Marilyn Walker, Heng Ji, and Amanda Stent, editors, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 2227–2237, New Orleans, Louisiana, June 2018. Association for Computational Linguistics.
  20. Universal language model fine-tuning for text classification, 2018.
  21. Srikumar Krishnamoorthy. Sentiment analysis of financial news articles using performance indicators. Knowledge and Information Systems, 56(2):373–394, 2018.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Wei Luo (176 papers)
  2. Dihong Gong (14 papers)
Citations (2)
X Twitter Logo Streamline Icon: https://streamlinehq.com