Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams (2304.10740v3)

Published 21 Apr 2023 in q-fin.GN and cs.LG

Abstract: Knowing which factors are significant in credit rating assignment leads to better decision-making. However, the focus of the literature thus far has been mostly on structured data, and fewer studies have addressed unstructured or multi-modal datasets. In this paper, we present an analysis of the most effective architectures for the fusion of deep learning models for the prediction of company credit rating classes, by using structured and unstructured datasets of different types. In these models, we tested different combinations of fusion strategies with different deep learning models, including CNN, LSTM, GRU, and BERT. We studied data fusion strategies in terms of level (including early and intermediate fusion) and techniques (including concatenation and cross-attention). Our results show that a CNN-based multi-modal model with two fusion strategies outperformed other multi-modal techniques. In addition, by comparing simple architectures with more complex ones, we found that more sophisticated deep learning models do not necessarily produce the highest performance; however, if attention-based models are producing the best results, cross-attention is necessary as a fusion strategy. Finally, our comparison of rating agencies on short-, medium-, and long-term performance shows that Moody's credit ratings outperform those of other agencies like Standard & Poor's and Fitch Ratings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (124)
  1. Gunter Löffler. The systemic risk implications of using credit ratings versus quantitative measures to limit bond portfolio risk. Journal of Financial Services Research, 58(1):39–57, 2020.
  2. Artificial neural networks for corporation credit rating analysis. In 2009 International Conference on Networking and Digital Society, volume 1, pages 81–84. IEEE, 2009.
  3. Domain-driven classification based on multiple criteria and multiple constraint-level programming for intelligent credit scoring. IEEE transactions on knowledge and data engineering, 22(6):826–838, 2010.
  4. Rmt-net: Reject-aware multi-task network for modeling missing-not-at-random data in financial credit scoring. IEEE Transactions on Knowledge and Data Engineering, 2022.
  5. Yair E Orgler. Evaluation of bank consumer loans with credit scoring models. Tel-Aviv University, Department of Envirnonmental Sciences, 1971.
  6. Credit scoring and its applications. SIAM, 2017.
  7. Yuhan Zhu. Research on financial risk control algorithm based on machine learning. In 2021 3rd International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), pages 16–19. IEEE, 2021.
  8. Hybrid neural network models for hydrologic time series forecasting. Applied Soft Computing, 7(2):585–592, 2007.
  9. Deep learning. nature, 521(7553):436–444, 2015.
  10. Li Deng. A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA transactions on Signal and Information Processing, 3, 2014.
  11. Deep learning for financial applications: A survey. Applied Soft Computing, 93:106384, 2020.
  12. A review of the application of deep learning in medical image classification and segmentation. Annals of translational medicine, 8(11), 2020.
  13. An introduction to convolutional neural networks. arXiv preprint arXiv:1511.08458, 2015.
  14. Forecasting turning points in stock price by applying a novel hybrid cnn-lstm-resnet model fed by 2d segmented images. Engineering Applications of Artificial Intelligence, 116:105464, 2022.
  15. Evaluation of deep learning models for multi-step ahead time series prediction. IEEE Access, 9:83105–83123, 2021.
  16. Every corporation owns its image: Corporate credit ratings via convolutional neural networks. In 2020 IEEE 6th International Conference on Computer and Communications (ICCC), pages 1578–1583. IEEE, 2020.
  17. Larry R Medsker and LC Jain. Recurrent neural networks. Design and Applications, 5:64–67, 2001.
  18. Spatial–temporal recurrent neural network for emotion recognition. IEEE transactions on cybernetics, 49(3):839–847, 2018.
  19. KR1442 Chowdhary. Natural language processing. Fundamentals of artificial intelligence, pages 603–649, 2020.
  20. Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
  21. On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259, 2014.
  22. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  23. Deep learning with gated recurrent unit networks for financial sequence predictions. Procedia computer science, 131:895–903, 2018.
  24. Neural machine translation with bert for post-ocr error detection and correction. In Proceedings of the ACM/IEEE joint conference on digital libraries in 2020, pages 333–336, 2020.
  25. Jumping nlp curves: A review of natural language processing research. IEEE Computational intelligence magazine, 9(2):48–57, 2014.
  26. Covid-19 sentiment analysis via deep learning during the rise of novel cases. PLoS One, 16(8):e0255615, 2021.
  27. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  28. A comparative study on transformer vs rnn in speech applications. In 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 449–456. IEEE, 2019.
  29. Financial time series forecasting with deep learning: A systematic literature review: 2005–2019. Applied soft computing, 90:106181, 2020.
  30. A survey of applying machine learning techniques for credit rating: Existing models and open issues. In International Conference on Neural Information Processing, pages 122–132. Springer, 2015.
  31. A multimodal event-driven lstm model for stock prediction using online news. IEEE Transactions on Knowledge and Data Engineering, 33(10):3323–3337, 2020a.
  32. The value of text for small business default prediction: A deep learning approach. European Journal of Operational Research, 295(2):758–771, 2021.
  33. Arianna D’Ulizia. Exploring multimodal input fusion strategies. In Multimodal Human Computer Interaction and Pervasive Services, pages 34–57. IGI Global, 2009.
  34. A review of affective computing: From unimodal analysis to multimodal fusion. Information Fusion, 37:98–125, 2017.
  35. Fusing multiple features for depth-based action recognition. ACM Transactions on Intelligent Systems and Technology (TIST), 6(2):1–20, 2015.
  36. Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition. Machine Vision and Applications, 32(6):121, 2021.
  37. Multimodal deep learning for finance: integrating and forecasting international stock markets. The Journal of Supercomputing, 76(10):8294–8312, 2020.
  38. Multisensor integration and fusion: issues and approaches. In Sensor Fusion, volume 931, pages 42–49. SPIE, 1988.
  39. Ccnet: Criss-cross attention for semantic segmentation. In Proceedings of the IEEE/CVF international conference on computer vision, pages 603–612, 2019.
  40. Cross attention network for few-shot classification. Advances in Neural Information Processing Systems, 32, 2019.
  41. Stacked cross attention for image-text matching. In Proceedings of the European conference on computer vision (ECCV), pages 201–216, 2018.
  42. A survey on machine learning for data fusion. Information Fusion, 57:115–129, 2020.
  43. Text mining for big data analysis in financial sector: A literature review. Sustainability, 11(5):1277, 2019.
  44. Using natural language processing to assess text usefulness to readers: The case of conference calls and earnings prediction. Available at SSRN 3095754, 2017.
  45. A literature review of the economics of covid-19. Journal of Economic Surveys, 35(4):1007–1044, 2021.
  46. The impact of covid-19 coronavirus on stock markets: evidence from selected countries. Muhasebe ve Finans İncelemeleri Dergisi, 3(1):78–84, 2020.
  47. Predicting mortgage early delinquency with machine learning methods. European Journal of Operational Research, 290(1):358–372, 2021.
  48. How can lenders prosper? comparing machine learning approaches to identify profitable peer-to-peer loan investments. European Journal of Operational Research, 294(2):711–722, 2021.
  49. Deep learning for credit scoring: Do or don’t? European Journal of Operational Research, 295(1):292–305, 2021.
  50. Predicting mortgage default using convolutional neural networks. Expert Systems with Applications, 102:207–217, 2018.
  51. Forecasting gold price with the xgboost algorithm and shap interaction values. Annals of Operations Research, pages 1–21, 2021.
  52. Can deep learning predict risky retail investors? a case study in financial risk behavior forecasting. European Journal of Operational Research, 283(1):217–234, 2020.
  53. An intelligent payment card fraud detection system. Annals of operations research, pages 1–23, 2021.
  54. Deep learning in finance and banking: A literature review and classification. Frontiers of Business Research in China, 14(1):1–24, 2020.
  55. Deep convolutional neural networks versus multilayer perceptron for financial prediction. In 2018 International Conference on Communications (COMM), pages 201–206. IEEE, 2018.
  56. Application of deep neural networks to assess corporate credit rating. arXiv preprint arXiv:2003.02334, 2020.
  57. Soft reordering one-dimensional convolutional neural network for credit scoring. Knowledge-Based Systems, page 110414, 2023.
  58. Credit score prediction using genetic algorithm-lstm technique. In 2022 Conference on Information Communications Technology and Society (ICTAS), pages 1–6. IEEE, 2022.
  59. A transformer-based model for default prediction in mid-cap corporate markets. European Journal of Operational Research, 2022.
  60. Corporate bankruptcy prediction using machine learning methodologies with a focus on sequential data. Computational Economics, 59(3):1231–1249, 2022.
  61. Research on quantitative stock selection strategy based on cnn-lstm. In 2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), pages 1142–1147. IEEE, 2022.
  62. GS Vidya and VS Hari. Gold price prediction and modelling using deep learning techniques. In 2020 IEEE Recent Advances in Intelligent Computational Systems (RAICS), pages 28–31. IEEE, 2020.
  63. Comparative analysis of the application of deep learning techniques for forex rate prediction. In 2019 international conference on advancements in computing (ICAC), pages 329–333. IEEE, 2019.
  64. A new convolutional neural network and long short term memory combined model for stock index prediction. In 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), pages 1–6. IEEE, 2021.
  65. Deepclue: visual interpretation of text-based deep stock prediction. IEEE Transactions on Knowledge and Data Engineering, 31(6):1094–1108, 2018.
  66. On the importance of text analysis for stock price prediction. In LREC, volume 2014, pages 1170–1175, 2014.
  67. Yoav Goldberg. A primer on neural network models for natural language processing. Journal of Artificial Intelligence Research, 57:345–420, 2016.
  68. Deep learning models for bankruptcy prediction using textual disclosures. European journal of operational research, 274(2):743–758, 2019.
  69. Deep learning for stock market prediction from financial news articles. In 2017 IEEE international conference on computational intelligence and virtual environments for measurement systems and applications (CIVEMSA), pages 60–65. IEEE, 2017.
  70. Description-text related soft information in peer-to-peer lending–evidence from two leading european platforms. Journal of Banking & Finance, 64:169–187, 2016.
  71. The role of punctuation in p2p lending: Evidence from china. Economic Modelling, 68:634–643, 2018.
  72. When words sweat: Identifying signals for loan default in the text of loan applications. Journal of Marketing Research, 56(6):960–980, 2019.
  73. Credit default prediction from user-generated text in peer-to-peer lending using deep learning. European Journal of Operational Research, 302(1):309–323, 2022.
  74. Predicting distresses using deep learning of text segments in annual reports. Expert Systems with Applications, 132:199–208, 2019.
  75. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, 3361(10):1995, 1995.
  76. Image classification using convolutional neural network (cnn) and recurrent neural network (rnn): a review. Machine Learning and Information Processing: Proceedings of ICMLIP 2019, pages 367–381, 2020.
  77. NI Widiastuti. Convolution neural network for text mining and natural language processing. In IOP Conference Series: Materials Science and Engineering, volume 662, page 052010. IOP Publishing, 2019.
  78. Image approach to speech recognition on cnn. In Proceedings of the 2019 3rd International Symposium on Computer Science and Intelligent Control, pages 1–6, 2019.
  79. Jeffrey L Elman. Finding structure in time. Cognitive science, 14(2):179–211, 1990.
  80. Paul J Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550–1560, 1990.
  81. Sepp Hochreiter. The vanishing gradient problem during learning recurrent neural nets and problem solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(02):107–116, 1998.
  82. A review on deep sequential models for forecasting time series data. Applied Computational Intelligence and Soft Computing, 2022, 2022.
  83. Larger-context language modelling. arXiv preprint arXiv:1511.03729, 2015.
  84. Understanding lstm-a tutorial into long short-term memory recurrent neural networks. arXiv preprint arXiv:1909.09586, 2019.
  85. Lstm and gru neural network performance comparison study: Taking yelp review dataset as an example. In 2020 International workshop on electronic communication and artificial intelligence (IWECAI), pages 98–101. IEEE, 2020.
  86. A comparison between arima, lstm, and gru for time series forecasting. In Proceedings of the 2019 2nd International Conference on Algorithms, Computing and Artificial Intelligence, pages 49–55, 2019.
  87. Gated recurrent multiattention network for vhr remote sensing image classification. IEEE Transactions on Geoscience and Remote Sensing, 60:1–13, 2021.
  88. Full-gru natural language video description for service robotics applications. IEEE robotics and automation letters, 3(2):841–848, 2018.
  89. Language model is all you need: Natural language understanding as question answering. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 7803–7807. IEEE, 2021.
  90. Attentional control and the self: the self-attention network (SAN). Cognitive neuroscience, 7(1-4):5–17, 2016.
  91. Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771, 2019.
  92. Transformers in vision: A survey. ACM computing surveys (CSUR), 54(10s):1–41, 2022.
  93. Transformers in time series: A survey. arXiv preprint arXiv:2202.07125, 2022.
  94. How to fine-tune bert for text classification? In Chinese Computational Linguistics: 18th China National Conference, CCL 2019, Kunming, China, October 18–20, 2019, Proceedings 18, pages 194–206. Springer, 2019.
  95. Attentive convolutional neural network based speech emotion recognition: A study on the impact of input features, signal length, and acted speech. arXiv preprint arXiv:1706.00612, 2017.
  96. Franklin E White. Data fusion lexicon. Technical report, Joint Directors of Labs Washington DC, 1991.
  97. Multimodal intelligence: Representation learning, information fusion, and applications. IEEE Journal of Selected Topics in Signal Processing, 14(3):478–493, 2020.
  98. Self-attention with relative position representations. arXiv preprint arXiv:1803.02155, 2018.
  99. Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1):1929–1958, 2014.
  100. Lawrence J White. Markets: The credit rating agencies. Journal of Economic Perspectives, 24(2):211–226, 2010.
  101. A comparison of bond ratings from moody’s s&p and fitch ibca. Financial Markets, Institutions & Instruments, 8(4):1–45, 1999.
  102. Seeking Alpha. Seeking alpha. https://seekingalpha.com. Most recent collection as of July 2022.
  103. Megan French-Marcelin. The Rise of Financial Information Platforms: Markets, Machines, and Open Data. Routledge, New York, NY, 2020. ISBN 9780367187641. doi: 10.4324/9780429459138.
  104. Corporate climate risk: Measurements and responses. Available at SSRN 3508497, 2020b.
  105. Keras tokenizer. https://keras.io/api/keras_nlp/tokenizers/tokenizer/. Accessed: September 2022.
  106. Bert tokenizer. https://huggingface.co/docs/transformers/model_doc/bert. Accessed: September 2022.
  107. A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance. BMC Medical Research Methodology, 22(1):1–12, 2022.
  108. Francois Chollet. Deep learning with Python. Simon and Schuster, 2021.
  109. Efficient, compositional, order-sensitive n-gram embeddings. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 503–508, 2017.
  110. The relationship between precision-recall and roc curves. In Proceedings of the 23rd international conference on Machine learning, pages 233–240, 2006.
  111. A deep-learning based multimodal system for Covid-19 diagnosis using breathing sounds and chest X-ray images. Applied Soft Computing, 109, 107522, 2021. Elsevier.
  112. Collaborative recommendation model based on multi-modal multi-view attention network: Movie and literature cases. In Applied Soft Computing, 110518, 2023. Elsevier.
  113. Multi-modal feature fusion for 3D object detection in the production workshop. Applied Soft Computing, 115, 108245, 2022. Elsevier.
  114. Multimodal learning model based on video–audio–chat feature fusion for detecting e-sports highlights. Applied Soft Computing, 126, 109285, 2022. Elsevier.
  115. Pantelis Z Lappas, Athanasios N Yannacopoulos. A machine learning approach combining expert knowledge with genetic algorithms in feature selection for credit risk assessment. Applied Soft Computing, 107, 107391, 2021. Elsevier.
  116. Statistical and machine learning models in credit scoring: A systematic literature survey. Applied Soft Computing, 91, 106263, 2020. Elsevier.
  117. The value of big data for credit scoring: Enhancing financial inclusion using mobile phone data and social network analytics. Applied Soft Computing, 74, 26–39, 2019. Elsevier.
  118. Yanli Zhao, Guang Yang. Deep Learning-based Integrated Framework for stock price movement prediction. Applied Soft Computing, 133, 109921, 2023. Elsevier.
  119. A review on data fusion in multimodal learning analytics and educational data mining. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 12(4), e1458, 2022. Wiley Online Library.
  120. KerasNLP. Year: 2022. How Published: https://github.com/keras-team/keras-nlp
  121. Credit rating by hybrid machine learning techniques. In Applied soft computing, Volume 10, Number 2, pages 374-380, 2010. Publisher: Elsevier.
  122. A deep-learning based Bayesian approach to seismic imaging and uncertainty quantification. In EAGE 2020 Annual Conference & Exhibition Online, Volume 2020, Number 1, pages 1-5, 2020. Organization: EAGE Publications BV.
  123. Tradeoffs in data augmentation: An empirical study. Year: 2021. Journal: arXiv preprint arXiv:2105.12795.
  124. Sovereign risk ratings’ country classification using machine learning. Year: 2019. Publisher: Federal University of Paraíba: Paraíba.
Citations (5)

Summary

We haven't generated a summary for this paper yet.