Cross-Domain Behavioral Credit Modeling: transferability from private to central data (2401.09778v1)
Abstract: This paper introduces a credit risk rating model for credit risk assessment in quantitative finance, aiming to categorize borrowers based on their behavioral data. The model is trained on data from Experian, a widely recognized credit bureau, to effectively identify instances of loan defaults among bank customers. Employing state-of-the-art statistical and machine learning techniques ensures the model's predictive accuracy. Furthermore, we assess the model's transferability by testing it on behavioral data from the Bank of Italy, demonstrating its potential applicability across diverse datasets during prediction. This study highlights the benefits of incorporating external behavioral data to improve credit risk assessment in financial institutions.
- https://www.experian.com/.
- Optuna: A next-generation hyperparameter optimization framework. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pages 2623–2631, 2019.
- Predicting consumer default: A deep learning approach. Technical report, National Bureau of Economic Research, 2019.
- Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological), 57(1):289–300, 1995.
- The control of the false discovery rate in multiple testing under dependency. Annals of statistics, pages 1165–1188, 2001.
- A note on the validity of cross-validation for evaluating autoregressive time series prediction. Computational Statistics & Data Analysis, 120:70–83, 2018.
- Risk and risk management in the credit card industry. Journal of Banking & Finance, 72:218–239, 2016.
- William Jay Conover. Practical nonparametric statistics, volume 350. john wiley & sons, 1999.
- Applied regression analysis, volume 326. John Wiley & Sons, 1998.
- Autogluon-tabular: Robust and accurate automl for structured data. arXiv preprint arXiv:2003.06505, 2020.
- Strictly proper scoring rules, prediction, and estimation. Journal of the American Statistical Association, 102(477):359–378, 2007.
- The elements of statistical learning: data mining, inference, and prediction, volume 2. Springer, 2009.
- Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems, 30, 2017.
- Consumer credit-risk models via machine-learning algorithms. Journal of Banking & Finance, 34(11):2767–2787, 2010.
- Beta calibration: a well-founded and easily implemented improvement on logistic calibration for binary classifiers. In Artificial Intelligence and Statistics, pages 623–631. PMLR, 2017.
- Quinn McNemar. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika, 12(2):153–157, 1947.
- Obtaining calibrated probabilities from boosting. CoRR, abs/1207.1403, 2012.
- Machine learning approach for credit scoring. arXiv preprint arXiv:2008.01687, 2020.
- An artificial intelligence approach to shadow rating. arXiv preprint arXiv:1912.09764, 2019.
- Deep learning for mortgage risk. Available at SSRN 2799443, 2018.
- Dirk Tasche. A traffic lights approach to pd validation. arXiv preprint cond-mat/0305038, 2003.
- The Bank of Italy. Non-performing loans (npls) in italy’s banking system. https://www.bancaditalia.it/media/views/2017/npl/index.html.
- The Bank of Italy. Circular no. 139. central credit register, 1991.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.