Dice Question Streamline Icon: https://streamlinehq.com

Uncertain drivers and potential biases of the BERT-based default risk score

Identify and characterize the primary factors that drive the BERT-based default risk score generated from Lending Club loan descriptions, and ascertain whether the score exhibits systematic biases that affect its predictions.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper fine-tunes BERT on loan descriptions from the Lending Club dataset to produce a default risk score and shows that incorporating this score improves a conventional XGBoost granting model. Despite performance gains, the authors emphasize that the model’s opacity limits interpretability.

They explicitly state that the underlying determinants of the score and any potential biases are not presently known, highlighting the need for transparency to meet regulatory expectations and build user trust.

References

However, the true drivers behind a given score or any potential biases remain uncertain.

Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending (2401.16458 - Sanz-Guerrero et al., 29 Jan 2024) in Section 8 (Conclusions)