Improving Business Insurance Loss Models by Leveraging InsurTech Innovation (2401.16723v1)
Abstract: The insurance industry has recently embraced a range of transformative and disruptive InsurTech innovations. In particular, with rapid progress in data science and computational capabilities, InsurTech can integrate a multitude of emerging data sources, opening opportunities to enhance risk classification and claims management. This paper combines real-life proprietary insurance claims information with InsurTech data to enhance the loss model, a fundamental component of insurance companies' risk management. Our study further uses various machine learning techniques to quantify the predictive improvement of the InsurTech-enhanced loss model over the insurer's in-house loss model. The quantification process provides a deeper understanding of the value of the InsurTech innovation and points to potential risk factors that are unexplored in traditional insurance loss modeling. This study represents a successful academic-industry collaboration, suggesting a path for future partnerships between industry and academic institutions.