Utilizing the LightGBM Algorithm for Operator User Credit Assessment Research (2403.14483v1)
Abstract: Mobile Internet user credit assessment is an important way for communication operators to establish decisions and formulate measures, and it is also a guarantee for operators to obtain expected benefits. However, credit evaluation methods have long been monopolized by financial industries such as banks and credit. As supporters and providers of platform network technology and network resources, communication operators are also builders and maintainers of communication networks. Internet data improves the user's credit evaluation strategy. This paper uses the massive data provided by communication operators to carry out research on the operator's user credit evaluation model based on the fusion LightGBM algorithm. First, for the massive data related to user evaluation provided by operators, key features are extracted by data preprocessing and feature engineering methods, and a multi-dimensional feature set with statistical significance is constructed; then, linear regression, decision tree, LightGBM, and other machine learning algorithms build multiple basic models to find the best basic model; finally, integrates Averaging, Voting, Blending, Stacking and other integrated algorithms to refine multiple fusion models, and finally establish the most suitable fusion model for operator user evaluation.
- Meindert Fennema “International networks of banks and industry” Springer Science & Business Media, 2012
- Işık Biçer, Deniz Sevis and Taner Bilgiç “Bayesian credit scoring model with integration of expert knowledge and customer data” In International Conference 24th ini EURO Conference “Continuous Optimization and Information Technologies in the FFinancial Sector”(MEC EurOPT 2010), 2010, pp. 324–329
- Sadanori Konishi “Statistical model evaluation and information criteria” In Multivariate Analysis, Design of Experiments, and Survey Sampling CRC Press, 1999, pp. 393–424
- “Identification of a standard AI based technique for credit risk analysis” In Benchmarking: An International Journal 23.5 Emerald Group Publishing Limited, 2016, pp. 1381–1390
- Erik Hofmann “Big data and supply chain decisions: the impact of volume, variety and velocity properties on the bullwhip effect” In International Journal of Production Research 55.17 Taylor & Francis, 2017, pp. 5108–5126
- Ning Chen, Bernardete Ribeiro and An Chen “Financial credit risk assessment: a recent review” In Artificial Intelligence Review 45 Springer, 2016, pp. 1–23
- “Ensemble Methodology: Innovations in Credit Default Prediction Using LightGBM, XGBoost, and LocalEnsemble” In arXiv preprint arXiv:2402.17979, 2024
- “Enhancing Credit Card Fraud Detection: A Neural Network and SMOTE Integrated Approach” In Journal of Theory and Practice of Engineering Science 4.02, 2024, pp. 23–30
- “Ensemble learning: A survey” In Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 8.4 Wiley Online Library, 2018, pp. e1249
- Leo Breiman “Random forests” In Machine learning 45 Springer, 2001, pp. 5–32
- “Gradient boosted decision trees for high dimensional sparse output” In International conference on machine learning, 2017, pp. 3182–3190 PMLR
- “Xgboost: A scalable tree boosting system” In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785–794
- “Lightgbm: A highly efficient gradient boosting decision tree” In Advances in neural information processing systems 30, 2017