Loan Default Prediction based on Machine Learning (LightGBM Model)

Authors

  • Xingzhe Dong

DOI:

https://doi.org/10.54691/bcpbm.v25i.1857

Keywords:

Loan default prediction, LightGBM algorithm, Machine learning

Abstract

With the rapid development of Internet finance, the number of online loan platforms has been increasing, and the scale of loan business is gradually expanding. At the same time, the bad debt and non-performing loan ratios have also risen sharply, the reasons for which include incomplete information of lenders, imprudent management of capital chain, inaccurate default prediction, etc. Therefore, loan platforms need a set of efficient and accurate loan default prediction solutions to ensure the healthy operation of Internet finance and avoid huge losses for platforms and investors. Current loan default prediction models mainly contain traditional machine learning methods such as logistic regression and decision tree, as well as integrated models such as Random Forest and GBDT. In this paper, we use LightGBM machine learning model to process massive loan default information using the lender information dataset on Tianchi Platform, and the process includes data preprocessing, feature engineering, model training and evaluation. The experimental results show that the LightGBM algorithm has a strong ability to predict loan default, with a final AUC value reaching about 0.73. This method is helpful for banks and Internet loan platforms to conduct background investigation and default prediction of loan applicants, so as to strengthen the ability of loan risk management.

Downloads

Download data is not yet available.

References

Financial Statistics Report for the first half of 2022 (PBC.gov.cn), 2022, 7.

Zhang Hanping. Research on Default prediction of personal Loan based on LightGBM Model [D]. Central China normal university, 2021. DOI: 10.27159 /, dc nki. Ghzsu. 2021.000706.

Akwaa Sekyi, Ellis Kofi; Bosompra, Portia. (2015) . Determinants of business loan default in Ghana. Junior Scientific Researcher, 2015, vol. 1, Num. 1, p. 10-26. http://hdl.handle.net/10459.1/65930.

Luo Fangke, Chen Xiaohong. [LUO F K, Chen X H. Credit risk assessment of individual small loan based on Logistic regression model and its application. Theory and practice of finance and economics, 2017, 38 (01) : 30-35, DOI: 10.16339 / j.carol carroll nki HDXBCJB. 2017.01.005.

Yuan Xin, Duan Huaqiong. Based on decision tree algorithm of Titanic data prediction [J]. Computer knowledge and technology, 2020 (22) : 185-186 + 199. DOI: 10.14004 / j.carol carroll nki CKT. 2020.2622.

https://towardsdatascience.com/decision-trees-in-machine-learning-641b9c4e8052

Liu X D, Li F, Wang J, et al. credit risk assessment of online lending under the background of big data: a case study of renrendai [J]. Statistics and information forum,2016,31(05):41-48.]

Liu Yiding, Xu Jing. Financial early warning model based on strong Adaboost classifiers [J]. Journal of modern business, 2020 (31) : 187-188. The DOI: 10.14097 / j.carol carroll nki. 5392/2020.31.075.

Rezazadeh, Alireza. (2020). A Generalized Flow for B2B Sales Predictive Modeling: An Azure Machine-Learning Approach. Forecasting. 2. 267-283. 10.3390/ Forecast2030015.

Nathan Aim. (2020). Mathematics behind ROC-AUc Interpretation, from https://medium.com/analytics-vidhya/mathematics-behind-roc-auc-interpretation-e4e6f202a015

Tang Yifeng. Research on Loan default prediction model based on XGBoost algorithm and LightGBM algorithm [J]. Modern computer,2021,27(32):33-37.

Zhang Hanping. Based on the research of personal loan default LightGBM model [D]. Central China normal university, 2021. The DOI: 10.27159 /, dc nki. Ghzsu. 2021.000706.

Downloads

Published

2022-08-30

How to Cite

Dong, X. (2022). Loan Default Prediction based on Machine Learning (LightGBM Model). BCP Business & Management, 25, 457-468. https://doi.org/10.54691/bcpbm.v25i.1857