P2P Investment Data Analytics: A Case Study of Lending Club
DOI:
https://doi.org/10.54691/bcpbm.v38i.3979Keywords:
P2P lending; default risk; machine learning; supervised learning; Lending Club.Abstract
In this study, a forecasting model is constructed for default risk by applying several machine learning models, including logistic regression, random forest, gaussian naive bayes, artificial neural networks, support vector machine, and XGBoost. Cross validation and voting ensemble learning are also utilized for best suitable hyperparameters as well. Dataset comes from Lending Club and then cleaned, normalized and standardized it into an ideal input dataset which are splitting into training and testing ones later. Among the methods, XGBoost achieved the best performance.
Downloads
References
Lee, Samuel C., and Edward T. Lee. "Fuzzy sets and neural networks." (1974): 83-103.
Wang, H., Li, C., Gu, B., & Min, W. (1984). “Does AI-based credit scoring improve financial inclusion? Evidence from online payday lending”. In 40th international conference on information systems. ICIS 2019.
Chen, D., & Han, C. (2012). A comparative study of online P2P lending in the USA and China. Journal of Internet Banking and Commerce, 17(2), 1.
Reddy, S., & Gopalaraman, K. (2016). Peer to peer lending, default prediction-evidence from lending club. The Journal of Internet Banking and Commerce, 21(3).
Mills KG, McCarthy B (2016) The State of Small Business Lending: Innovation and Technology and the Implications for Regulation. HBS Working Paper No. 17-042.
Zhang, Z., Gao, G., & Shi, Y. (2014). Credit risk evaluation using multi-criteria optimization classifier with kernel, fuzzification and penalty factors. European Journal of Operational Research, 237(1), 335-348.
Freedman, S., & Jin, G. Z. (2017). The information value of online social networks: Lessons from peer-to-peer lending. International Journal of Industrial Organization, 51, 185-222.
Abdou, H., J. Pointon, & A. El-Masry (2008): “Neural nets versus conventional techniques in credit scoring in Egyptian banking.” Expert Systems with Applications 35(3): pp. 1275–1292.
Lessmann, S., B. Baesens, H. V. Seow, & L. C. Thomas (2015): “Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research.” European Journal of Operational Research 247(1): pp.124–136.
Wendler, T. & S. Grottrup (2016): Data Mining with SPSS Modeler. Springer International Publishing.Ma Kunlong. Short term distributed load forecasting method based on big data. Changsha: Hunan University, 2014.






