Interpretable Data Mining Approaches to Predict Term Deposits Subscriptions

Authors

  • Qingyang Chen

DOI:

https://doi.org/10.54691/bcpbm.v44i.4841

Keywords:

Telemarketing; Machine Learning; Feature Importance; Predictive models; Random Forest

Abstract

Under the covid pandemic, informatization has accelerated. Customer behaviors have changed dramatically in almost every industry, which requires companies and organizations to analyze the latest customer features and develop specialized marketing contents. Banks play a major role in the financial markets, and term deposit subscription is one of the most important products of banks. In this study, a Portuguese retail bank’s telemarketing data on selling term deposits has been analyzed which includes 16 input variables. All 16 variables have been converted to numeric inputs by using one-hot encoding method. For testing, the dataset has been divided into train and test groups by 80% and 20% respectively. To make this study more interpretable, two tree-structured data mining models, decision tree and random forest classifier, has been trained and tested. The input variable called “Duration” can only be known after the outcome is known so it has been dropped when creating predictive models. Two feature importance histograms and three confusion matrics are generated to visualize the customer features and evaluate the models. Area under Curve-Receiver Operating Characteristic Score (AUC-ROC scores) are also computed to compare the accuracy of models. Testing results have proved that “duration” has a severe relationship with clients’ subscriptions and should not be included in predictive models. Random forest classifier is better than decision tree and can generate comparable accuracy to previous models which included “duration.” Balance and age are the top two influential customer features on term deposit subscriptions.

Downloads

Download data is not yet available.

References

M. A. T. Rony, M. M. Hassan, E. Ahmed, A. Karim, S. Azam and D. S. A. A. Reza, Identifying Long-Term Deposit Customers: A Machine Learning Approach, 2021 2nd International Informatics and Software Engineering Conference (IISEC), 2021, pp. 1-6, doi: 10.1109/IISEC54230.2021.9672452.

Parvatiyar A, Sheth J N. Customer relationship management: Emerging practice, process, and discipline. Journal of Economic & Social Research, 2001, 3(2).

Kotler, Philip, and Kevin Lane Keller. A framework for marketing management. Boston, MA: Pearson, 2016.

Borugadda P, Nandru P, Madhavaiah C. Predicting the success of bank telemarketing for selling long-term deposits: An application of machine learning algorithms. St. Theresa Journal of Humanities and Social Sciences, 2021, 7(1): 91-108.

Zhuang Q R, Yao Y W, Liu O. Application of data mining in term deposit marketing. Proceedings of the International Multi Conference of Engineers and Computer Scientists. 2018, 2.

Ling C X, Li C. Data mining for direct marketing: Problems and solutions, Kdd. 1998, 98: 73-79.

Moro et al., S. Moro, P. Cortez and P. Rita. A Data-Driven Approach to Predict the Success of Bank Telemarketing. Decision Support Systems, Elsevier, 62:22-31, 2014

Song Y Y, Ying L U. Decision tree methods: applications for classification and prediction. Shanghai archives of psychiatry, 2015, 27(2): 130.

Yu Q, et al. Pose-guided matching based on deep learning for assessing quality of action on rehabilitation training. Biomedical Signal Processing and Control 72, 2022, 103323.

Patel Jigneshkumar L., and Ramesh K. Goyal. Applications of artificial neural networks in medical science. Current clinical pharmacology 2.3, 2007, 217-226.

Downloads

Published

2023-04-27

How to Cite

Chen, Q. (2023). Interpretable Data Mining Approaches to Predict Term Deposits Subscriptions. BCP Business & Management, 44, 345-350. https://doi.org/10.54691/bcpbm.v44i.4841