Predictive models performance in financial services for identifying at-risk customers
DOI:
10.47709/ijmdsa.v3i4.5161Keywords:
Independently and jointly used in assessment, financing, and credit, risky customers, artificial intelligence, pre-processing data, assessment parameters, post-model quantification.Dimension Badge Record
Abstract
It is therefore essential in financial service where the concept of pre-emptive modeling is used to select customers that are most likely to engage in negative behaviors like loan defaulting, fraud or churn. This paper will seek to analyze different modeling techniques like Logistic regression, Decision trees, Random forests, Gradient boosting, and Neural network among others. The paper pays considerable attention to the data preprocessing step; problems such as imbalance ratio and missing values as well as important parameters such as precision, recall, and AUC-ROC to measure the efficiency of the models. Some of these problems that have been highlighted in the review include; data privacy, compliance to laws and regulations among others and model interpretability which plays an important role in the financial industry. This means that new achievements like XAI, Real-time analytics, Federated learning have contributed to the improvement of model interpretability, model size and data security. Such include technological applications to increase accuracy of Risk prediction while cutting costs, and help the financial institutions retain the trust of the customer given the ever changing regulation. By integrating these novelties and ethical issues, some more financial institutions should enhance the centrality of the presented predictions to improve the identification of the risks and customers’ appeal.
Abstract viewed = 63 times
References
Gilbert R, Widom CS, Browne K, Fergusson D, Webb E, Janson S. Burden and consequences of child maltreatment in high-income countries. Lancet 2009; 373(9657):68–81.
Anda RF, Butchart A, Felitti VJ, Brown DW. Building a framework for global surveillance of the public health implications of adverse childhood experiences. Am J Prev Med 2010; 39(1):93–8.
Putnam-Hornstein E, Webster D, Needell B, Magruder J. A public health approach to child maltreatment surveillance: evidence from a data linkage project in the U.S. Child Abuse Rev 2011; 20(4):256–73
WHO. Prevention of child abuse and neglect: making the links between human rights and public health. Geneva: Department of Injuries and Violence Prevention, WHO, 2001.
Mikton C, Butchart A. Child maltreatment prevention: a systematic review of reviews. Bull WHO 2009; 87(5):353–61.
Butchart A. Epidemiology: the major missing element in the global response to child maltreatment? Am J Prev Med 2008; 34(4S):S103–5?
Gilbert R, Fluke J, O'Donnell M, et al. Child maltreatment: variation in trends and policies in six developed countries. Lancet 2012; 379(9817): 758–72
Coulton CJ, Crampton DS, Irwin M, Spilsbury JC, Korbin JE. How neighborhoods influence child maltreatment: a review of the literature and alternative pathways. Child Abuse Negl 2007; 31(11–2):1117–42.
Stith SM, Liu T, Davies LC, et al. Risk factors in child maltreatment: a metaanalytic review of the literature. Aggress Violent Behav 2009; 14(1):13–29.
Prinz RJ, Sanders MR, Shapiro CJ, Whitaker DJ, Lutzker JR. Population-based prevention of child maltreatment: the U.S. Triple P System Population Trial. Prev Sci 2009; 10(1):1–12.
Bilukha O, Hahn RA, Crosby A, et al. The effectiveness of early childhood home visitation in preventing violence: a systematic review. Am J Prev Med 2005; 28(2S1):11–39?
Billings J, Dixon J, Mijanovich T, Wennberg D. Case findings for patients at risk of readmission to hospital: development of algorithm to identify high risk patients. BMJ 2006; 333:327.
Cabeças, A. (2014). Comportamento dos clientes da banca portuguesa em período de crise económica [Universidade Autónoma de Lisboa].
Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., & Wirth, R. (2000). CRISP-DM 1.0: Step-by-Step Data Mining Guide. DaimlerChrysler.
Cohen, D., Gan, C., Yong, H. H. A., & Chong, E. (2007). Customer Retention by Banks in New Zealand. Banks and Bank Systems, 2(1), 40–55. Customer Retention by Banks in New Zealand
Coussement, K., Lessmann, S., & Verstraeten, G. (2017). A comparative analysis of data preparation algorithms for customer churn prediction: A case study in the telecommunication industry. Decision Support Systems, 95, 27–36.
Lima Lemos, R. A., Silva, T. C., & Tabak, B. M. (2022). Propension to customer churn in a financial institution: a machine learning approach. Neural Computing and Applications, 34(14), 11751–11768.
Ejeu, E. (2021). Analysis of customer churn in Ugandan commercial banks: a case study of Stanbic Bank [College of Business and Management Sciences].
Hadden, J. (2008). A Customer Profiling Methodology for Churn Prediction [School of Applied Sciences].
Ismail, M. R., Awang, M. K., Rahman, M. N. A., & Makhtar, M. (2015). A Multi-Layer Perceptron Approach for Customer Churn Prediction. International Journal of Multimedia and Ubiquitous Engineering, 10(7), 213–222.
Oluwatoyin, A. M., Misra, S., Wejin, J., Gautam, A., Behera, R. K., & Ahuja, R. (2023). Customer Churn Prediction in Banking Industry Using Power Bi (pp. 767–774).
Oyeniyi, A. O., & Adeyemo, A. B. (2015). Customer Churn Analysis in Banking Sector Using Data Mining Techniques. African Journal of Computing & ICT African Journal of Computing & ICT Reference Format: A. O.
Hui, S. H., Khai, W. K., XinYing, C., & Wai, P. W. (2023). Prediction of customer churn for ABC Multistate Bank using machine learning algorithms/Hui Shan Hon...[et al.]. Malaysian Journal of Computing (MJoC), 8(2), 1602-1619.
Asselman, A., Khaldi, M., & Aammou, S. (2021). Enhancing the prediction of student performance based on the machine learning XGBoost algorithm. Interactive Learning Environments, 1-20. DOI: https://doi.org/10.1080/10494820.2021.1928235
Berk, R. (2012). Criminal justice forecasts of risk: A machine learning approach. Springer Science & Business Media. Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.
Chandar, M., Laha, A., & Krishna, P. (2006, March). Modeling churn behavior of bank customers using predictive data mining techniques. In National Conference on Soft computing techniques for engineering applications (SCT-2006) (pp. 24-26).
Chen, S., Webb, G. I., Liu, L., & Ma, X. (2020). A novel selective naïve Bayes algorithm. Knowledge-Based Systems, 192, 105361. Cica, D., Sredanovic, B., Tesic, S., & Kramar, D. (2020). Predictive modeling of turning operations under different cooling/lubricating conditions for sustainable manufacturing with machine learning techniques. Applied Computing and Informatics. DOI: https://doi.org/10.1016/j.aci.2020.02.001
Costache, R., Arabameri, A., Moayedi, H., Pham, Q. B., Santosh, M., Nguyen, H., & Pham, B. T. (2022). Flash-flood potential index estimation using fuzzy logic combined with deep learning neural network, naïve Bayes, XGBoost and classification and regression tree.
Geocarto International, 37(23), 6780-6807. Costache, R., Hong, H., & Wang, Y. (2019). Identification of torrential valleys using GIS and a novel hybrid integration of artificial intelligence, machine learning and bivariate statistics. Catena, 183, 104179.
Cunningham, P., & Delany, S. J. (2021). K-nearest neighbour classifiers-a tutorial. ACM Computing Surveys (CSUR), 54(6), 1-25.
Lima Lemos, R. A., Silva, T. C., & Tabak, B. M. (2022). Propension to customer churn in a financial institution: A machine learning approach. Neural Computing and Applications, 34, 11751-11768.
Guan, D., Yuan, W., Lee, Y. K., Najeebullah, K., & Rasel, M. K. (2014). A review of ensemble learning based feature selection. IETE Technical Review, 31(3), 190-198.
Guliyev, H., & Tato?lu, F. Y. (2021). Customer churn analysis in banking sector: Evidence from explainable machine learning models. Journal of Applied Microeconometrics, 1(2), 85-99.
Gumus, M., & Kiran, M. S. (2017, October). Crude oil price forecasting using XGBoost. In 2017 International conference on computer science and engineering (UBMK) (pp. 1100- 1103). IEEE.
Ha Thanh, N., & Vy, N. (2022). Building a proper churn prediction model for Vietnam’s mobile banking service. International Journal of Advanced and Applied Sciences, 9(7), 139-149.
He, B., Shi, Y., Wan, Q., & Zhao, X. (2014). Prediction of customer attrition of commercial banks based on SVM model. Procedia computer science, 31, 423-430.
Hung, S. Y., Yen, D. C., & Wang, H. Y. (2006). Applying data mining to telecom churn management. Expert Systems with Applications, 31(3), 515-524.
Kaur, H., & Kumari, V. (2022). Predictive modelling and analytics for diabetes using a machine learning approach. Applied Computing and Informatics, 18, 90-100.
Kaur, I., & Kaur, J. (2020, November). Customer churn analysis and prediction in banking industry using machine learning. In 2020 Sixth International Conference on Parallel, Distributed and Grid Computing (PDGC) (pp. 434-437). IEEE.
Keramati, A., Ghaneei, H., & Mirmohammadi, S. M. (2016). Developing a prediction model for customer churn from electronic banking services using data mining. Financial Innovation, 2(1), 1-13.
Khodabandehlou, S., & Rahman, M. Z. (2017). Comparison of supervised machine learning techniques for customer churn prediction based on analysis of customer behavior. Journal of Systems and Information Technology, 19, 65-93.
Hou, L., Sharma, B. Y., Li, H., Chen, C., Li, Y., & Chen, H. (2018). Developing a new intelligent system for the diagnosis of tuberculous pleural effusion. Computer Methods and Programs in Biomedicine, 153, 211-225.
Malhotra, D. K., Malhotra, K., & Malhotra, R. (2020). Evaluating consumer loans using machine learning techniques. In Applications of Management Science. Emerald Publishing Limited.
Malik, E. F., Khaw, K. W., & Chew, X. Y. (2022). A new hybrid data preprocessing technique for fraud detection prediction. Computing and Informatics, 41, 981-1001.
Marcelino, P., de Lurdes Antunes, M., Fortunato, E., & Gomes, M. C. (2021). Machine learning approach for pavement performance prediction. International Journal of Pavement Engineering, 22(3), 341-354.
Meza Ramirez, C. A., Greenop, M., Ashton, L., & Rehman, I. U. (2021). Applications of machine learning in spectroscopy. Applied Spectroscopy Reviews, 56(8-10), 733-763.
Oh, G., Song, J., Park, H., & Na, C. (2022). Evaluation of random forest in crime prediction: Comparing three-layered random forest and logistic regression. Deviant Behavior, 43(9), 1036-1049.
Patgiri, R., Varshney, U., Akutota, T., & Kunde, R. (2018, November). An investigation on intrusion detection system using machine learning. In 2018 IEEE Symposium Series on Computational Intelligence (SSCI) (pp. 1684-1691). IEEE.
Patro, K. K., Jaya Prakash, A., Jayamanmadha Rao, M., & Rajesh Kumar, P. (2022). An efficient optimized feature selection with machine learning approach for ECG biometric recognition. IETE Journal of Research, 68(4), 2743-2754.
Pisal, N. S., Abdul Rahman, S., Hanafiah, M., & Kamarudin, S. I. (2022). Prediction of life expectancy for Asian population using machine learning ALGORITHMS. Malaysian Journal of Computing (MJoC), 7(2), 1150-1161.
Rahman, M., & Kumar, V. (2020, November). Machine learning based customer churn prediction in banking. In 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA) (pp. 1196-1201). IEEE.
Sahin, E. K. (2020). Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN Applied Sciences, 2(7), 1-17.
Shafie, S., Ooi, S. P., & Khaw, K. W. (2023). Prediction of employee promotion using hybrid sampling method with machine learning architecture. Malaysian Journal of Computing, 8(1), 1264-1286.
Taunk, K., De, S., Verma, S., & Swetapadma, A. (2019, May). A brief review of nearest neighbor algorithm for learning and classification. In 2019 International Conference on Intelligent Computing and Control Systems (ICCS) (pp. 1255-1260). IEEE.
Tékouabou, S. C., Gherghina, ?. C., Toulni, H., Mata, P. N., & Martins, J. M. (2022). Towards Explainable Machine Learning for Bank Churn Prediction Using Data Balancing and Ensemble-Based Methods. Mathematics, 10(14), 2379.
Wang, M., & Chen, H. (2020). Chaotic multi-swarm whale optimizer boosted support vector machine for medical diagnosis. Applied Soft Computing, 88, and 105946.
Yan, C., Liu, X., Boota, M. W., & Pan, Z. (2022). A Comparison of Machine Learning Methods Applied to the Automated Selection of River Networks. The Cartographic Journal, 59(3), 187-202.
Zhang, H., Zimmerman, J., Nettleton, D., & Nordman, D. J. (2020). Random forest prediction intervals. The American Statistician, 74(4), 392-406.
Downloads
ARTICLE Published HISTORY
Issue
Section
License
Copyright (c) 2024 Ashok Ghimire

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.