ac

Evaluating the Efficacy of Machine Learning Models in Credit Card Fraud Detection

Authors

  • Gregorius Airlangga Universitas Katolik Atmajaya

DOI:

10.47709/cnahpc.v6i2.3814

Keywords:

Credit Card, Machine Learning, Fraud Detection, Ensemble Methods, XGBoost

Dimension Badge Record



Abstract

This research evaluates the effectiveness of various machine learning models in detecting credit card fraud within a dataset comprising 555,719 transactions. The study meticulously compares traditional and advanced models, including Logistic Regression, Support Vector Machines (SVM), Random Forest, Gradient Boosting, k-Nearest Neighbors (k-NN), Naive Bayes, AdaBoost, LightGBM, XGBoost, and Multilayer Perceptrons (MLP), in terms of accuracy and reliability. Through a robust methodology involving extensive data preprocessing, feature engineering, and a 5-fold stratified cross-validation, the research identifies XGBoost as the most effective model, demonstrating a near-perfect mean accuracy of 0.9990 with minimal variability. The results emphasize the significance of model choice, data preparation, and the potential of ensemble and boosting techniques in managing the complexities of fraud detection. The findings not only contribute to the academic discourse on fraud detection but also suggest practical applications for real-world systems, aiming to enhance security measures in financial transactions. Future research directions include exploring hybrid models and adapting to evolving fraud tactics through continuous learning systems.

Downloads

Download data is not yet available.
Google Scholar Cite Analysis
Abstract viewed = 405 times

References

Al-Hashedi, K. G., & Magalingam, P. (2021). Financial fraud detection applying data mining techniques: A comprehensive review from 2009 to 2019. Computer Science Review, 40, 100402.

Ali, A., Abd Razak, S., Othman, S. H., Eisa, T. A. E., Al-Dhaqm, A., Nasser, M., … Saif, A. (2022). Financial fraud detection based on machine learning: a systematic literature review. Applied Sciences, 12(19), 9637.

Aziz, R. M., Baluch, M. F., Patel, S., & Ganie, A. H. (2022). LGBM: a machine learning approach for Ethereum fraud detection. International Journal of Information Technology, 14(7), 3321–3331.

Baesens, B., Höppner, S., & Verdonck, T. (2021). Data engineering for fraud detection. Decision Support Systems, 150, 113492.

Bi, T., Xia, B., Xing, Z., Lu, Q., & Zhu, L. (2023). On the way to sboms: Investigating design issues and solutions in practice. ACM Transactions on Software Engineering and Methodology.

Cha, G.-W., Moon, H.-J., & Kim, Y.-C. (2021). Comparison of random forest and gradient boosting machine models for predicting demolition waste based on small datasets and categorical variables. International Journal of Environmental Research and Public Health, 18(16), 8530.

Dixit, P., Bhattacharya, P., Tanwar, S., & Gupta, R. (2022). Anomaly detection in autonomous electric vehicles using AI techniques: A comprehensive survey. Expert Systems, 39(5), e12754.

Gabudeanu, L., Brici, I., Mare, C., Mihai, I. C., & ?cheau, M. C. (2021). Privacy intrusiveness in financial-banking fraud detection. Risks, 9(6), 104.

Ghimire, S., Nguyen-Huy, T., Deo, R. C., Casillas-Perez, D., & Salcedo-Sanz, S. (2022). Efficient daily solar radiation prediction with deep learning 4-phase convolutional neural network, dual stage stacked regression and support vector machine CNN-REGST hybrid model. Sustainable Materials and Technologies, 32, e00429.

Hashemi, S. K., Mirtaheri, S. L., & Greco, S. (2022). Fraud detection in banking data by machine learning techniques. IEEE Access, 11, 3034–3043.

Hilal, W., Gadsden, S. A., & Yawney, J. (2022). Financial fraud: a review of anomaly detection techniques and recent advances. Expert Systems With Applications, 193, 116429.

Hussain, T., Min Ullah, F. U., Muhammad, K., Rho, S., Ullah, A., Hwang, E., … Baik, S. W. (2021). Smart and intelligent energy monitoring systems: A comprehensive literature survey and future research guidelines. International Journal of Energy Research, 45(3), 3590–3614.

Irofti, P., Puatrascu, A., & Baltoiu, A. (2021). Fraud detection in networks. Enabling AI Applications in Data Science, 517–536.

Janani, S., Sivarathinabala, M., Anand, R., Ahamad, S., Usmani, M. A., & Basha, S. M. (2023). Machine Learning Analysis on Predicting Credit Card Forgery. International Conference On Innovative Computing And Communication, 137–148.

Jeffrey, N., Tan, Q., & Villar, J. R. (2024). Using Ensemble Learning for Anomaly Detection in Cyber--Physical Systems. Electronics, 13(7), 1391.

Jha, B. K., Sivasankari, G. G., & Venugopal, K. R. (2020). Fraud detection and prevention by using big data analytics. 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC), 267–274.

Kavzoglu, T., & Teke, A. (2022). Predictive Performances of ensemble machine learning algorithms in landslide susceptibility mapping using random forest, extreme gradient boosting (XGBoost) and natural gradient boosting (NGBoost). Arabian Journal for Science and Engineering, 47(6), 7367–7385.

Kelvin, K. L. (2023). Credit Card Fraud Prediction.

Kumar, S., Gunjan, V. K., Ansari, M. D., & Pathak, R. (2022). Credit card fraud detection using support vector machine. Proceedings of the 2nd International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications: ICMISC 2021, 27–37.

Lee, S. M., & Lee, D. (2020). “Untact”: a new customer service strategy in the digital age. Service Business, 14(1), 1–22.

Maiti, M., Vukovi?, D., Mukherjee, A., Paikarao, P. D., & Yadav, J. K. (2022). Advanced data integration in banking, financial, and insurance software in the age of COVID-19. Software: Practice and Experience, 52(4), 887–903.

Mamdouh Farghaly, H., Shams, M. Y., & Abd El-Hafeez, T. (2023). Hepatitis C Virus prediction based on machine learning framework: a real-world case study in Egypt. Knowledge and Information Systems, 65(6), 2595–2617.

Maulidi, A., & Ansell, J. (2021). Tackling practical issues in fraud control: a practice-based study. Journal of Financial Crime, 28(2), 493–512.

Mekterovi?, I., Karan, M., Pintar, D., & Brki?, L. (2021). Credit card fraud detection in card-not-present transactions: Where to invest? Applied Sciences, 11(15), 6766.

Mienye, I. D., & Sun, Y. (2022). A survey of ensemble learning: Concepts, algorithms, applications, and prospects. IEEE Access, 10, 99129–99149.

Nikiforov?, L. (2022). Use of innovative information technology in e-commerce and digital economy. Innovation and Sustainability.? 1: 65--71.

Rong, G., Alu, S., Li, K., Su, Y., Zhang, J., Zhang, Y., & Li, T. (2020). Rainfall induced landslide susceptibility mapping based on Bayesian optimized random forest and gradient boosting decision tree models—A case study of Shuicheng County, China. Water, 12(11), 3066.

Sánchez-Aguayo, M., Urquiza-Aguiar, L., & Estrada-Jiménez, J. (2021). Fraud detection using the fraud triangle theory and data mining techniques: A literature review. Computers, 10(10), 121.

Sood, P., & Bhushan, P. (2020). A structured review and theme analysis of financial frauds in the banking industry. Asian Journal of Business Ethics, 9, 305–321.

Taherdoost, H. (2021). A review on risk management in information systems: Risk policy, control and fraud detection. Electronics, 10(24), 3065.

Teles, G., Rodrigues, J. J. P. C., Rabelo, R. A. L., & Kozlov, S. A. (2021). Comparative study of support vector machines and random forests machine learning algorithms on credit operation. Software: Practice and Experience, 51(12), 2492–2500.

Wang, L., Zhang, Z., Zhang, X., Zhou, X., Wang, P., & Zheng, Y. (2021). A Deep-forest based approach for detecting fraudulent online transaction. In Advances in computers (Vol. 120, pp. 1–38). Elsevier.

Zhu, X., Ao, X., Qin, Z., Chang, Y., Liu, Y., He, Q., & Li, J. (2021). Intelligent financial fraud detection practices in post-pandemic era. The Innovation, 2(4).

Downloads

ARTICLE Published HISTORY

Submitted Date: 2024-05-01
Accepted Date: 2024-05-03
Published Date: 2024-05-28

How to Cite

Airlangga, G. (2024). Evaluating the Efficacy of Machine Learning Models in Credit Card Fraud Detection. Journal of Computer Networks, Architecture and High Performance Computing, 6(2), 829-837. https://doi.org/10.47709/cnahpc.v6i2.3814