ac

Comparative Analysis of Machine Learning Models for Credit Card Fraud Detection in Imbalanced Datasets

Authors

  • Gregorius Airlangga Atma Jaya Catholic University of Indonesia

DOI:

10.47709/cnahpc.v6i2.3816

Keywords:

Credit Card Fraud, Machine Learning, Imbalanced Datasets, Random Forest, Precision

Dimension Badge Record



Abstract

This study presents a comprehensive evaluation of various machine learning models for detecting credit card fraud, emphasizing their performance in handling highly imbalanced datasets. We focused on three models: Logistic Regression, Random Forest, and Multilayer Perceptron (MLP), using a dataset comprising 555,719 transactions, each annotated with 22 attributes. Logistic Regression served as a baseline, Random Forest was evaluated for its high accuracy and low dependency on hyperparameter tuning, and MLP was tested for its capability to identify non-linear patterns. The models were assessed using ROC AUC, Matthews Correlation Coefficient (MCC), and precision-recall curves to determine their effectiveness in distinguishing fraudulent transactions. Results indicated that the Random Forest model outperformed others with a ROC AUC of 0.9868 and an MCC of 0.6638, showing substantial superiority in managing class imbalances and complex data interactions. Logistic Regression, although useful as a benchmark, exhibited limitations with a high number of false positives. MLP showed potential but was prone to a significant false positive rate, suggesting a need for further model refinement. The findings highlight the importance of choosing appropriate models and feature engineering techniques in fraud detection systems and suggest avenues for future research in real-time model deployment and advanced algorithmic strategies

Downloads

Download data is not yet available.
Google Scholar Cite Analysis
Abstract viewed = 93 times

References

Ahmad, I., Khan, S., & Iqbal, S. (2024). Guardians of the vault: unmasking online threats and fortifying e-banking security, a systematic review. Journal of Financial Crime.

Alarfaj, F. K., Malik, I., Khan, H. U., Almusallam, N., Ramzan, M., & Ahmed, M. (2022). Credit card fraud detection using state-of-the-art machine learning and deep learning algorithms. IEEE Access, 10, 39700–39715.

Alawida, M., Omolara, A. E., Abiodun, O. I., & Al-Rajab, M. (2022). A deeper look into cybersecurity issues in the wake of Covid-19: A survey. Journal of King Saud University-Computer and Information Sciences, 34(10), 8176–8206.

Ali, A., Abd Razak, S., Othman, S. H., Eisa, T. A. E., Al-Dhaqm, A., Nasser, M., … Saif, A. (2022). Financial fraud detection based on machine learning: a systematic literature review. Applied Sciences, 12(19), 9637.

Almahmoud, S., Hammo, B., Al-Shboul, B., & Obeid, N. (2022). A hybrid approach for identifying non-human traffic in online digital advertising. Multimedia Tools and Applications, 81(2), 1685–1718.

Alzahrani, R. A., & Aljabri, M. (2022). AI-based techniques for Ad click fraud detection and prevention: Review and research directions. Journal of Sensor and Actuator Networks, 12(1), 4.

Botchey, F. E., Qin, Z., & Hughes-Lartey, K. (2020). Mobile money fraud prediction—a cross-case analysis on the efficiency of support vector machines, gradient boosted decision trees, and na{"i}ve bayes algorithms. Information, 11(8), 383.

Chung, J., & Lee, K. (2023). Credit Card Fraud Detection: An Improved Strategy for High Recall Using KNN, LDA, and Linear Regression. Sensors, 23(18), 7788.

Craja, P., Kim, A., & Lessmann, S. (2020). Deep learning for detecting financial statement fraud. Decision Support Systems, 139, 113421.

Darwish, S. M. (2020). A bio-inspired credit card fraud detection model based on user behavior analysis suitable for business management in electronic banking. Journal of Ambient Intelligence and Humanized Computing, 11(11), 4873–4887.

Hussein, A. S., Khairy, R. S., Najeeb, S. M. M., & Alrikabi, H. T. S. (2021). Credit Card Fraud Detection Using Fuzzy Rough Nearest Neighbor and Sequential Minimal Optimization with Logistic Regression. International Journal of Interactive Mobile Technologies, 15(5).

Kanchana, M., Naresh, R., Deepa, N., Pandiaraja, P., & Stephan, T. (2022). Credit Card Fraud Detection Techniques Under IoT Environment: A Survey. In Transforming Management with AI, Big-Data, and IoT (pp. 141–154). Springer.

Kanu, C., Nnam, M. U., Ugwu, J. N., Achilike, N., Adama, L., Uwajumogu, N., & Obidike, P. (2023). Frauds and forgeries in banking industry in Africa: a content analyses of Nigeria Deposit Insurance Corporation annual crime report. Security Journal, 36(4), 671–692.

Kelvin, K. L. (2023). Credit Card Fraud Prediction.

Kumar, S., Gunjan, V. K., Ansari, M. D., & Pathak, R. (2022). Credit card fraud detection using support vector machine. Proceedings of the 2nd International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications: ICMISC 2021, 27–37.

Kumar, V. P., Pallavi, L., & Prakash, K. B. (2021). Role of recent technologies in cognitive systems. Cognitive Engineering for next Generation Computing: A Practical Analytical Approach, 231–264.

Kumaraswamy, N., Markey, M. K., Ekin, T., Barner, J. C., & Rascati, K. (2022). Healthcare fraud data mining methods: A look back and look ahead. Perspectives in Health Information Management, 19(1).

Liang, S., Wu, H., Zhen, L., Hua, Q., Garg, S., Kaddoum, G., … Yu, K. (2022). Edge YOLO: Real-time intelligent object detection system based on edge-cloud cooperation in autonomous vehicles. IEEE Transactions on Intelligent Transportation Systems, 23(12), 25345–25360.

McIver, S. (2021). Can Generative Adversarial Networks Help Us Fight Financial Fraud?

Merghadi, A., Yunus, A. P., Dou, J., Whiteley, J., ThaiPham, B., Bui, D. T., … Abderrahmane, B. (2020). Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Science Reviews, 207, 103225.

Mondal, I. A., Haque, M. E., Hassan, A.-M., & Shatabda, S. (2021). Handling imbalanced data for credit card fraud detection. 2021 24th International Conference on Computer and Information Technology (ICCIT), 1–6.

Ni, L., Li, J., Xu, H., Wang, X., & Zhang, J. (2023). Fraud feature boosting mechanism and spiral oversampling balancing technique for credit card fraud detection. IEEE Transactions on Computational Social Systems.

Putrevu, J., & Mertzanis, C. (2023). The adoption of digital payments in emerging economies: challenges and policy responses. Digital Policy, Regulation and Governance, (ahead-of-print).

Reddy, S. R. B., Kanagala, P., Ravichandran, P., Pulimamidi, R., Polireddi, N. S. A., & others. (2024). Effective fraud detection in e-commerce: Leveraging machine learning and big data analytics. Measurement: Sensors, 101138.

Ren, Y.-S., Ma, C.-Q., Kong, X.-L., Baltas, K., & Zureigat, Q. (2022). Past, present, and future of the application of machine learning in cryptocurrency research. Research in International Business and Finance, 63, 101799.

Rusia, M. K., & Singh, D. K. (2023). A comprehensive survey on techniques to handle face identity threats: challenges and opportunities. Multimedia Tools and Applications, 82(2), 1669–1748.

Saporta, G., & Maraney, S. (2022). Practical Fraud Prevention. “ O’Reilly Media, Inc.”

Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN Computer Science, 2(3), 160.

Udayakumar, R., Sreekumar, K., Nivedha, C. S., Sivasubramanian, S., & Umadevi, V. (2023). A Predictive Analysis of Autism Spectrum Disorder Using Ensemble Techniques of Machine Learning. 2023 3rd International Conference on Technological Advancements in Computational Sciences (ICTACS), 558–565.

Vasantha, S. V, Thullimilli, S., Ganesh, V. S., Shankar, M. S., Reddy, M. V., & Hariharan, S. (2023). Application of Resampling Techniques on Sepsis Data to Balance Model Performance with Accuracy, AUC, Kappa and MCC. 2023 2nd International Conference on Automation, Computing and Renewable Systems (ICACRS), 1174–1179.

Zhu, X., Ao, X., Qin, Z., Chang, Y., Liu, Y., He, Q., & Li, J. (2021). Intelligent financial fraud detection practices in post-pandemic era. The Innovation, 2(4).

Downloads

ARTICLE Published HISTORY

Submitted Date: 2024-05-02
Accepted Date: 2024-05-03
Published Date: 2024-06-03

How to Cite

Airlangga, G. (2024). Comparative Analysis of Machine Learning Models for Credit Card Fraud Detection in Imbalanced Datasets. Journal of Computer Networks, Architecture and High Performance Computing, 6(2), 858-866. https://doi.org/10.47709/cnahpc.v6i2.3816