Credit Card Fraud Identification using Logistic Regression and Random Forest
DOI:
https://doi.org/10.31185/wjcms.184Keywords:
Fraud, ML, Random Forest, Transaction, Logistic RegressionAbstract
Fraud is an ancient yet ever-changing profession. Because of the digitization of money, financial transactions, banks, fraudsters now have a limitless number of possibilities to perpetrate crime from behind a screen, anywhere around the world. Fraud has a broad influence, with direct ramifications for business and the economy. It is of great worry to cybercrime organizations as recent studies have proven that ML algorithms may successfully be utilized to identify fraudulent transactions in massive amounts of payment data. Such techniques may identify fraudulent transactions in real time, which human auditors may miss. In this research, we apply supervised ML algorithms to the issue of fraud identification by analyzing simulated financial transaction data that is available to the public. Our aim is to show how supervised ML methods may be utilized to successfully identify data with extreme class disproportion. By way of example, we show how exploratory analysis may be utilized to identify fraudulent from real purchases. We also show that Random Forest outperform Logistic Regression when applied to a clearly distinguished dataset.
References
C. Jiang, J. Song, G. Liu, L. Zheng, and W. Luan, “Credit card fraud detection: A novel approach using aggregation strategy and feedback mechanism,” IEEE Internet of Things Journal, vol. 5, no. 5, pp. 3637–3647, 2018.
R. B. Sulaiman, V. Schetinin, and P. Sant, “Review of machine learning approach on credit card fraud detection,” Human-Centric Intelligent Systems, vol. 2, no. 1-2, pp. 55–68, 2022
A. Karthikeya, Y. B. Sai, S. Hariharan, A. C. Rao, J. D. Jignash, and A. B. Prasad, “Prevention of Cyber Attacks Using Deep Learning,” 2023 9th International Conference on Advanced Computing and Communication Systems (ICACCS), vol. 1, pp. 1332–1336, 2023.
A. Ali, S. Razak, S. H. Othman, T. A. E. Eisa, A. Al-Dhaqm, and M. Nasser, “Financial fraud detection based on machine learning: a systematic literature review,” Applied Sciences, vol. 12, no. 19, pp. 9637–9637, 2022.
X. Lei, U. H. Mohamad, A. Sarlan, M. Shutaywi, Y. I. Daradkeh, and H. O. Mohammed, “Development of an intelligent information system for financial analysis depend on supervised machine learning algorithms,” Information Processing & Management, vol. 59, no. 5, pp. 103036–103036, 2022.
Z. Zhao and T. Bai, “Financial Fraud Detection and Prediction in Listed Companies Using SMOTE and Machine Learning Algorithms,” Entropy, vol. 24, no. 8, pp. 1157–1157, 2022.
H. Ye, L. Xiang, and Y. Gan, “Detecting financial statement fraud using random forest with SMOTE,” IOP Conference Series: Materials Science and Engineering, vol. 612, pp. 52051–52051, 2019.
M. A. Sharma, B. G. Raj, B. Ramamurthy, and R. H. Bhaskar, “Credit Card Fraud Detection Using Deep Learning Based on Auto-Encoder,” ITM Web of Conferences, vol. 50, 2022.
O. Vynokurova, D. Peleshko, O. Bondarenko, V. Ilyasov, V. Serzhantov, and M. Peleshko, “Hybrid machine learning system for solving fraud detection tasks,” 2020 IEEE Third International Conference on Data Stream Mining & Processing (DSMP), pp. 1–5, 2020.
A. K. Rai and R. K. Dwivedi, “Fraud detection in credit card data using unsupervised machine learning based scheme,” 2020 international conference on electronics and sustainable communication systems (ICESC), pp. 421–426, 2020.
S. K. Majhi, S. Bhatachharya, R. Pradhan, and S. Biswal, “Fuzzy clustering using salp swarm algorithm for automobile insurance fraud detection,” Journal of Intelligent & Fuzzy Systems, vol. 36, no. 3, pp. 2333–2344, 2019.
N. Rtayli and N. Enneya, “Selection features and support vector machine for credit card risk identification,” Procedia Manufacturing, vol. 46, pp. 941–948, 2020.
K. Randhawa, “Credit Card Fraud Detection Using AdaBoost and Majority Voting,” IEEE Access, vol. 6, pp. 14277–14284, 2018.
G. Liu, J. Tang, Y. Tian, and J. Wang, “Graph Neural Network for Credit Card Fraud Detection,” 2021 International Conference on Cyber-Physical Social Intelligence (ICCSI), pp. 1–6, 2021.
F. Carcillo, Y. A. L. Borgne, O. Caelen, and G. Bontempi, “Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization,” International Journal of Data Science and Analytics, vol. 5, pp. 285–300, 2018.
T. . Ntnu, “Synthetic Financial Datasets for Fraud Detection.” https://www.kaggle.com/ntnu-testimon/paysim1.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Wang Yundong, Alexander Zhulev, Omar G. Ahmed
This work is licensed under a Creative Commons Attribution 4.0 International License.