Comparing the Performance of Multivariate Hotelling’s T2 Control Chart and Naive Bayes Classifier for Credit Card Fraud Detection

Ichwanul kahfi Prasetya, Devi Putri Isnawarty, Abdullah Fahmi, Salman Alfarizi Pradana Andikaputra, Wibawati Wibawati

Abstract


Credit card is a transaction tool using a card which is a substitute for legitimate cash in transactions. The use of computer technology is needed for various kinds of electronic transactions. In the world of technology, the term machine learning is not new and technological developments are increasingly rapid in recent years. Statistical process control method (SPC) is one of the measuring instruments used to improve the performance of public services. Hotelling T^2 control chart is a method in SPC that can be used to control the process. Methods that are widely used in the detection and classification of documents one of them is Naive Bayes Classifier (NBC) which has several advantages, among others, simple, fast and high accuracy. Those two methods will be used to detecting o2utlier of this dataset. The study used the credit card fraud registry with some PCA as independent variables. The size of fraud transaction is very small which represented only 0.172% of the 284,807 transactions. This research will use Area Under Curve (AUC) as the performance goodness test. A comparison of the accuracy of NBC and Hotelling's T2 predictions shows that the performance of the T2 Hotelling method is better in detecting outliers than the NBC method

Keywords


statistics, data science, classification

Full Text:

PDF

References


M. Fauzan, "Gaya Hidup Nasabah dan Keputusan Penggunaan Kartu Kredit," Esensi: jurnal Bisnis dan Manajemen, vol. 7, no. 2, pp. 181-192, 2017.

B. Indonesia, "Edukasi Kartu kredit," 2020. [Online]. Available: https://www.bi.go.id/id/edukasi-perlindungan-konsumen/edukasi/produkdan-jasa-sp/kartu kredit/Contents/Default.aspx. [Accessed 10 October 2022].

K. Chaudhary, J. Yadav and B. Mallick, "A review of Fraud Detection Techniques: Credit Card," Internal Journal of Computer Applications, vol. 45, no. 1, pp. 39-44, 2012.

Nurhayati, I. Soekarno and M. Cahyono, "A Study of Hold-Out and K-Fold Cross Validation for Accuracy of Groundwater Modeling in Tidal Lowland Reclamation Using Extreme Learning Machine," in 2nd International Conference on Technologu, Informatics, Management, Engineering & Environment, 2014.

R. Kohavi, G. John, R. Long, D. Manley and K. Peger, "MLC++: A machine learning library in C++, in Tools with Articial Intelligence," IEEE Society Press, 1994740-743.

J. Han, M. Kamber and J. Pei, Data Mining Concepts, USA: Morgan Kaufmann, 2012.

M. Erpianti, "ANALISIS PENGENDALIAN KUALITAS DATA DENGAN METODE T2 HOTELLING INDIVIDUAL," Buletin Ilmiah Math, Stat, dan Terapannya, vol. 09, no. 03, pp. 431-436, 2020.

D. Spiegelhalter and K. Rice, "Bayesian Statistics," Scholarpedia, vol. 4, no. 8, p. 5230, 2006.

R. Feldman and J. Sanger, The Text Mining Handbook:Advanced Approaches in Analyzing Unstructured Data, New York: Cambridge University Press, 2007.

M. Bekkar, H. K. Djemma and T. Alitouche, "Evaluation Measure for Models Assesment over Imbalanced Data Sets," Journal of Information Engineering and Applications, vol. 3, pp. 27-38, 2013.

N. Sulistiawanti, M. Ahsan, and H. Khusna, “Multivariate Exponentially Weighted Moving Average (MEWMA) and Multivariate Exponentially Weighted Moving Variance (MEWMV) Chart Based on Residual XGBoost Regression for Monitoring Water Quality.,” Eng. Lett., vol. 31, no. 3, 2023.

H. Khusna, M. Mashuri, S. Suhartono, D. D. Prastyo, and M. Ahsan, “Multioutput least square SVR based multivariate EWMA control chart: The performance evaluation and application,” Cogent Eng., Oct. 2018, doi: 10.1080/23311916.2018.1531456.

H. Khusna, M. Mashuri, Suhartono, D. D. Prastyo, and M. Ahsan, “Multioutput Least Square SVR Based Multivariate EWMA Control Chart,” J. Phys. Conf. Ser., vol. 1028, no. 1, p. 12221, 2018.




DOI: http://dx.doi.org/10.12962/j27213862.v7i1.18755

Refbacks

  • There are currently no refbacks.




Creative Commons License
Inferensi by Department of Statistics ITS is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Based on a work at https://iptek.its.ac.id/index.php/inferensi.

ISSN:  0216-308X

e-ISSN: 2721-3862

Web
Analytics Made Easy - StatCounter View My Stats