Bank Customer Segmentation Model Using Machine Learning

Authors

  • Vira Bunga Tiara Universitas Buana Perjuangan Karawang
  • Amril Mutoi Siregar Buana Perjuangan University https://orcid.org/0000-0001-8746-3283
  • Dwi Sulistya Kusumaningrum Kusumaningrum Buana Perjuangan University
  • Tatang Rohana Buana Perjuangan University

DOI:

https://doi.org/10.23887/janapati.v13i1.75233

Keywords:

Classification, Deposit Bank, Machine Learning, PCA, Supervised Algorithm

Abstract

Banks generally carry out marketing strategies by offering deposit products directly to customers. However, this method is less effective because it requires individualized communication without considering the customer's interest in the product offered. Therefore, this research aims to categorize the classification of bank customers into Yes and No. This research uses a dataset of bank deposits taken from KTM. This research uses a bank deposit dataset taken from Kaggle, the data consists of 11162 rows with 17 attributes.  PCA technique was used for feature selection which was optimized by reducing the dimensionality of the dataset before modeling. It was found that the best model accuracy was SVM RBF kernel with C parameters achieving 80.51% accuracy and ANN 80.78%, but ANN showed a higher ROC graph than SVM because ANN performance results were faster than SVM. Thus, the overall performance measurement of ANN is much better.

Author Biographies

Amril Mutoi Siregar, Buana Perjuangan University

Department of Informatics, Faculty of Computer Science, Buana Perjuangan University

Dwi Sulistya Kusumaningrum Kusumaningrum, Buana Perjuangan University

Department of Informatics, Faculty of Computer Science, Buana Perjuangan University

Tatang Rohana, Buana Perjuangan University

Department of Informatics, Faculty of Computer Science, Buana Perjuangan University

References

F. Izhari, “Teknik Machine Learning untuk Bank Marketing Dataset,” Semin. Nas. Inform., 2021, [Online]. Available: https://www.ejournal.pelitaindonesia.ac.id/ojs32/index.php/SENATIKA/article/view/1187

A. D. R. Prabowo and M. Muljono, “Prediksi Nasabah Yang Berpotensi Membuka Simpanan Deposito Menggunakan Naive Bayes Berbasis Particle Swarm Optimization,” Techno.Com, vol. 17, no. 2, pp. 208–219, 2018, doi: 10.33633/tc.v17i2.1648.

K. N. Abd Halim*, A. S. Mohd Jaya, and A. F. A. Fadzil, “Data Pre-Processing Algorithm for Neural Network Binary Classification Model in Bank Tele-Marketing,” Int. J. Innov. Technol. Explor. Eng., vol. 9, no. 3, pp. 272–277, 2020, doi: 10.35940/ijitee.c8472.019320.

M. A. Sheikh, A. K. Goel, and T. Kumar, “An Approach for Prediction of Loan Approval using Machine Learning Algorithm,” Proc. Int. Conf. Electron. Sustain. Commun. Syst. ICESC 2020, no. Icesc, pp. 490–494, 2020, doi: 10.1109/ICESC48915.2020.9155614.

D. Wang, “Research on bank marketing behavior based on machine learning,” ACM Int. Conf. Proceeding Ser., pp. 150–154, 2020, doi: 10.1145/3421766.3421800.

E. Villamosm, I. Kar, and D. Tansz, “Machine Learning Based Customer Decision Support,” pp. 1196–1201, 2020.

M. Al Hammadi, “Identifying Prospective Clients for Long-Term Bank Deposit,” 2022, [Online]. Available: https://scholarworks.rit.edu/theses

A. N. Puteri, A. Arizal, and A. D. Achmad, “Feature Selection Correlation-Based pada Prediksi Nasabah Bank Telemarketing untuk Deposito,” MATRIK J. Manajemen, Tek. Inform. dan Rekayasa Komput., vol. 20, no. 2, pp. 335–342, 2021, doi: 10.30812/matrik.v20i2.1183.

M. Rasikh, A. Riyyasy, W. N. Aghniya, and H. Tantyoko, “Penerapan Algoritma Machine Learning Untuk Memprediksi Term Deposit Nasabah Perbankan,” vol. 8798, 2023.

F. Aziz and B. L. E. Panggabean, “Klasifikasi Nasabah Potensial menggunakan Algoritma Ensemble Least Square Support Vector Machine dengan AdaBoost,” J. Inform. J. Pengemb. IT, vol. 8, no. 3, pp. 269–274, 2023, doi: 10.30591/jpit.v8i3.5675.

Z. Song, “A Data Mining Based Fraud Detection Hybrid Algorithm in E-bank,” Proc. - 2020 Int. Conf. Big Data, Artif. Intell. Internet Things Eng. ICBAIE 2020, pp. 44–47, 2020, doi: 10.1109/ICBAIE49996.2020.00016.

A. F. R. N. L. N. S. Z. N. Taufik Hidayatulloh 1, “Algoritma C4.5 Untuk Menentukan Kelayakan Pemberian Kredit,” J. Larik, vol. Vol.2No.2, no. 2, pp. 1–11, 2022.

L. Liu, “A Self-Learning BP Neural Network Assessment Algorithm for Credit Risk of Commercial Bank,” Wirel. Commun. Mob. Comput., vol. 2022, 2022, doi: 10.1155/2022/9650934.

K. Alkhatib and S. Abualigah, “Predictive Model for Cutting Customers Migration from banks: Based on machine learning classification algorithms,” 2020 11th Int. Conf. Inf. Commun. Syst. ICICS 2020, pp. 303–307, 2020, doi: 10.1109/ICICS49469.2020.239544.

M. Z. Khan et al., “The Performance Analysis of Machine Learning Algorithms for Credit Card Fraud Detection,” Int. J. online Biomed. Eng., vol. 19, no. 3, pp. 82–98, 2023, doi: 10.3991/ijoe.v19i03.35331.

A. Abdulhafedh, “Incorporating K-means, Hierarchical Clustering and PCA in Customer Segmentation,” J. City Dev., vol. 3, no. 1, pp. 12–30, 2021, doi: 10.12691/jcd-3-1-3.

B. Adamyk, A. Skirka, K. Snihur, and O. Adamyk, “Analysis of Trust in Ukrainian banks based on Machine Learning Algorithms,” 2019 9th Int. Conf. Adv. Comput. Inf. Technol. ACIT 2019 - Proc., pp. 234–239, 2019, doi: 10.1109/ACITT.2019.8779974.

P. Hemalatha and G. M. Amalanathan, “A Hybrid Classification Approach for Customer Churn Prediction using Supervised Learning Methods: Banking Sector,” Proc. - Int. Conf. Vis. Towar. Emerg. Trends Commun. Networking, ViTECoN 2019, pp. 1–6, 2019, doi: 10.1109/ViTECoN.2019.8899692.

M. A. Rahim, M. Mushafiq, S. Khan, and Z. A. Arain, “RFM-based repurchase behavior for customer classification and segmentation,” J. Retail. Consum. Serv., vol. 61, no. September 2020, p. 102566, 2021, doi: 10.1016/j.jretconser.2021.102566.

V. Djurisic, L. Kascelan, S. Rogic, and B. Melovic, “Bank CRM Optimization Using Predictive Classification Based on the Support Vector Machine Method,” Appl. Artif. Intell., vol. 34, no. 12, pp. 941–955, 2020, doi: 10.1080/08839514.2020.1790248.

M. H. Effendy, D. Anggraeni, Y. S. Dewi, and A. F. Hadi, “ Classification of Bank Deposit Using Naïve Bayes Classifier (NBC) and K –Nearest Neighbor ( K -NN) ,” Proc. Int. Conf. Math. Geom. Stat. Comput. (IC-MaGeStiC 2021), vol. 96, pp. 163–166, 2022, doi: 10.2991/acsr.k.220202.031.

F. Itoo, Meenakshi, and S. Singh, “Comparison and analysis of logistic regression, Naïve Bayes and KNN machine learning algorithms for credit card fraud detection,” Int. J. Inf. Technol., vol. 13, no. 4, pp. 1503–1511, 2021, doi: 10.1007/s41870-020-00430-y.

A. Hidayanti, A. M. Siregar, S. Arum, P. Lestari, and Y. Cahyana, “Model Analisis Kasus Covid-19 di Indonesia Menggunakan,” J. Pengkaj. dan Penerapan Tek. Inform., vol. 15, no. 1, pp. 91–101, 2022.

R. Sistem, “JURNAL RESTI Comparison of the Accuracy of Drug User Classification Models Using,” vol. 5, no. 158, pp. 1348–1353, 2023.

I. E. Tsolas, V. Charles, and T. Gherman, “Supporting better practice benchmarking: A DEA-ANN approach to bank branch performance assessment,” Expert Syst. Appl., vol. 160, p. 113599, 2020, doi: 10.1016/j.eswa.2020.113599.

N. Ghatasheh, H. Faris, I. AlTaharwa, Y. Harb, and A. Harb, “Business analytics in telemarketing: Cost-sensitive analysis of bank campaigns using artificial neural networks,” Appl. Sci., vol. 10, no. 7, pp. 8–13, 2020, doi: 10.3390/app10072581.

Y. Deng, D. Li, L. Yang, J. Tang, and J. Zhao, “Analysis and prediction of bank user churn based on ensemble learning algorithm,” Proc. 2021 IEEE Int. Conf. Power Electron. Comput. Appl. ICPECA 2021, pp. 288–291, 2021, doi: 10.1109/ICPECA51329.2021.9362520.

P. Appiahene, Y. M. Missah, and U. Najim, “Predicting Bank Operational Efficiency Using Machine Learning Algorithm: Comparative Study of Decision Tree, Random Forest, and Neural Networks,” Adv. Fuzzy Syst., vol. 2020, 2020, doi: 10.1155/2020/8581202.

C. Chen, L. Geng, and S. Zhou, “Design and implementation of bank CRM system based on decision tree algorithm,” Neural Comput. Appl., vol. 33, no. 14, pp. 8237–8247, 2021, doi: 10.1007/s00521-020-04959-8.

D. Boughaci and A. A. K. Alkhawaldeh, “Appropriate machine learning techniques for credit scoring and bankruptcy prediction in banking and finance: A comparative study,” Risk Decis. Anal., vol. 8, no. 1–2, pp. 15–24, 2020, doi: 10.3233/RDA-180051.

Downloads

Published

2024-03-31

How to Cite

Bunga Tiara, V., Siregar, A. M., Kusumaningrum, D. S. K., & Rohana, T. (2024). Bank Customer Segmentation Model Using Machine Learning. Jurnal Nasional Pendidikan Teknik Informatika : JANAPATI, 13(1), 66–78. https://doi.org/10.23887/janapati.v13i1.75233

Issue

Section

Articles