Abstract
Every process is digitized in the current society. Transfer of money from one account holder to another has become possible in seconds because of advanced technologies in information processing. Not only this in all sectors like railways, insurance, health sector, fashion technology, education sector, sales and business sectors, and advertisement sectors every firm digitized its operations. One such sector is banking where every individual based on his or her financial status will be considered for crediting loan, and credit card etc. If the credit score of the loan availing person is high banks will be ready to provide him with the loan but the availing person can opt for any one of the banks on his or her own willing. Such scenario happens in credit card churn prediction also. Hence the banks should take healthy measures to retain the existing credit card holders without any churn. Withholding existing customers of a firm plays an important role to increase the overall revenue of the firm and retains the good name of the firm in competitive market. Hence every organization takes key measures to withhold existing customers using customer management models. Because customer retention is a crucial task as it reduces the time, money and workforce needed for adding new customers to the firm. Customers retention technique in credit card churn prediction (C3P) was done using only supervised classification techniques. But it could not end with better results. So, through many proven hybrid classification techniques we can bring better accuracy in C3P. Also C3P lags in highly efficient techniques like rough set theory. Hence in this work initially we perform data processing techniques and in second stage we propose modified rough K-means algorithm used for clustering credit card holders and in next stage hold-out method divides the cluster data into testing and training clusters. At last classification is performed using various algorithms like support vector machine, random forest, decision tree, K-nearest neighbor, and Naive Bayes. Finally we evaluate the work using precision, recall (sensitivity), specification, accuracy, and misclassification error.
Similar content being viewed by others
References
Yeh, I.C., et al.: The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert Syst. Appl. 36, 2473–2480 (2009)
Oyeniyi, A.O., Adeyemo, A.B., Oyeniyi, A.O., Adeyemo, A.B.: Customer churn analysis in banking sector using data mining techniques. Afr J. of Comp and ICTs. 8(3), 165–174 (2015)
Farquad, M. A., et al.: Data mining using rules extracted from SVM: an application to churn prediction in bank credit cards. In: RSFDGrC ’09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. pp. 390–397 (2009)
Nie, G., et al.: Credit card churns forecasting by logistic regression and decision tree. Expert Syst. Appl. 38, 15273–15285 (2011)
Wang, N., Niu, D. X.: Credit card customer churn prediction based on the RST and LS-SVM. IEEE conference on Service and systems and Service Management (2009)
Zhao, J., et al.: Bank customer churn prediction based on support vector machine: taking a commercial bank’s VIP customer churn. IEEE Conference on Wireless Communication, Networking and Mobile computing (2008)
Zhao, Y., et al.: Customer churn prediction using improved one-class support vector machine. DMA, LNAI 3584, pp. 300–306. Springer-Verlag Berlin Heidelberg, Berlin (2005)
Shao, H., et al.: Construction of Bayesian classifiers with GA for predicting customer retention. IEEE Conference on Natural Computation (2005)
Huang, Ying, Huang, Y., Kechadi, T.: An effective hybrid learning system for telecommunication churns Prediction. J. Expert Syst. Appl. 40, 5635–5647 (2013)
Bose, I., Chen, X.: Hybrid models using unsupervised clustering for prediction of customer churn. J. Organ. Comput. and Electron. Commer. 19, 133–151 (2009)
Huang, B.Q., Kechadi, M.T., Buckley, B.: Customer churns prediction in telecommunications. J. Expert Syst. Appl. 39(1), 1414–1425 (2012)
Khashei, M., Hamadani, A.Z., Bijari, M.: A novel hybrid classification model of artificial neural networks and multiple linear regression models. J. Expert Syst. Appl. 39(3), 2606–2620 (2012)
Tsai, C.F., Lu, Y.H.: Customer churns prediction by hybrid neural networks. J. Expert Syst. Appl. 36(10), 12547–12553 (2009)
Yeshwanth, V, Raj, V.V., Saravanan, M.: Evolutionary churn prediction in mobile networks using hybrid learning. In: Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, (FLAIRS), Palm Beach, Florida, USA, May 18–20. AAAI Press (2011)
Hung, S.Y., et al.: Applying data mining to telecom churn management. J. Expert Syst. Appl. 31(3), 515–524 (2006)
Zhang, Y., et al .: A hybrid KNN-LR classifier and its application in customer churn prediction IEEE conference on system, man and cybernetics (2007)
Lee et al,.: Customer churn prediction by hybrid model. Advance data mining and application Springer (2006)
Kawale, J., Pal, A., Srivastava, J.: Churn prediction in MMORPGs: a social influence based approach. Proc. 2009 Int. Conf. Comput. Sci. Eng. 4, 423–428 (2009)
Suznjevic, M., Stupar, I., Matijasevic, M.: MMORPG player behavior model based on player action categories. Proceedings of the 10th Annual Workshop on Network and Systems Support for Games. IEEE Press (2011)
Chen, K. T., Lei, C. L., Network game design: hints and implications of player interaction. In: Proceedings of 5th ACM SIGCOMM Workshop on Network and System Support for Games. pp. 1–9 ( 2006)
Soeini, R.A., Rodpysh, K.V.: Applying data mining to insurance customer churn management. Int. Proc. Comput. Sci. Inf. Technol. 30, 82–92 (2012)
Verbeke, W., Martens, D., Baesens, B.: Social network analysis for customer churn prediction. Appl. Soft Comput. 14, 431–446 (2014)
Malik, Z.K., Hussain, A., Jonathan, W.: An online generalized eigenvalue version of Laplacian Eigenmaps for visual big data. Neurocomputing 173, 127–136 (2016)
Anil Kumar, D., Ravi, V.: Predicting credit card customer churn in banks using data mining. Int. J. Data Anal. Tech. Strateg. 1(1), 4–28 (2008)
Nie, G., et al.: Finding the hidden pattern of credit card holders churn: a case of China, ICCS 2009 part II. LNCS 5545(1), 561–569 (2009)
Wang, G., et al.: Predicting credit card holder churn in banks of China using data mining and MCDM. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (2010)
Chiang, D.A., et al.: Goal-oriented sequential pattern for network banking churn analysis. Expert Syst. Appl. 25, 293–302 (2003)
Jinbo, S., et al.: The application of AdaBoost in customer churn prediction. IEEE Conference on Service and Systems and Service Management (2007)
Mutanen, T., et al.: Customer churn prediction: a case study in banking. ECML/PKDD 2006 Workshop on Practical Data Mining: Applications, Experiences and Challenges (2010)
Olle, G., Hybrid, A.: Churn prediction model in mobile telecommunication industry. Int. J. e-Educ. e-Bus. e-Manag. Learn. 4(1), 55–62 (2014)
Peters, G.: Some refinements of rough k-means clustering. Pattern Recognit. 39, 1481–1491 (2006)
Kim, K., Jun, C.-H., Lee, J.: Improved churn prediction in telecommunication industry by analyzing a large network. Expert Syst. Appl. 41(15), 6575–6584 (2014)
Dataset. https://archive.ics.uci.edu/ml/datasets/default+of+credit+card+clients
Lingras, P., West, C.: Interval set clustering of web users with rough k-means. J. Intell. Inform. Syst. 23, 5–16 (2004)
Kumar, P.: Comparative study of K-means, pam and rough K-means algorithms using cancer datasets. 2009 International Symposium on Computing, Communication, and Control (ISCCC 2009)
Lingras, P., West, C.: Interval set clustering of web users with rough kmeans, Technical Report 2002–002. St. Marys University, Halifax, Canada, Department of Mathematics and Computer Science (2002)
Parmar, D., et al.: MMR: an algorithm for clustering categorical data using rough set theory. Data Knowl. Eng. 63(2007), 879–893 (2007)
Amin, A., et al.: Churn prediction in telecommunication industry using rough set approach. New Trends in Computational Collective Intelligence, pp. 83–95. Springer International Publishing, Berlin (2015)
Amin, A., et al.: Customer churn prediction in the telecommunication sector using a rough set approach. Neurocomputing 237, 242–254 (2017)
Data Source Link. http://www.sgi.com/tech/mlc/db/
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Rajamohamed, R., Manokaran, J. Improved credit card churn prediction based on rough clustering and supervised learning techniques. Cluster Comput 21, 65–77 (2018). https://doi.org/10.1007/s10586-017-0933-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-017-0933-1