A churn prediction model for prepaid customers in telecom using fuzzy classifiers

Published on Mar 29, 2017 in Telecommunication Systems (1.734)
· DOI: 10.1007/S11235-017-0310-7
Muhammad Azeem (estimated H-index: 3), Muhammad Usman (estimated H-index: 16), A.C.M. Fong (estimated H-index: 8; WMU: Western Michigan University)
Abstract
The incredible growth of telecom data and fierce competition among telecommunication operators for customer retention demand continuous improvements, both strategic and analytical, in current customer relationship management (CRM) systems. One of the key objectives of a typical CRM system is to classify and predict a group of potential churners from a large set of customers, in order to devise profitable and targeted retention campaigns for keeping a long-term relationship with valued customers. To achieve this objective, several churn prediction models have been proposed in the past for the accurate identification of customers who are prone to churn. However, these previously proposed models suffer from a number of limitations that place strong barriers to their direct applicability for accurate prediction. Firstly, the feature selection methods adopted in the majority of past work neglected the information-rich variables present in call detail records for model development. Secondly, selection of important features was done through statistical methods only. Although statistical methods have been applied successfully in diverse domains, these methods alone, without the augmentation of domain knowledge, tend to yield erroneous results. Thirdly, the previous models have been validated mainly with benchmark datasets, which do not provide a true representation of real-world telecom data containing noise and a large number of missing values. Fourthly, the evaluation measures used in the past neglected the True Positive (TP) rate, which highlights the ability of a model to correctly classify the percentage of churners as compared to non-churners. Finally, the classifiers used in the previous models completely neglected fuzzy classification methods, which perform reasonably well for data sets with noise.
In this paper, a fuzzy-based churn prediction model is proposed and validated using real data from a telecom company in South Asia. A number of predominant classifiers, namely Neural Network, Linear regression, C4.5, SVM, AdaBoost, Gradient Boosting and Random Forest, have been compared with fuzzy classifiers to highlight the superiority of fuzzy classifiers in accurately predicting the set of churners.
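The abstract's point about the True Positive (TP) rate can be illustrated with a minimal sketch. The toy labels, the two sets of predictions, and the function names below are hypothetical and are not taken from the paper. The sketch only shows why overall accuracy can hide how few churners a model actually catches when churners are a minority.

```python
def true_positive_rate(y_true, y_pred, positive=1):
    """Fraction of actual positives (churners) that were predicted positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    if tp + fn == 0:
        raise ValueError("y_true contains no positive examples")
    return tp / (tp + fn)


def accuracy(y_true, y_pred):
    """Fraction of all customers whose label was predicted correctly."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


# Hypothetical toy data: 10 customers, of whom 2 are churners (label 1).
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# Model A (assumed): predicts that nobody churns.
always_stay = [0] * 10

# Model B (assumed): catches one churner and raises one false alarm.
one_hit = [1, 0, 0, 0, 0, 0, 0, 0, 1, 0]

for name, y_pred in [("always_stay", always_stay), ("one_hit", one_hit)]:
    print(name, accuracy(y_true, y_pred), true_positive_rate(y_true, y_pred))
```

Both hypothetical models score the same accuracy on this toy data, yet only the second one identifies any churner, which is the gap the TP rate is meant to expose.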