journal article Open Access Oct 12, 2023

An Exploration of Clustering Algorithms for Customer Segmentation in the UK Retail Market

Analytics Vol. 2 No. 4 pp. 809-823 · MDPI AG
View at Publisher Save 10.3390/analytics2040042
Abstract
Recently, peoples’ awareness of online purchases has significantly risen. This has given rise to online retail platforms and the need for a better understanding of customer purchasing behaviour. Retail companies are pressed with the need to deal with a high volume of customer purchases, which requires sophisticated approaches to perform more accurate and efficient customer segmentation. Customer segmentation is a marketing analytical tool that aids customer-centric service and thus enhances profitability. In this paper, we aim to develop a customer segmentation model to improve decision-making processes in the retail market industry. To achieve this, we employed a UK-based online retail dataset obtained from the UCI machine learning repository. The retail dataset consists of 541,909 customer records and eight features. Our study adopted the RFM (recency, frequency, and monetary) framework to quantify customer values. Thereafter, we compared several state-of-the-art (SOTA) clustering algorithms, namely, K-means clustering, the Gaussian mixture model (GMM), density-based spatial clustering of applications with noise (DBSCAN), agglomerative clustering, and balanced iterative reducing and clustering using hierarchies (BIRCH). The results showed the GMM outperformed other approaches, with a Silhouette Score of 0.80.
Topics

No keywords indexed for this article. Browse by subject →

References
28
[1]
Lekhwar, S., Yadav, S., and Singh, A. (2019). Information and Communication Technology for Intelligent Systems: Proceedings of ICTIS 2018, Volume 2, Springer.
[2]
Gwozdz, W., Steensen Nielsen, K., and Müller, T. (2017). An environmental perspective on clothing consumption: Consumer segments and their behavioral patterns. Sustainability, 9. 10.3390/su9050762
[3]
An "Customer segmentation using online platforms: Isolating behavioral and demographic segments for persona creation via aggregated user data" Soc. Netw. Anal. Min. (2018) 10.1007/s13278-018-0531-0
[4]
Fotaki "Exploring big data opportunities for online customer segmentation" Int. J. Bus. Intell. Res. (IJBIR) (2014) 10.4018/ijbir.2014070105
[5]
Hicham "Analysis of Unsupervised Machine Learning Techniques for an Efficient Customer Segmentation using Clustering Ensemble and Spectral Clustering" Int. J. Adv. Comput. Sci. Appl. (2022)
[6]
Turkmen "Customer Segmentation with Machine Learning for Online Retail Industry" Eur. J. Soc. Behav. Sci. (2022)
[7]
Ramanathan "Impact of customer loyalty and service operations on customer behaviour and firm performance: Empirical evidence from UK retail sector" Prod. Plan. Control (2017) 10.1080/09537287.2017.1309707
[8]
Li "A review of industrial big data for decision making in intelligent manufacturing" Eng. Sci. Technol. Int. J. (2022)
[9]
Arunachalam "Benefit-based consumer segmentation and performance evaluation of clustering approaches: An evidence of data-driven decision-making" Expert Syst. Appl. (2018) 10.1016/j.eswa.2018.03.007
[10]
Oussous "Big Data technologies: A survey" J. King Saud Univ.-Comput. Inf. Sci. (2018)
[11]
Jin, D.H., and Kim, H.J. (2018). Integrated understanding of big data, big data analysis, and business intelligence: A case study of logistics. Sustainability, 10. 10.3390/su10103778
[12]
Jayakrishnan "Understanding big data analytics (BDA) and business intelligence (BI) towards establishing organisational performance diagnostics framework" Int. J. Recent Technol. Eng. (2019)
[13]
Mathew, A., Scholar, P.G., and Jobin, T.J. (2021, January 17). Role of Big Data Analysis and Machine Learning in Ecommerce-Customer Segmentation. Proceedings of the National Conference on Emerging Computer Applications (NCECA), Online.
[14]
Seyedan "Predictive big data analytics for supply chain demand forecasting: Methods, applications, and research opportunities" J. Big Data (2020) 10.1186/s40537-020-00329-2
[15]
Ushakova "Big data to the rescue? Challenges in analysing granular household electricity consumption in the United Kingdom" Energy Res. Soc. Sci. (2020) 10.1016/j.erss.2020.101428
[16]
Fontanini, A.D., and Abreu, J. (2018, January 5–10). A data-driven BIRCH clustering method for extracting typical load profiles for big data. Proceedings of the 2018 IEEE Power & Energy Society General Meeting (PESGM), Portland, OR, USA. 10.1109/pesgm.2018.8586542
[17]
Lorbeer "Variations on the clustering algorithm BIRCH" Big Data Res. (2018) 10.1016/j.bdr.2017.09.002
[18]
Firdaus "Development of bank’s customer segmentation model based on rfm+ b approach" Int. J. Innov. Comput. Inf. Cont. (2021)
[19]
Hossain, A.S. (2017, January 7–9). Customer segmentation using centroid based and density based clustering algorithms. Proceedings of the 2017 3rd International Conference on Electrical Information and Communication Technology (EICT), Khulna, Bangladesh. 10.1109/eict.2017.8275249
[20]
Punhani, R., Arora, V.S., Sabitha, S., and Shukla, V.K. (2021, January 17–18). Application of clustering algorithm for effective customer segmentation in E-commerce. Proceedings of the 2021 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE), Dubai, United Arab Emirates. 10.1109/iccike51210.2021.9410713
[21]
Diamantaras, K.I., and Kung, S.Y. (1996). Principal Component Neural Networks: Theory and Applications, John Wiley & Sons, Inc.
[22]
Weingessel "Local PCA algorithms" IEEE Trans. Neural Netw. (2000) 10.1109/72.883408
[23]
Ogunleye, B., Maswera, T., Hirsch, L., Gaudoin, J., and Brunsdon, T. (2023). Comparison of topic modelling approaches in the banking context. Appl. Sci., 13. 10.3390/app13020797
[24]
Least squares quantization in PCM

S. Lloyd

IEEE Transactions on Information Theory 1982 10.1109/tit.1982.1056489
[25]
Zhang "Gaussian mixture model clustering with incomplete data" ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) (2021)
[26]
Müllner, D. (2011). Modern hierarchical, agglomerative clustering algorithms. arXiv.
[27]
Shirole "Customer segmentation using rfm model and k-means clustering" Int. J. Sci. Res. Sci. Technol. (2021) 10.32628/ijsrst2183118
[28]
Kansal, T., Bahuguna, S., Singh, V., and Choudhury, T. (2018, January 21–22). December Customer segmentation using K-means clustering. Proceedings of the 2018 International Conference on Computational Techniques, Electronics and Mechanical Systems (CTEMS), Belgaum, India. 10.1109/ctems.2018.8769171
Metrics
59
Citations
28
References
Details
Published
Oct 12, 2023
Vol/Issue
2(4)
Pages
809-823
License
View
Cite This Article
Jeen Mary John, Olamilekan Shobayo, Bayode Ogunleye (2023). An Exploration of Clustering Algorithms for Customer Segmentation in the UK Retail Market. Analytics, 2(4), 809-823. https://doi.org/10.3390/analytics2040042