Identifying Customer Segments in E-Commerce: A Data-Driven Framework Using Transactional Patterns

Document Type : Research Paper

Authors

1 M.Sc. Student, Department of Industrial Engineering, K. N. Toosi University of Technology (KNTU), Tehran, Iran.

2 Associate Professor, School of Industrial Engineering, College of Engineering, University of Tehran, Tehran, Iran.

3 Associate Professor, Department of Industrial Engineering, K. N. Toosi University of Technology (KNTU), Tehran, Iran.

Abstract

This study proposes a structured and interpretable approach to customer segmentation by enhancing the classical RFM (Recency, Frequency, Monetary) model. The research addresses the common limitation of traditional RFM-based methods, which often overlook behavioral diversity and lack flexibility for practical, data-driven decision-making. The main objective is to develop a segmentation framework that provides actionable insights based on transparent, rule-based logic rather than opaque clustering algorithms. Using transactional data from a leading e-commerce platform in the Netherlands, the methodology applies quartile-based scoring to RFM indicators and maps customers into six distinct behavioral segments. Visual analytics are employed to support interpretation, enabling businesses to tailor engagement strategies for each group. The results demonstrate improved segment differentiation, managerial interpretability, and relevance to real-world applications. The study also highlights the model’s adaptability to other customer-centric industries, with future research directions focused on incorporating machine learning and behavioral enrichment for Customer Lifetime Value (CLV) prediction.

Keywords

Main Subjects


[1] M. Abirami and V. Pattabiraman, "Data mining approach for intelligent customer behavior analysis for a retail store," Proc. 3rd Int. Symp. Big Data Cloud Comput. Challenges (ISBCC), Springer, 2016, pp. 283–291.
[2] M. A. Ardakani, M. H. Karimi-Zarchi, and D. Shishebori, "Analyzing and forecasting of coronavirus time-series data: Performance comparison of machine learning and statistical models," Adv. Ind. Eng., vol. 58, no. 2, 2024, pp. 291–306.
[3] M. Frasquet, M. Ieva, and C. Ziliani, "Online channel adoption in supermarket retailing," J. Retail. Consum. Serv., vol. 59, 2021, Art. no. 102374.
[4] H. Gao, X. Li, and Y. Zhang, "A time-varying Cox model for dynamic churn risk prediction in subscription-based businesses," Expert Syst. Appl., vol. 230, 2024, Art. no. 120698.
[5] M. C. Gül and D. Şen, "Segmenting retail customers using RFM analysis and designing effective push notifications: A data-driven approach," Int. J. Retail Distrib. Manag., vol. 50, no. 3, 2022, pp. 321–339.
[6] S. Gupta, A. Patel, and S. Ray, "Transformer-based churn prediction in subscription-based businesses: A deep learning approach," J. Intell. Syst., vol. 9, no. 1, 2024, pp. 200–219.
[7] Ů. Hamidi, S. J. Sadjadi, A. B. Naeini, and S. R. Moosavi Tabatabaei, "An integrated production-marketing planning model with cubic production cost function and imperfect production process," J. Ind. Syst. Eng., vol. 10, Special Issue, 2017, pp. 91–108.
[8] M. Hernant and S. Rosengren, "The impact of adding an online channel to traditional retailing: A study of consumer behavior in a multichannel context," J. Retail. Consum. Serv., vol. 36, 2017, pp. 78–86.
[9] S. Khan, L. Zhu, X. Yu, Z. Zhang, M. A. Rahim, M. Khan, X. Du, and M. Guizani, "Accountable credential management system for vehicular communication," Veh. Commun., vol. 25, 2020, Art. no. 100279.
[10] A. Kumar, R. Bezawada, R. Rishika, R. Janakiraman, and P. K. Kannan, "From social to sale: The effects of firm-generated content in social media on customer behavior," J. Mark., vol. 80, no. 1, 2018, pp. 7–25.
[11] N. M. Larsen, V. Sigurdsson, and J. Breivik, "The use of observational technology to study in-store behavior: Consumer choice, video surveillance, and retail analytics," Behav. Analyst, vol. 40, 2017, pp. 343–371.
[12] C. Lee and K. Park, "Comparing traditional and ensemble learning methods for churn prediction in the telecom industry," J. Bus. Anal., vol. 6, no. 2, 2024, pp. 90–105.
[13] K. Melis, K. Campo, E. Breugelmans, and L. Lamey, "The impact of online versus offline channel additions on customer behavior," Int. J. Res. Mark., vol. 33, no. 3, 2016, pp. 725–735.
[14] M. Mohammadi, M. R. Rasouli, and M. S. Pishvaee, "Data driven approaches for customer centric and service dominant value propositions: A systematic literature review," J. Ind. Syst. Eng., vol. 14, no. 1, 2021, pp. 51–75.
[15] E. Montaguti, S. Kuester, and V. R. Rao, "Advertising investments in a recessionary environment: Enhancing customer loyalty via multichannel strategies," J. Mark., vol. 80, no. 3, 2016, pp. 89–106.
[16] E. Park, Y. Jang, J. Kim, N. J. Jeong, K. Bae, and A. P. Del Pobil, "Determinants of customer satisfaction with airline services: An analysis of customer feedback big data," J. Retail. Consum. Serv., vol. 51, 2019, pp. 186–190.
[17] K. Pauwels and S. A. Neslin, "Building with bricks and mortar: The revenue impact of opening physical stores in a multichannel environment," J. Retail., vol. 91, no. 2, 2015, pp. 182–197.
[18] M. A. Rahim, M. Mushafiq, S. Khan, and Z. A. Arain, "RFM-based repurchase behavior for customer classification and segmentation," J. Retail. Consum. Serv., vol. 61, 2021, Art. no. 102566.
[19] J. R. Saura, P. R. Palos-Sanchez, and M. B. Correia, "Using machine learning and sentiment analysis to improve customer segmentation and product strategies," Technol. Forecast. Soc. Change, vol. 190, 2023, Art. no. 122331.
[20] S. E. Schaeffer and S. V. R. Sanchez, "Forecasting client retention—A machine-learning approach," J. Retail. Consum. Serv., vol. 52, 2020, Art. no. 101918.
[21] T. Wang, "Enhancing digital marketing through customer segmentation using reinforcement learning and purchase frequency metrics," J. Mark. Anal., vol. 13, no. 2, 2025, pp. 109–127.
[22] H. Y. Wong, Y. Tan, and T. Teo, "Integrating clustering techniques with behavioral metrics for e-commerce customer segmentation," J. Bus. Res., vol. 165, 2024, Art. no. 112984.
[23] Y. Xu, L. Zhang, and H. Liu, "Deep clustering-based RFM segmentation for e-commerce personalization," Electron. Commer. Res. Appl., vol. 58, 2023, Art. no. 101152.
[24] D. Zammit and C. Zerafa, "A simplified and numerically stable approach to the BG/NBD churn prediction model," arXiv Prepr., arXiv:2502.12912, 2025.
[25] M. Ziari, "A data-driven pricing model for distribution systems considering competition," Adv. Ind. Eng., vol. 58, no. 1, 2024, pp. 179–195.