Predicting Employee Turnover Using Explainable Machine Learning Models: A Comparative Study of XGBoost, LightGBM, CatBoost, and Deep Neural Networks

Authors

    Gabriela Torres Department of Business Administration and Innovation, Tecnológico de Monterrey, Monterrey, Mexico
    Priya Nair * Department of Human Resource Management, Ted Rogers School of Management, Toronto Metropolitan University, Toronto, Canada priya.nair@torontomu.ca
    Mikko Lahtinen Department of Industrial Engineering and Management, Aalto University, Espoo, Finland
https://doi.org/10.61838/

Keywords:

Employee Turnover Prediction, Explainable Artificial Intelligence, Human Resource Analytics, Machine Learning

Abstract

Objective: This study aimed to develop and compare explainable machine learning models for employee turnover prediction using XGBoost, LightGBM, CatBoost, and Deep Neural Networks and to identify the most influential predictors of turnover risk through explainable artificial intelligence techniques.

Methods and Materials: This quantitative predictive analytics study was conducted using a dataset of 4,862 employees from 37 organizations across multiple industries in Canada. Employee demographic, organizational, behavioral, performance-related, and psychological variables were collected from human resource information systems and validated survey instruments. The dataset was preprocessed through normalization, feature engineering, categorical encoding, and missing-value imputation procedures. Four machine learning algorithms, including XGBoost, LightGBM, CatBoost, and Deep Neural Networks, were developed and optimized using grid search and Bayesian hyperparameter tuning. The dataset was divided into training, validation, and testing subsets using stratified sampling. Model performance was evaluated using accuracy, precision, recall, F1-score, area under the receiver operating characteristic curve (AUC-ROC), area under the precision-recall curve (AUC-PR), Matthews correlation coefficient, and balanced accuracy. Explainability was achieved through SHapley Additive exPlanations (SHAP), Local Interpretable Model-Agnostic Explanations (LIME), and partial dependence analyses.

Findings: The comparative analysis revealed that all machine learning models achieved strong predictive performance; however, CatBoost outperformed the other algorithms, achieving an accuracy of 91.9%, an F1-score of 0.878, and an AUC-ROC value of 0.958. XGBoost demonstrated the second-highest performance with an accuracy of 91.3% and an AUC-ROC of 0.951, followed by LightGBM (accuracy = 90.6%, AUC-ROC = 0.944) and the Deep Neural Network (accuracy = 89.7%, AUC-ROC = 0.936). ROC curve analyses confirmed the superior discriminative ability of CatBoost. SHAP-based explainability analyses identified job satisfaction, employee engagement, organizational commitment, perceived organizational support, promotion frequency, salary growth, performance ratings, overtime hours, absenteeism, and training participation as the most influential predictors of employee turnover. The confusion matrix of the CatBoost model further demonstrated strong classification capability with high true positive and true negative rates and minimal classification errors.

Conclusion: The findings demonstrate that explainable machine learning models can accurately predict employee turnover and provide meaningful insights into the organizational, behavioral, and psychological factors associated with employee departure. Among the evaluated algorithms, CatBoost offered the optimal balance between predictive accuracy and interpretability. The integration of explainable artificial intelligence techniques enhanced transparency and practical applicability, enabling organizations to identify high-risk employees and implement proactive retention strategies. These results support the adoption of explainable machine learning as an effective decision-support tool for workforce analytics, talent management, and organizational sustainability.

Downloads

Download data is not yet available.

References

Akasheh, M. A., Hujran, O., Malik, E. F., & Zaki, N. (2024). Enhancing the Prediction of Employee Turnover With Knowledge Graphs and Explainable AI. IEEE Access, 12, 77041-77053. https://doi.org/10.1109/access.2024.3404829

Alqahtani, H., Almagrabi, H., & Alharbi, A. (2024). Employee Attrition Prediction Using Machine Learning Models: A Review Paper. International Journal of Artificial Intelligence & Applications, 15(2), 23-49. https://doi.org/10.5121/ijaia.2024.15202

Chandgadkar, V., Pathak, P., Solanki, A., Shrivastava, A., Maktum, T., & Pulgam, N. (2024). ETPSR: Employee Turnover Prediction With Security Recommendations. https://doi.org/10.21203/rs.3.rs-4261760/v1

Dhamija, S. (2023). Analysing Theoretical Models for Predicting Employee Attrition: A Comparative Study in the FMCG Sector. Journal of Advanced Zoology, 44(S-3), 1179-1191. https://doi.org/10.17762/jaz.v44is-3.1295

Gazi, M. S., Nasiruddin, M., Dutta, S., Sikder, R., Huda, C. B., & Islam, M. Z. (2024). Employee Attrition Prediction in the USA: A Machine Learning Approach for HR Analytics and Talent Retention Strategies. Journal of Business and Management Studies, 6(3), 47-59. https://doi.org/10.32996/jbms.2024.6.3.6

Gurung, N., Gazi, M. S., & Islam, M. Z. (2024). Strategic Employee Performance Analysis in the USA: Deploying Machine Learning Algorithms Intelligently. Journal of Business and Management Studies, 6(3), 01-14. https://doi.org/10.32996/jbms.2024.6.3.1

Hadiyanto, H., & Anggoro, Y. (2025). A Skill Matrix Chart Case Study in Global Manufacturing. Systems Research and Behavioral Science, 42(6), 1902-1916. https://doi.org/10.1002/sres.3128

Kiran, P. R., Chaubey, A., & Shastri, R. K. (2023). Role of HR Analytics and Attrition on Organisational Performance: A literature Review Leveraging The SCM-Tbfo Framework. Benchmarking an International Journal, 31(9), 3102-3129. https://doi.org/10.1108/bij-06-2023-0412

Kumar, P., Gaikwad, S. B., Ramya, S. T., Tiwari, T., Tiwari, M., & Kumar, B. (2023). Predicting Employee Turnover: A Systematic Machine Learning Approach for Resource Conservation and Workforce Stability. 117. https://doi.org/10.3390/engproc2023059117

Kundu, B., Shyam, R., & Mondal, B. (2024). The Role of Predictive Analytics in Enhancing Employee Retention in India’s Start-Up Ecosystem. Interantional Journal of Scientific Research in Engineering and Management, 08(11), 1-6. https://doi.org/10.55041/ijsrem39218

Lalitha, V., Prithiv, R. S., & Lokesh, P. (2023). A Novel Analysis of Employee Attrition Rate by Maneuvering Machine Learning. Advances in Science and Technology. https://doi.org/10.4028/p-sw6mmb

Márquez, B. Y., Realyvásquez-Vargas, A., Lopez-Esparza, N., & Ramos, C. E. (2023). Application of Ordinary Least Squares Regression and Neural Networks in Predicting Employee Turnover in the Industry. Archives of Advanced Engineering Science, 2(1), 30-36. https://doi.org/10.47852/bonviewaaes32021326

Nandal, M., Grover, V., Sahu, D., & Dogra, M. (2024). Employee Attrition: Analysis of Data Driven Models. Eai Endorsed Transactions on Internet of Things, 10. https://doi.org/10.4108/eetiot.4762

Patnaik, T. (2024). The Incorporation of HR Metrics in HR Management: A Theoretical Framework With a Specific Focus on Indian Corporates. Ajbr, 5409-5417. https://doi.org/10.53555/ajbr.v27i3s.3115

Preena, R. E. (2024). Techniques to Predict Employee Attrition Using Optimized Levy Fruit Fly Optimization Algorithm. https://doi.org/10.21203/rs.3.rs-4127736/v1

Qin, R., Qi, X., Yuan, Y., & Alataş, B. (2026). Employee Turnover Prediction Research of Human Resource Management on Machine Learning Algorithms and Big Data Analysis. Journal of Organizational and End User Computing, 38(1), 1-27. https://doi.org/10.4018/joeuc.399146

Ravesangar, K., & Narayanan, S. (2024). Adoption of HR Analytics to Enhance Employee Retention in the Workplace: A Review. Hum. Resour. Manag. Serv., 6(3), 3481. https://doi.org/10.18282/hrms.v6i3.3481

Reddy, J. S. (2023). Big Data-Based Framework for Prediction of Employee Attrition by Using Deep Data People Analytics. https://doi.org/10.52783/jier.v3i2.432

Swetha, M., Mahalakshmi, S., Pushpa, S., Madihalli, A. T., Ananya, A., & Bhardwaj, A. (2022). Machine Learning Approach for Forecasting Job Appeasement and Employee Corrosion. https://doi.org/10.15439/2023r17

Talebi, H., Bardsiri, A. K., & Bardsiri, V. K. (2025). Machine Learning Approaches for Predicting Employee Turnover: A Systematic Review. Engineering Reports, 7(8). https://doi.org/10.1002/eng2.70298

Tang, M., Zhao, T., Hu, Z., & Li, Q. (2023). Research on Risk Prediction and Early Warning of Human Resource Management Based on Machine Learning and Ontology Reasoning. Tehnicki Vjesnik - Technical Gazette, 30(6). https://doi.org/10.17559/tv-20230810000869

Tripathi, A. K., Kumari, P., Katiyar, P., Chaudhary, V. S., & Sharma, M. (2024). Workforce Attrition and Engagement Analysis Using Machine Learning. https://doi.org/10.55524/csistw.2024.12.1.26

Xu, Y. (2026). Improving Employee Retention With Machine Learning for Proactive Turnover Risk Detection and Strategic Interventions. https://doi.org/10.21203/rs.3.rs-8378596/v1

Zheng, X. (2025). Construction of a Big Data-Driven Predictive Analysis Platform for Hospital Talent Attrition. Blockchain in Healthcare Today, 8(3). https://doi.org/10.30953/bhty.v8.433

Downloads

Additional Files

Published

2026-05-01

Submitted

2025-11-05

Revised

2026-02-13

Accepted

2026-02-21

How to Cite

Torres, G., Nair, P., & Lahtinen, M. . (2026). Predicting Employee Turnover Using Explainable Machine Learning Models: A Comparative Study of XGBoost, LightGBM, CatBoost, and Deep Neural Networks. International Journal of Innovation Management and Organizational Behavior (IJIMOB), 6(3), 1-13. https://doi.org/10.61838/