Global Gender Inequality Through Explainable AI: Machine Learning, Clustering, and SHAP Insights


Çelik S., Köroğlu C. Z.

SOCIAL SCIENCE COMPUTER REVIEW, cilt.0, sa.0, ss.1-34, 2026 (SCI-Expanded, SSCI, Scopus)

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 0 Sayı: 0
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1177/08944393261419809
  • Dergi Adı: SOCIAL SCIENCE COMPUTER REVIEW
  • Derginin Tarandığı İndeksler: Scopus, Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), IBZ Online, Educational research abstracts (ERA), Index Islamicus, INSPEC, Library, Information Science & Technology Abstracts (LISTA), Political Science Complete, Psycinfo, Public Affairs Index
  • Sayfa Sayıları: ss.1-34
  • Uşak Üniversitesi Adresli: Evet

Özet

Objective: This paper analyzes gender equality across countries in the year 2024 by using the GGGI, with the intention of disentangling the unseen structural and non-deterministic patterns. Instead of repeating the process of calculating the index, it is openly recognizing the compositional feature of the GGGI and the unseen similarities between the indices. Methods: This research employs a global cross-sectional study of 146 countries over the four primary GGGI sectors: economic participation, education, health and survival, and empowerment. Where OLS is only employed as a diagnostic test, as its almost perfect fit (R2∼1) is squarely mechanical and lacks relevance for inference. Apart from ensemble models employed for predictions, K-means clustering, SHAP analysis, and GridSearchCV optimization are also used. Findings: The out-of-sample predictions demonstrate high levels of predictive accuracy, with Gradient Boosting models yielding an R2 of approximately 0.90 and an RMSE of approximately 0.045, indicating that there is significant nonlinear information beyond index aggregation. Unsupervised clustering techniques show that there are seven distinct country clusters that go beyond traditional geographic and income divisions, which can be identified with more than 93% accuracy. The SHAP results show that empowerment and economic participation are drivers, while there is insignificant variation in healthcare. Contribution: This study identifies the boundaries of regression analysis in index research, as well as the advantages of machine learning analysis in determining structural patterns related to gender equity.