Global Sustainability Performance and Regional Disparities: A Machine Learning Approach Based on the 2025 SDG Index


Çelik S., ÖZTÜRK Ö. F., Akkucuk U., ŞAŞMAZ M. Ü.

Sustainability (Switzerland), cilt.17, sa.16, 2025 (SCI-Expanded, SSCI, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 17 Sayı: 16
  • Basım Tarihi: 2025
  • Doi Numarası: 10.3390/su17167411
  • Dergi Adı: Sustainability (Switzerland)
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Scopus, Aerospace Database, Agricultural & Environmental Science Database, CAB Abstracts, Communication Abstracts, Food Science & Technology Abstracts, Geobase, INSPEC, Metadex, Veterinary Science Database, Directory of Open Access Journals, Civil Engineering Abstracts
  • Anahtar Kelimeler: K-Means clustering, machine learning, Random Forest, regional disparities, ROC analysis, sustainability performance, sustainable development goals
  • Uşak Üniversitesi Adresli: Evet

Özet

Sustainability performance varies significantly across countries, yet global assessments overlook the underlying structural trends. This study bridges this gap using machine learning to uncover meaningful clustering in global sustainability outcomes based on the 2025 Sustainable Development Goals (SDG) Index. We applied K-Means clustering to group 166 countries into five standardized indicators: SDG score, spillover effects, regional score, population size, and recent progress. The five-cluster solution was confirmed by the Elbow and Silhouette procedures, with ANOVA and MANOVA tests subsequently indicating statistically significant cluster differences. For the validation and interpretation of the results, six supervised learning algorithms were employed. Random Forest, SVM, and ANN performed best in classification accuracy (97.7%) with perfect ROC-AUC scores (AUC = 1.0). Feature importance analysis showed that SDG and regional scores were most predictive of cluster membership, while population size was the least. This supervised–unsupervised hybrid approach offers a reproducible blueprint for cross-country benchmarking of sustainability. It also offers actionable insights for tailoring policy to groups of countries, whether high-income OECD nations, emerging markets, or resource-scarce countries. Our findings demonstrate that machine learning is a useful tool for revealing structural disparities in sustainability and informing cluster-specific policy interventions toward the 2030 Agenda.