Advanced Prediction of Chronic Kidney Failure Using Biostatistical and Machine Learning Models: An Age-Stratified Analytical Study Based on Simulated Iraqi Patient Data

Taghreed Abdel-Zahra; Karrar Al-Sayeab; Mohammmed Neamah

doi:10.71428/PJS.2026.0109

PDF

Published: 2026-04-08

DOI: https://doi.org/10.71428/PJS.2026.0109

Keywords:

Chronic Kidney Disease (CKD), Kidney Failure Prediction, Machine Learning, Random Forest, XGBoost, Support Vector Machine (SVM)

Taghreed Abdel-Hussein Abdel-Zahra

Department of Family and Community Medicine - Jaber Ibn Hayyan College of Medicine and Pharmaceutical Sciences, Iraq

Karrar R. AL-Sayeab

Jabir Ibn Hayyan University for Medical and Pharmaceutical Sciences, Iraq

Mohammmed Hussein Neamah

University of Kufa College of pharmacy, Iraq

Abstract

Chronic kidney disease has become one of the major public health issues in Iraq due to the increasing rates of diabetes, high blood pressure, and an ageing population. The aim of this study was to investigate how age influences the creation of predictive models for the various stages of kidney disease. Key attributes were identified, and a synthetic dataset of 1,000 patients from Iraq was generated. To do this, several predictive models were employed, including Logistic Regression, Random Forest, Support Vector Machines (SVM), and Extreme Gradient Boosting (XGBoost). A 10-fold cross-validation was performed on all models to assess their stability and generalizability. The models were assessed, and their performance was measured using accuracy, sensitivity, specificity, receiver operating characteristic (ROC), and area under the curve (AUC), as well as on calibration tests, decision curve analysis, interaction tests, and survival analysis. Of all the models evaluated, Random Forest and XGBoost were found to have the best discriminative ability, with AUCs of 0.88 and 0.89, respectively. From the analyses conducted, individuals aged 60 years and older had a significantly higher likelihood of having kidney disease. The most significant predictors were older age, higher serum creatinine levels, and the presence of diabetes and hypertension. The results underscore the clinical value of predictive models for early risk stratification and emphasize the value of predictive technology for information-based management of chronic kidney disease.

Issue

Vol. 2 No. 1 (2026)

Section

Articles

How to Cite

Advanced Prediction of Chronic Kidney Failure Using Biostatistical and Machine Learning Models: An Age-Stratified Analytical Study Based on Simulated Iraqi Patient Data. (2026). Pharaonic Journal of Science, 2(1), 101-125. https://doi.org/10.71428/PJS.2026.0109

Pharaonic Journal of Science

Article Sidebar

Main Article Content

Abstract

Article Details

Issue

Section

How to Cite