Abstract
Blood Center/Blood Hospital-Based Donor Center
W. Alton Russell, PhD (he/him/his)
Assistant Professor
McGill School of Population and Global Health
Montreal, Quebec, Canada
Disclosure information not submitted.
Selected models used Catboost and gradient boosted machine algorithms. In the ‘hemoglobin only’ RISE dataset (n=3,488 donations), the selected models’ cross-validation RMSPE was 6.5 for predicting return hemoglobin and 24.2 for predicting return log10 ferritin. When externally validating to the US (n=60,403), SA (n=253,537), and the NL (n=514,117), RMSPE increased by < 15% whether predicting hemoglobin (RMSPE=7.2, 7.3, and 5.9) or log10 ferritin (RMSPE=22.9, 27.6, and 19.1; Figure A). In the ‘hemoglobin and ferritin’ RISE dataset (n=2,625 donations), the selected models’ cross-validation RMSPE was 6.6 for predicting return hemoglobin and 14 for predicting return log10 ferritin. When externally validating to the US (n=11,025), SA (n=12,564) and NL (n=179,423), RMSPE increased by < 2% when predicting hemoglobin (RMSPE=4.6, 6.7, 6.3), and increased by < 33% when predicting log10 ferritin (RMSPE=14.9, 18.9 [+32%], 15).
Conclusions: Machine learning models generalized well across diverse settings, particularly when predicting return hemoglobin. Measuring baseline ferritin had a small impact on ability to predict return hemoglobin but greatly improved prediction of return log10 ferritin. Limitations include that our cross validation RMSPE may be optimistic due to overfitting and that each cohort’s policy on which donations are tested for ferritin could bias results.