Predicting the Availability of Hematopoietic Stem Cell Donors Using Machine Learning

Li, Ying, Masiliune, Ausra, Winstone, David, Gasieniec, Leszek ORCID: 0000-0003-1809-9814, Wong, Prudence ORCID: 0000-0001-7935-7245, Lin, Hong, Pawson, Rachel, Parkes, Guy and Hadley, Andrew
(2020) Predicting the Availability of Hematopoietic Stem Cell Donors Using Machine Learning. Biology of Blood and Marrow Transplantation, 26 (8). pp. 1406-1413.

This is the latest version of this item.

[img] Text
YBBMT-D-19-00843.pdf - Author Accepted Manuscript
Available under License : See the attached licence file.

Download (1MB) | Preview


Hematopoietic stem cell transplantation (HSCT) is firmly established as an important curative therapy for patients with hematologic malignancies and other blood disorders. Apart from finding HLA-matched donors during the HSCT process, donor availability remains a key consideration as the time taken from diagnosis to transplant is recognized to adversely affect patient outcome. In this study, we aimed to develop and validate a machine learning approach to predict the availability of stem cell donors. We retrospectively collected a data set containing 10,258 verification typing requests made during the HSCT process in the British Bone Marrow Registry (BBMR) between January 1, 2013, and December 31, 2018. Three machine learning algorithms were implemented and compared, including boosted decision trees (BDTs), logistic regression, and support vector machines. Area under the receiver operating characteristic curve (AUC) was primarily used to assess the algorithms. The experimental results showed that BDTs performed better in predicting the availability of BBMR donors. The overall predictive power of the model, using AUC on the test cohort of 2052 records, was found to be 0.826. Our findings show that machine learning can predict the availability of donors with a high degree of accuracy. We propose the use of the BDT machine learning approach to predict the availability of BBMR donors and use the predictive scores during the HSCT process to ensure patients with blood cancers or disorders receive a transplant at the optimum time.

Item Type: Article
Uncontrolled Keywords: allogeneic hematopoietic stem cell transplantation, machine learning, donor availability, donor selection
Depositing User: Symplectic Admin
Date Deposited: 09 Sep 2020 10:00
Last Modified: 18 Jan 2023 23:46
DOI: 10.1016/j.bbmt.2020.03.026
Related URLs:

Available Versions of this Item