Volume 206, Issue 3 pp. 919-923
SHORT REPORT
Open Access

A machine learning-based workflow for predicting transplant outcomes in patients with sickle cell disease

Haiou Li

Haiou Li

Sickle Cell Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Vandana Sachdev

Vandana Sachdev

Echocardiography Laboratory, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Xin Tian

Xin Tian

Office of Biostatistics Research, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
My-Le Nguyen

My-Le Nguyen

Echocardiography Laboratory, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Matthew Hsieh

Matthew Hsieh

Cellular and Molecular Therapeutics Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Courtney Fitzhugh

Courtney Fitzhugh

Cellular and Molecular Therapeutics Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Emily Limerick

Emily Limerick

Cellular and Molecular Therapeutics Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Wynona Coles

Wynona Coles

Cellular and Molecular Therapeutics Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Nancy Asomaning

Nancy Asomaning

Sickle Cell Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Anna Conrey

Anna Conrey

Sickle Cell Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Colin O. Wu

Colin O. Wu

Office of Biostatistics Research, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Search for more papers by this author
Swee Lay Thein

Corresponding Author

Swee Lay Thein

Sickle Cell Branch, National Heart, Lung, and Blood Institute, NIH, Bethesda, Maryland, USA

Correspondence

Swee Lay Thein, Sickle Cell Branch, National Heart, Lung and Blood Institute, National Institutes of Health, 10 Center Drive, Building 10, RM 6S241A, Bethesda, MD 20892-1650, USA.

Email: [email protected]

Search for more papers by this author
First published: 22 October 2024

Haiou Li, Vandana Sachdev, Colin O. Wu and Swee Lay Thein contributed equally.

Summary

Allogeneic haematopoietic cell transplantation (HCT) with HLA-matched sibling donor remains the most established curative therapeutic option for patients with sickle cell disease (SCD). However, it is not without risks, highlighting the need for a risk stratification system. Utilizing a machine learning (ML) approach that combines clinical and imaging variables, we identified red cell distribution width and renal organ damage as important risk factors for patients undergoing HCT. This ML-based algorithm, similar to an approach previously reported for predicting mortality in patients with SCD, should be applicable to risk factor discovery in similar studies.

INTRODUCTION

Sickle cell disease (SCD), caused by a single-point mutation in the β-haemoglobin gene, is probably the most common severe inherited disease in the world.1 The clinical manifestations are multisystemic with progressive organ damage and premature mortality. Current drug therapies suffer from poor patient adherence and are not effective in all patients. Recently, the US Food and Drug Administration (FDA) approval of two genetic therapies—Casgevy and Lyfgenia—has increased therapeutic options,2 but allogeneic haematopoietic cell transplantation (HCT) with HLA-matched sibling donor remains the more established and relatively more accessible option for patients. Nonetheless, these curative interventions are high-risk procedures involving potentially significant toxicities and organ complications.2-5 In 2021, the American Society of Haematology published a guideline with eight recommendations on HCT for SCD, but it was acknowledged that the evidence for all recommendations is of low certainty due to a lack of randomized clinical trials.6 A lack of universal end-points in HCT trials was also noted, highlighting an immediate need for predictive biomarkers of mortality associated with HCT. In prior work, we developed a phenotypic risk score to predict mortality in SCD,4 and we demonstrated that our machine learning (ML)-based approach had superior performance compared with competing models. However, we recognized that SCD patients who underwent HCT were censored and not included in the analysis. Here, we applied a similar ML approach to develop a risk stratification workflow to predict both mortality and transplant outcomes specifically in adults with SCD undergoing HCT.

METHODS

Study design and participants

The study included SCD patients who underwent HCT (full HLA-matched siblings and haploidentical, Table S1) at the NIH Clinical Centre, between September 2004 and February 2020 (Clinicaltrials.gov: NCT#00061568, NCT#00977691, NCT#02105766 and NCT#03077542) (Figure 1). All transplant protocols were approved by the National Heart, Lung, and Blood Institutional Review Board, and all subjects signed informed consent prior to participation. Baseline echocardiogram (echo) and laboratory measurements were defined as those performed within 30 days of the consent date, and the final follow-up for all patients was in March 2023. Of 119 patients, 17 were originally excluded, 16 due to lack of echo data, and one due to lack of laboratory data. Seven patients aged <18 years were excluded. Initially, 97 covariates (demographics, echo, electrocardiogram (EKG), vitals, laboratory, and pulmonary function tests (PFTs)) were considered as candidate risk factors for mortality and transplant failure (Table S2); however, those with >30% missing values were excluded resulting in 73 variables that were included in the analysis (Figure 1).

Details are in the caption following the image
(A) Study design and analysis schema. 119 patients were recruited from 2004 to 2020 with 98 risk factors. And 24 out of 119 were excluded due to age requirements and/or lack of available measurements within 30-day time window compared with consent date. For the model construction phase, 95 patients with 73 variables were under study (25 variables excluded due to high missingness). The final analytical sample was determined based on the availability of the developed risk score which led to 111 patients. (B) Kaplan–Meier Mortality Estimate based on the Risk Score Categories on 111 Patients. All-cause mortality estimates based on risk scores calculated by summing up four categorized risk factors: Blood urea nitrogen (BUN), lactate dehydrogenase (LDH), red blood cell distribution width (RDW), and absolute reticulocyte count. The 4-year mortality and overall survival estimates in the four risk score groups (risk scores 0, 1, 2, 3) with a C-statistic of 0.815; the 4-year mortality rates were 0%, 3.2%, 7.9%, and 66.7%, respectively.

Study outcomes

The primary outcome was all-cause mortality ascertained by the date between the consent date and the date of the last follow-up, proxy interview, medical records, National Death Index (NDI), and Social Security Death Index (SSDI) search. Transplant failure was defined as graft failure by the return of acute sickle-related complications, including acute painful episodes, acute chest syndrome, or anaemia, with or without evidence of haemolysis. A secondary analysis was conducted based on the composite transplant outcome, of which event was identified by transplant failure or death (whichever occurred first).

Statistical analysis

We used a two-step statistical ML approach to select the top influential covariates for mortality (see previous study7), which combined the process of random forest (RF), nested RF, and stepwise Cox proportional hazard model. The feature selection was based on 95 patients and the ML-model was evaluated on all patients with available final selected risk factors (N = 111). Details of the ML methods and statistical analyses are available in the Appendix S1.

RESULTS

Characteristics of the study cohort

A total of 111 SCD (106 HbSS and HbSβ0 thal) patients underwent HCT, 74 (67%) with a fully HLA-matched related donor and 37 (33%) with a haploidentical donor (Table S1). The median age of participants was 31.8 (24.7–37.8) years, 38% were female, other demographic details are shown in Table S1. Over a median follow-up of 6.9 years (maximum follow-up of 19.0 years), 17 deaths were observed (median age [IQR] 40 [27–47] years) and the 4-year mortality was estimated at 7.2% (95% confidence interval [CI] 2.3%–12.0%) (Figure S1), compared with 14% in the general SCD population.7 For the HCT procedure, 27 graft failures (24.3%) were observed during a median time of 5.24 years after the transplant.

Machine learning for ranking variable importance

Using the model building cohort of 95 patients with 73 covariates (Figure 1), variable importance (VIMP) was calculated by the random survival forests (RSF) method for mortality prediction (Figure S2A). Based on the VIMP ranking, a nested RSF was applied which suggested that the optimal number of variables to reach the maximal model performance was eight achieved by eight covariates. These top eight predictors were blood urea nitrogen (BUN), creatinine, left ventricular posterior wall (LVPW) diastolic wall thickness, lactate dehydrogenase (LDH), red blood cell distribution width (RDW), estimated right ventricular systolic pressure (RVSP), absolute reticulocyte count and direct bilirubin (Figure S2B).

Cut-off point and risk score development based on stepwise Cox regression

The top eight selected risk variables were entered into a multivariable model (model 1) with a resulting a C-statistic of 0.831. In order to develop an easy-to-apply risk classification, the optimal cut-off points were chosen for each of eight risk factors based on the survival classification and regression tree8 (CART) (Table S3 and Figure S3). A stepwise Cox proportional hazard regression was further applied to correct for potential overfitting, which identified four independent predictors—BUN, LDH, RDW, and absolute reticulocyte count—that were associated with all-cause mortality. Model 2, based on these four categorial risk factors, performed better with a higher C-statistic of 0.840 compared with Model 1 (Table S4). In Model 3, we assigned equal weight to each of the 4 predictors and compiled a risk score as a sum of all the binary factors in the cohort (Table S4). Performance of the simplified risk score using the four categorized factors was similar compared with the model using eight factors with C-statistic = 0.842 (Figure S4A). We applied the HCT-Comorbidity Index (HCT-CI) risk stratification tool to our cohort and found that our new phenotypic risk score had better discrimination and predictive capacity (C-statistic, 0.842 vs. 0.679, Figure S4B).9, 10

Since the top four predictors in the phenotypic risk score did not include echo variables, we reinstated the 16 transplant subjects that were initially excluded (due to lack of echo data) and repeated the analyses. The C-statistic of 0.815 using Model 3 on the cohort of 111 illustrates the robustness of our developed risk scoring system and 4-year mortality estimates were 0%, 3.2%, 7.9%, and 66.7% based on risk scores of 0, 1, 2, and 3, respectively (Figure 1B). The significance remained when controlling for the transplant donor type (C-statistic 0.871, Figure S5).

For the secondary analysis, amongst the 111 subjects in the full cohort, a higher mortality was observed in the 27 who failed HCT compared with those who engrafted (40.7% vs. 7.1%). Besides the mortality stratification, the risk scores of 0, 1, 2, and 3 could also identify the patients' risk of transplant failure, not taking into account the donor type (C-statistic 0.649) with 4-year disease progression estimates of 9.1%, 22%, 32.8%, and 74.1% (Figure S6; Table S5). Accounting for the type of transplant donor improved C-statistic to 0.769 (Figure S7; Table S5).

DISCUSSION

Considering the significant risks associated with HCT in SCD patients, the development of a risk stratification system to identify patients at the highest risk of early mortality and significant morbidity after HCT, is a priority. This system could serve as both an assessment and decision-making tool to aid in determining eligibility for curative therapies. Furthermore, developing an algorithm that aligns the risks of graft failure and survival after HCT with the SCD disease severity will facilitate monitoring of patient disease progression for timely HCT intervention. Based on a single-centre cohort of 111 SCD patients, our ML-based workflow selected a set of baseline biomarkers, alongside the transplant donor type, that were associated with both SCD mortality and transplant outcomes. Four critical and readily accessible clinical variables—BUN, LDH, RDW, and absolute reticulocyte count—were identified and the risk models utilizing these risk factors showed enhanced performance in predicting the risk of mortality and graft failure.

The variable identified as having the greatest importance by RSF was BUN, a global biomarker of renal complications in SCD, and it was found to be more predictive in our study than other biomarkers of renal function, such as serum creatinine, eGFR, and urine albumin/creatinine ratio.7 Lactate dehydrogenase has previously been shown to be associated with the severity of vaso-occlusive episodes and muti-organ failure, as well as survival in SCD.11, 12 Our current findings underscore the importance of routine blood laboratory parameters as prognostic indicators, particularly absolute reticulocyte count and RDW, which have been associated with acute SCD pain episodes13 and vascular pathology14 in transplant outcomes.

The proposed risk prediction workflow (Figure 1A and Figure S8), which integrates ML techniques for biomarker selection and model building, outperformed the established HCT-CI index in predicting outcomes for our SCD transplant cohort. A main limitation of the current study is the lack of a validation cohort for our risk prediction algorithm, which was derived from a limited, single-centre cohort. Interval validation based on bootstrap and sample splitting approach was applied to evaluate the performance of the novel risk score and proposed workflow. However, given the heterogeneity in SCD severity and patient phenotypes amongst different centres, we recommend implementing the ML workflow, rather than the specific biomarkers identified, for tailored centre-specific risk identification and transplant recommendation (Figure S8). This workflow is both versatile and adaptable for risk stratification and the selection of SCD transplant candidates across various centres.

AUTHOR CONTRIBUTIONS

Vandana Sachdev, Haiou Li, Colin O. Wu, and Swee Lay Thein designed the study. Haiou Li and Vandana Sachdev contributed equally as the first authors. Colin O. Wu and Swee Lay Thein contributed equally as senior authors. My-Le Nguyen, Matt Hsieh, Courtney Fitzhugh, Emily Limerick, Nancy Asomaning, and Anna Conrey contributed to data acquisition, curation, and analysis. Haiou Li and Vandana Sachdev verified the underlying data. Haiou Li, Vandana Sachdev, Xin Tian, and Colin O. Wu contributed to the statistical analysis. Haiou Li, Vandana Sachdev, Xin Tian, Colin O. Wu, and Swee Lay Thein wrote the first draft of the manuscript. All authors critically reviewed the draft and approved the final version for publication.

FUNDING INFORMATION

This research was supported by the Intramural Research Programme of the National Heart, Lung, and Blood Institute, NIH, Department of Health and Human Services (DHHS).

CONFLICT OF INTEREST STATEMENT

The authors have no competing interests.

ETHICS STATEMENT

All protocols were approved by the NHLBI Institutional Review Board. Written informed consent was obtained from all individuals enrolled in the study.

CLINICAL TRIAL REGISTRATION (INCLUDING TRIAL NUMBER)

Clinicaltrials.gov: NCT#00061568, NCT#00977691, NCT#02105766 and NCT#03077542.

DATA AVAILABILITY STATEMENT

The datasets generated and/or analysed in the current study are available from the corresponding author upon reasonable request.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.