- 1Transplant Research Center, Research Institute for Convergence of Biomedical Science and Technology, Pusan National University Yangsan Hospital, Yangsan, Republic of Korea
- 2Department of Internal Medicine, School of Medicine, Pusan National University, Busan, Republic of Korea
- 3Department of Information Convergence Engineering, Pusan National University, Yangsan, Republic of Korea
- 4School of Biomedical Convergence Engineering, Pusan National University, Yangsan, Republic of Korea
- 5Center for Artificial Intelligence Research, Pusan National University, Busan, Republic of Korea
Optimizing lung transplant candidate selection is crucial for maximizing resource efficiency and improving patient outcomes. Using data from the International Society for Heart and Lung Transplantation (ISHLT) registry (29,364 patients), we developed a deep learning model to predict 1-year survival after lung transplantation. Initially, 25 pretransplant factors were identified, and their importance was assessed using SHapley Additive exPlanations values. We refined the model by selecting the top 10 most influential factors and compared its performance with the original model. Additionally, we conducted external validation using an independent in-house dataset. Among the 29,364 patients, 4,729 (16.1%) died within 1 year, while 24,635 survived. The Gradient Boosting Machine (GBM) model achieved the highest performance (AUC: 0.958, accuracy: 0.949). Notably, the streamlined model using only the top 10 factors maintained identical performance (AUC: 0.958, accuracy: 0.949). The in-house dataset used for external validation showed significant compositional differences compared to the ISHLT dataset. Despite these differences, the GBM model performed well (AUC: 0.852, accuracy: 0.764). Notably, the Multilayer Perceptron model demonstrated superior generalization with an AUC of 0.911 and accuracy of 0.870. Our machine learning-based approach effectively predicts 1-year mortality in lung transplant recipients using a minimal set of pretransplant factors.
Introduction
Lung transplantation is a critical intervention for patients with end-stage lung disease, providing significant survival benefits. As the population ages and medical technologies advance, the demand for lung transplants continues to rise [1]. However, the global supply of organ donors is insufficient to meet this growing demand [2]. Despite advancements in surgical techniques and immunosuppressive therapies, the post-transplant environment remains challenging, with various factors influencing outcomes [3]. Given the scarcity of medical resources, prioritizing patients with a low risk of mortality post-transplant is imperative. Predicting 1-year mortality following lung transplantation is, therefore, a critical goal to optimize patient care and enhance clinical decision-making.
Traditional methods for predicting mortality after lung transplantation have primarily relied on clinical judgment and risk-scoring systems [4–7]. While these scoring systems aid in prioritizing candidates, they often fail to account for the complexity of individual patient trajectories. This is due to the limited number of variables they include and their heavy reliance on clinical judgment. Additionally, several factors traditionally considered important, such as recipient age, body mass index (BMI), and the duration of preoperative mechanical ventilation, have not consistently shown the expected prognostic impact in previous studies [8]. A recent meta-analysis found that only one factor, postoperative extracorporeal membrane oxygenation (ECMO) use, was significantly associated with 1-year mortality, while other commonly accepted risk factors demonstrated minimal prognostic significance [8]. These limitations emphasize the need for more accurate, personalized risk prediction methods, as existing models may lack the granularity required for individualized treatment.
In this context, machine learning approaches offer the potential to develop predictive algorithms that can integrate diverse patient data and identify subtle patterns that traditional methods may overlook. Machine learning models can enhance pretransplant risk stratification, assist clinicians in selecting and counseling candidates, guide post-transplant surveillance strategies, and inform interventions to mitigate adverse outcomes. By providing timely insights into individual patient trajectories, these models can improve resource allocation and patient-centered care. Despite their potential, however, machine learning models predicting patient prognosis post-lung transplantation have not been extensively studied [9–11]. In this study, we developed and validated a machine learning-based model to predict 1-year mortality among lung transplant recipients using data from the International Society for Heart and Lung Transplantation (ISHLT). Additionally, we performed external validation using the in-house dataset from our hospital.
Materials and Methods
Patients who underwent lung transplantation and who were registered in the ISHLT registry from June 2009 to June 2018 were included (Figure 1). By early 2019, 45 centers worldwide had submitted data to the ISHLT International Thoracic Organ Transplant Registry using a secure, web-based data entry system. Detailed spreadsheets of the data elements collected in the Registry are available on the ISHLT International thoracic organ transplant website1. We excluded patients < 18 years, those who had undergone retransplantation or multiple organ transplants, and those with unavailable follow-up status or missing data. Consequently, 29,364 patients were available for analysis. The ethics committees and review board of Pusan National University Yangsan Hospital (PNUYH) (55-2024-128) approved the current study, and informed consent was waived due to the retrospective nature of the study. This study adhered to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guideline. Initially, key variables were selected through regression analysis to develop a predictive model for 1-year mortality following lung transplantation. This analysis identified 25 variables significantly associated with 1-year mortality. We then evaluated feature importance using SHapley Additive exPlanations (SHAP) values (Figure 2A) (Supplementary Figure S1) and developed a refined model incorporating the top 10 most important features. The performance of this refined model was compared with that of the original model, which included all 25 variables. Finally, external validation was conducted using our hospital’s in-house dataset.

Figure 1. Patient inclusion. A total of 29,364 lung transplant patients were included in this study.
Machine Learning-Based Model Development
We employed six machine learning models to predict 1-year mortality after lung transplantation: Logistic Regression (LR) [12], Support Vector Machine (SVM) [13], Random Forest (RF) [14], Gradient Boosting Machine (GBM) [15], Balanced Random Forest (BRF) [16], and a neural network model, Multilayer Perceptron (MLP) [17]. To evaluate model performance, we conducted five-fold cross-validation. The entire dataset was divided into five subsets, and during each iteration, one subset was used for validation while the remaining four were used for training. This process was repeated until each subset had been used for validation once, and the final results were reported as the average performance across all five iterations (Supplementary Figure S2). To interpret the predictions of our models, we employed SHAP [18], a widely used method for estimating the contribution of each feature to a prediction. SHAP provides explanations based on Shapley values derived from game theory. Specifically, we utilized the Python SHAP package [18] and applied the TreeExplainer for GBM. We used the scikit-learn package [19] for most models, except for BRF, which was implemented using the imbalanced-learn package [20]. Default model settings were applied without additional hyperparameter tuning. However, due to class imbalance in the ISHLT dataset, class weight adjustments were applied for LR, RF, SVM, and BRF to account for the imbalance. Additionally, survival analysis based on model predictions was performed using the lifelines Python package [21]. Finally, external validation of the developed models was conducted using our hospital’s in-house dataset, further assessing their generalizability and robustness.

Figure 2. Shapley Additive explanations (SHAP). A SHAP plot illustrates the influence of each feature’s value on the prediction outcome, with red indicating an increase in prediction and blue indicating a decrease. It demonstrates notable impacts in the following sequence: albumin, mean pulmonary artery pressure, and functional status. Increasing albumin levels correlate with decreased mortality (blue), while higher mean pulmonary artery pressure correlates with increased mortality (red). (A) SHAP plot of 25 features, (B) SHAP plot of 10 features.
Statistical Analyses
Variables with excessive missing values (defined as more than 20% missing data) were removed, and patient samples with any missing data were excluded. This approach ensures that the analysis is based on complete cases, minimizing potential biases from missing data. Continuous variables are reported as either the mean ± standard deviation or the median with interquartile range (IQR), depending on their distribution. Statistical comparisons of continuous variables were performed using the Student’s t-test or the Mann-Whitney U test, as appropriate. Categorical variables are expressed as frequencies and percentages and analyzed using the chi-square or Fisher’s exact test, as appropriate. To identify factors associated with 1-year mortality after lung transplantation, we conducted univariate regression analysis. Additionally, we validated the GBM model by comparing the actual survival curves of patients, grouped based on the model’s predictions. Patients predicted by the model to die were classified into the high-risk group (death group), and those predicted to survive were classified into the low-risk group (survival group). Survival curves for these two groups were illustrated using Kaplan-Meier plots, and the log-rank test was performed to compare survival rates between the groups. All statistical analyses were performed using R software (version 4.2.0; R Foundation for Statistical Computing, Vienna, Austria2). Statistical significance was set at P < 0.05.
Results
Patient Characteristics
Among the 29,364 patients included in the study, 4,729 (16.1%) died within 1 year, whereas 24,635 survived. The baseline pretransplant characteristics of the patients are summarized in Table 1. Their average age was 53.4 ± 13.4 years, with 17,032 (58%) males, and the average body mass index (BMI) was 24.9 ± 4.5. The primary diagnoses were as follows: chronic obstructive pulmonary disease (COPD) in 27.9%, idiopathic pulmonary fibrosis (IPF) in 26.8%, cystic fibrosis in 14.6%, nonidiopathic interstitial pneumonia (nonIIP) ILD in 7.3%, nonIPF idiopathic interstitial pneumonia (IIP) in 4.7%, alpha-1 antitrypsin deficiency in 3.4%, idiopathic pulmonary arterial hypertension (IPAH) in 2.8%, noncystic fibrosis (nonCF) bronchiectasis in 2.6%, sarcoidosis in 2.5%, nonIPAH pulmonary hypertension in 1.7%, connective tissue disease-associated ILD (CTDILD) in 1.3%, obliterative bronchiolitis in 0.9%, lymphangioleiomyomatosis in 0.8%, and other diagnoses in 2.6%. Pretransplant, 1,757 patients (6%) were admitted to the intensive care unit (ICU), 1,515 (5.2%) were hospitalized in general wards, and 26,092 (88.9%) were not hospitalized. Additionally, 1,021 patients (3.5%) were on ventilators, and 642 (2.2%) were on extracorporeal membrane oxygenation (ECMO). Pretransplant diabetes was present in 3,288 patients (11.2%), pretransplant dialysis was performed in 61 patients (0.2%), and 64 patients had a history of organ transplantation other than the lung. Additionally, 781 (2.7%) and 2,601 (8.9%) patients had previous heart and lung surgeries, respectively.
Machine Learning-Based Model Performance and Model Interpretation
We performed univariate regression analysis of 1-year mortality. Table 2 presents the odds ratio of the 25 factors used in the model. We evaluated the performance of six machine learning models with 25 pretransplant features for predicting 1-year mortality after lung transplantation. The results are presented in Table 3. We used area under the curve (AUC), accuracy, sensitivity, specificity, positive predictive value, and negative predictive value as evaluation metrics. Most models exhibited high performance, with AUC and accuracy either surpassing or closely approaching 0.9. The GBM model achieved the highest performance, with an AUC and accuracy of 0.958 and 0.949, respectively.

Table 3. Performance of 1-year mortality prediction model after lung transplantation with 25 features.
Feature importance was analyzed using SHAP, with Figure 2 displaying the feature importance for the GBM model. The most important feature was albumin, with lower levels associated with higher mortality 1-year post-transplant. Other significant predictors included mean pulmonary artery pressure, creatinine levels, total bilirubin, and age.
Based on the SHAP results, we trained a new model using the top-10 most important features (Table 4). Despite reducing the number of features from 25 to 10, the performances of the six models remained largely unchanged. Some models, including LR, RF, SVM, and MLP, exhibited slight improvements in AUC, although accuracy was slightly decreased. The GBM model continued to demonstrate the highest performance with an AUC and accuracy of 0.958 and 0.949, respectively, even with only 10 features.

Table 4. Performance of 1-year mortality prediction model after lung transplantation with 10 features.
Actual Survival Curves: Low-Risk vs. High-Risk Groups by the GBM Model
We further validated the GBM model by comparing the actual survival curves of patients who were classified into two groups based on the model’s predictions. Patients predicted by the model to die were assigned to the high-risk group (death group), while those predicted to survive were placed in the low-risk group (survival group). As shown in Figure 3, the survival curves demonstrated a statistically significant difference between the two groups (p < 0.001). Among the predicted high-risk group, 91.4% experienced actual mortality, while 95.5% of the predicted low-risk group survived. These findings highlight the model’s robust predictive performance for distinguishing between mortality and survival outcomes.

Figure 3. Actual survival curves: low-risk vs. high-risk groups by the GBM model. Graph depicting actual survival curves of the group whose survival was predicted by the GBM model.
External Validation on In-House Dataset of PNUYH
To evaluate the generalizability of our model, we conducted an external validation using data from Pusan National University Yangsan Hospital (PNUYH) (Table 5). The model was developed using all 29,364 samples included in this study and subsequently validated on the in-house dataset of PNUYH. Between January 2012 and March 2024, a total of 228 adult patients (aged ≥18 years) underwent lung transplantation at PNUYH. After excluding 12 patients who underwent retransplantation, 216 patients were included in the external validation cohort. Among them, 70 (32.4%) died within 1 year of transplantation. Key preoperative characteristics of these patients are summarized in Supplementary Table S1. Notably, the mortality group had a significantly higher BMI compared to the survival group (23.2 vs. 21.3, p = 0.001). Serum albumin levels were lower in the mortality group (2.5 vs. 3.7, p < 0.001), and steroid use was more frequent (47.1% vs. 24.0%, p = 0.001). Additionally, a history of lung surgery prior to transplantation was more common in the mortality group (5.7% vs. 0.7%, p = 0.021).

Table 5. Results of External Validation for 1-Year Mortality Prediction Model After Lung Transplantation Using 10 Features on in-house dataset of PNUYH.
The PNUYH dataset used for external validation had a significantly different distribution compared to the ISHLT dataset used for training (Supplementary Table S2), with all 10 model variables showing statistically significant differences between the two cohorts. Despite these differences, the GBM, which had the highest performance in the ISHLT dataset, demonstrated excellent external validation results (AUC: 0.852, accuracy: 0.764). Among all tested models, the highest AUC was observed for the SVM model (0.936). However, when considering both AUC and accuracy, the best-performing model was the MLP, a deep learning-based approach. The MLP achieved an AUC of 0.911 and an accuracy of 0.870, consistently outperforming other models across all evaluation metrics. Given the substantial distributional differences between the ISHLT and PNUYH datasets, the MLP model’s strong generalization performance underscores its robustness. These findings highlight the model’s ability to maintain high predictive performance in an external population, supporting its potential clinical utility in lung transplant candidate selection.
Discussion
In this study, we developed and validated a deep learning-based model to predict 1-year survival following lung transplantation using a large, multicenter, international dataset. Our model demonstrated strong predictive performance, effectively identifying key determinants of post-transplant survival. By leveraging GBM techniques, we constructed a highly accurate and robust prediction model. Notably, a simplified version of our model, incorporating only the 10 most influential predictors, achieved performance comparable to that of more complex models utilizing 25 variables. Furthermore, the model’s generalizability was confirmed through external validation using the in-house dataset from PNUYH. The GBM model achieved an AUC of 0.852 and an accuracy of 0.764, and the MLP model demonstrated superior performance, achieving an AUC of 0.911 and accuracy of 0.870. These findings underscore the potential clinical applicability of our model in improving risk stratification and decision-making for lung transplant recipients.
The 10 predictors identified in this study were critical in assessing the potential benefit of lung transplantation. Among these, age, BMI, creatinine levels, total bilirubin levels, and mean pulmonary artery pressure have been well-established as critical indicators for assessing patient urgency and potential benefit in previous Lung Allocation Scores (LAS) [22] and the current Composite Allocation Score (CAS) [23]. Factors such as albumin levels, chronic steroid use, and prior lung surgery, newly highlighted in our study, further emphasize their potential to refine patient assessments and improve prediction accuracy. Albumin is traditionally a marker of nutritional status and inflammation, with lower levels associated with poorer post-transplant outcomes in previous studies [24]. This underscores the importance of evaluating nutritional and inflammatory status during pretransplant assessments. Chronic steroid use, identified in earlier studies, increases post-transplant morbidity and mortality [25], with some suggesting that long-term steroid use may be a contraindication for surgery [26]. Long-term steroid usage increases the risk of infection, poor wound healing, and other complications, all of which may be important predictors of transplant outcomes. Furthermore, prior major lung resection has been recognized as a significant risk factor for increased perioperative mortality and complications such as the need for dialysis [27]. This increased risk can be attributed to factors such as altered anatomy, potential for adhesions, and bleeding, all of which complicate the transplant procedure. These findings highlight the importance of thorough preoperative assessment in patients with a history of major lung resection.
Traditionally, prognosis following lung transplantation has been significantly influenced by the underlying primary disease necessitating the transplant. Different lung diseases impact post-transplant outcomes due to their distinct pathophysiology, patient demographics, and associated comorbidities. For example, patients with cystic fibrosis (CF) generally exhibit better post-transplant survival rates compared to those with idiopathic pulmonary fibrosis or chronic obstructive pulmonary disease [8]. In that regard, patient diagnosis plays an important role in both LAS and CAS, and several studies have incorporated diagnoses into prediction models. In our study, only cystic fibrosis was included as a predictor, but it had the lowest importance in the model (Figure 2B). Additionally, no patients with cystic fibrosis were included in the PNUYH in-house dataset used for external validation. Nevertheless, our model demonstrated excellent predictive performance, suggesting that preoperative conditions, such as organ function, nutritional status, and preoperative hospitalization, may be more critical prognostic indicators than the underlying disease itself. Thus, general aspects of patient health prior to transplantation could be more important than the specific underlying disease in pretransplant management.
While several studies have attempted to predict mortality in lung transplant patients, accurate predictions have not always been achieved due to limited performance (Supplementary Table S3) [28–30]. A recent study accurately predicted 1-year survival using 22 factors, including postoperative variables such as operation time, donor PaO2/FiO2 ratio, postoperative ECMO time, ventilator time, ICU stay, primary graft dysfunction grade, and cold ischemic time, achieving an AUC of 0.921 in patients from a single center [9]. Although these factors provide valuable insights into post-transplant outcomes, they are not available preoperatively, limiting their utility in pretransplant decision-making and patient prioritization. Our study addresses this limitation by focusing on preoperative variables that can be assessed before transplantation, thereby enhancing the ability to predict transplant outcomes and prioritize patients more effectively. Notably, achieving similar predictive accuracy with fewer variables has significant implications for clinical practice, as it simplifies the assessment process and makes it more feasible to implement in diverse healthcare settings without compromising predictive power. Furthermore, the external validation using in-house datasets further underscores the high generalizability of our model. Despite the fundamental differences between the training and validation datasets, the model demonstrated excellent performance, emphasizing its robustness and applicability across varied populations.
The use of machine learning approaches, particularly GBM, in this study highlights the transformative potential of these methods in healthcare. Machine learning models can handle complex interactions between variables and provide more accurate predictions compared to traditional statistical methods. This study demonstrates how machine learning models, such as GBM, can capture nonlinear relationships between variables, such as pulmonary artery pressure and BMI, which may not follow linear patterns. By incorporating these nonlinear interactions, the predictive performance of the model is significantly improved compared to traditional methods. SHAP values were used to visually explore the interactions between variables, providing insights into which characteristics contribute most to predictions. This ability to visualize complex interactions enhances the interpretability of the model, offering a deeper understanding of its decision-making process. Furthermore, comparing linear models (e.g., logistic regression, support vector machines) to nonlinear models (e.g., random forest, XGBoost) illustrates how traditional methods may miss out on capturing nonlinear patterns, which are crucial for accurate prediction of post-transplant outcomes. Our study advocates for the integration of machine learning technologies into clinical workflows. This integration can enhance clinical decision-making, providing more accurate predictions and improving patient outcomes. By leveraging machine learning models, clinicians can identify high-risk patients and tailor pretransplant management strategies to optimize post-transplant survival.
One significant limitation of our study was its reliance on registry data, meaning that the model’s performance depends on the accuracy and completeness of the recorded information. We mitigated this issue by excluding instances with missing data to use the most precise data available. We developed and internally validated the model using the ISHLT registry, which offers a large and diverse sample, and externally validated it using our own cohort, demonstrating the model’s generalizability. Future research should explore incorporating additional potential predictors and leveraging longitudinal data to further refine the model. These efforts would contribute to enhancing the model’s robustness and applicability in clinical practice. Additionally, further validation and ethical considerations should be conducted before applying the model to donor lung allocation, ensuring it addresses any ethical concerns.
In conclusion, our study confirmed that a machine learning-based approach can accurately predict 1-year mortality in lung transplant recipients using a minimal set of pretransplant factors. The development of a streamlined model with high predictive accuracy facilitates better patient selection, ensuring that lung transplantation resources are utilized efficiently and patient care is optimized. This model holds promise for enhancing clinical decision-making and improving post-transplant outcomes in lung transplant recipients. Investigating the underlying mechanisms by which specific pretransplant characteristics influence post-transplant outcomes could further enhance patient management strategies.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Ethics Statement
The studies involving humans were approved by Pusan National University Yangsan Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because Consent was waived as there was no harm to the patient.
Author Contributions
HY contributed to the conceptualization and methodology of the study, and reviewed and edited the manuscript. DN performed the formal analysis, and wrote the original draft of the manuscript. WC and ES reviewed and edited the manuscript. SK validated the data, created visualizations, and also reviewed and edited the manuscript. All authors contributed to the article and approved the submitted version.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by institute of information and communications technology planning and evaluation (IITP) under the artificial intelligence convergence innovation human resources development (IITP-2025-RS-2023-00254177) grant funded by the Korea government (MSIT). This research was supported by a grant of Korean ARPA-H project through the Korea health industry development institute (KHIDI), funded by the Ministry of health and welfare, republic of Korea (grant number: RS-2024-00512375).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI Statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Acknowledgments
We thank the International Society for Heart and Lung Transplantation (ISHLT) registry for providing the data used in this study.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontierspartnerships.org/articles/10.3389/ti.2025.14121/full#supplementary-material
Abbreviations
AUC, area under the curve; GBM, Gradient Boosting Machine; ISHLT, International Society for Heart and Lung Transplantation; SHAP, Shapley Additive explanations.
Footnotes
1https://ishlt.org/research-data/registries/ttx-registry
References
1. Yeo, HJ, Oh, DK, Yu, WS, Choi, SM, Jeon, K, Ha, M, et al. Outcomes of Patients on the Lung Transplantation Waitlist in Korea: A Korean Network for Organ Sharing Data Analysis. J Korean Med Sci. (2022) 37:e294. doi:10.3346/jkms.2022.37.e294
2. Yeo, HJ. Allocation of Donor Lungs in Korea. J Chest Surg (2022) 55:274–6. doi:10.5090/jcs.22.047
3. Yeo, HJ, Kim, DH, Kim, YS, Jeon, D, and Cho, WH. Performance Changes Following the Revision of Organ Allocation System of Lung Transplant: Analysis of Korean Network for Organ Sharing Data. J Korean Med Sci. (2021) 36:e79. doi:10.3346/jkms.2021.36.e79
4. Grimm, JC, Valero, V, Magruder, JT, Kilic, A, Dungan, SP, Silhan, LL, et al. A Novel Risk Score that Incorporates Recipient and Donor Variables to Predict 1-year Mortality in the Current Era of Lung Transplantation. J Heart Lung Transpl (2015) 34(11):1449–54. doi:10.1016/j.healun.2015.07.001
5. Russo, MJ, Davies, RR, Hong, KN, Iribarne, A, Kawut, S, Bacchetta, M, et al. Who Is the High-Risk Recipient? Predicting Mortality after Lung Transplantation Using Pretransplant Risk Factors. J Thorac Cardiovasc Surg (2009) 138(5):1234–8. doi:10.1016/j.jtcvs.2009.07.036
6. Huppmann, P, Neurohr, C, Leuschner, S, Leuchte, H, Baumgartner, R, Zimmermann, G, et al. The Munich-LTX-Score: Predictor for Survival after Lung Transplantation. Clin Transpl (2012) 26(1):173–83. doi:10.1111/j.1399-0012.2011.01573.x
7. Belaroussi, Y, Hustache-Castaing, R, Maury, JM, Lehot, L, Rodriguez, A, Demant, X, et al. Prediction of Survival after a Lung Transplant at 1 Year (SALTO Cohort) Using Information Available at Different Key Time Points. Eur J Cardiothorac Surg (2023) 63(5):ezad167. doi:10.1093/ejcts/ezad167
8. Foroutan, F, Malik, A, Clark, KE, Buchan, TA, Yang, H, Cheong, GHL, et al. Predictors of 1-year Mortality after Adult Lung Transplantation: Systematic Review and Meta-Analyses. J Heart Lung Transpl (2022) 41(7):937–51. doi:10.1016/j.healun.2022.03.017
9. Tian, D, Yan, HJ, Huang, H, Zuo, YJ, Liu, MZ, Zhao, J, et al. Machine Learning-Based Prognostic Model for Patients after Lung Transplantation. JAMA Netw Open (2023) 6:e2312022. doi:10.1001/jamanetworkopen.2023.12022
10. Michelson, AP, Oh, I, Gupta, A, Puri, V, Kreisel, D, Gelman, AE, et al. Developing Machine Learning Models to Predict Primary Graft Dysfunction after Lung Transplantation. Am J Transpl (2024) 24:458–67. doi:10.1016/j.ajt.2023.07.008
11. Gholamzadeh, M, Abtahi, H, and Safdari, R. Machine Learning-Based Techniques to Improve Lung Transplantation Outcomes and Complications: A Systematic Review. BMC Med Res Methodol (2022) 22:331. doi:10.1186/s12874-022-01823-2
12. Hosmer, DW, Lemeshow, S, and Sturdivant, RX. Applied Logistic Regression. 3rd ed. John Wiley and Sons (2013).
13. Cortes, C, and Vapnik, V. Support-vector Networks. Machine Learn (1995) 20:273–97. doi:10.1007/bf00994018
15. Friedman, JH. Greedy Function Approximation: A Gradient Boosting Machine. Ann Stat (2001) 29:1189–232. doi:10.1214/aos/1013203451
16. Chen, C, Liaw, A, and Breiman, L. Using Random Forest to Learn Imbalanced Data. Berkeley: University of California (2004).
17. Rumelhart, DE, Hinton, GE, and Williams, RJ. Learning Representations by Back-Propagating Errors. Nature (1986) 323:533–6. doi:10.1038/323533a0
18. Lundberg, SM, and Lee, S-I. A Unified Approach to Interpreting Model Predictions. Adv Neural Inf Process Syst (2017) 30.
19. Pedregosa, F, Varoquaux, G, Gramfort, A, Michel, V, Thirion, B, and Grisel, O. Duchesnay E. Scikit-Learn: Machine Learning in Python. J Mach Learn Res (2011) 12:2825–30.
20. Lemaitre, G, Nogueira, F, and Aridas, CK. Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning. J Mach Learn Res (2017) 18:1–5.
21. Davidson-Pilon, C. Lifelines: Survival Analysis in Python. J Open Source Softw. (2019) 4:1317. doi:10.21105/joss.01317
22. Eurotransplant. LAS Calculator. (2024). Available online at: https://www.eurotransplant.org/las-calculator/ (Accessed: July, 2024).
23. Organ Procurement and Transplantation Network (OPTN). Lung CAS Calculator. (2024). Available online at: https://optn.transplant.hrsa.gov/data/allocation-calculators/lung-cas-calculator/ (Accessed: July, 2024).
24. Baldwin, MR, Arcasoy, SM, Shah, A, Schulze, PC, Sze, J, Sonett, JR, et al. Hypoalbuminemia and Early Mortality after Lung Transplantation: A Cohort Study. Am J Transpl (2012) 12:1256–67. doi:10.1111/j.1600-6143.2011.03965.x
25. Mosher, CL, Weber, JM, Frankel, CW, Neely, ML, and Palmer, SM. Risk Factors for Mortality in Lung Transplant Recipients Aged ≥65 Years: A Retrospective Cohort Study of 5,815 Patients in the Scientific Registry of Transplant Recipients. J Heart Lung Transpl (2021) 40:42–55. doi:10.1016/j.healun.2020.10.009
26. Schäfers, HJ, Wagner, TO, Demertzis, S, Hamm, M, Wahlers, T, Cremer, J, et al. Preoperative Corticosteroids. A Contraindication to Lung Transplantation? Chest (1992) 102:1522–5. doi:10.1378/chest.102.5.1522
27. Ganapathi, AM, Speicher, PJ, Castleberry, AW, Englum, BR, Osho, AA, Davis, RD, et al. The Effect of Prior Pneumonectomy or Lobectomy on Subsequent Lung Transplantation. Ann Thorac Surg (2014) 98:1922–8. doi:10.1016/j.athoracsur.2014.06.042
28. Chan, EY, Nguyen, DT, Kaleekal, TS, Goodarzi, A, Graviss, EA, Gaber, AO, et al. The Houston Methodist Lung Transplant Risk Model: A Validated Tool for Pretransplant Risk Assessment. Ann Thorac Surg (2019) 108:1094–100. doi:10.1016/j.athoracsur.2019.03.108
29. Gries, CJ, Rue, TC, Heagerty, PJ, Edelman, JD, Mulligan, MS, and Goss, CH. Development of a Predictive Model for Long-Term Survival after Lung Transplantation and Implications for the Lung Allocation Score. J Heart Lung Transpl (2010) 29:731–8. doi:10.1016/j.healun.2010.02.007
Keywords: lung transplantation, mortality, machine learning, risk factors, prediction model
Citation: Yeo HJ, Noh D, Son E, Kwon S and Cho WH (2025) Machine Learning for 1-Year Mortality Prediction in Lung Transplant Recipients: ISHLT Registry. Transpl. Int. 38:14121. doi: 10.3389/ti.2025.14121
Received: 26 November 2024; Accepted: 23 May 2025;
Published: 24 June 2025.
Copyright © 2025 Yeo, Noh, Son, Kwon and Cho. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sunyoung Kwon, c3kua3dvbkBwdXNhbi5hYy5rcg==; Woo Hyun Cho, cG9wZXllczAyMTJAaGFubWFpbC5uZXQ=
†These authors have contributed equally to this work