Evaluation of machine learning algorithms for trabeculectomy outcome prediction in patients with glaucoma

Banna, Hasan Ul; Zanabli, Ahmed; McMillan, Brian; Lehmann, Maria; Gupta, Sumeet; Gerbo, Michael; Palko, Joel

doi:10.1038/s41598-022-06438-7


Article
Open access
Published: 15 February 2022

Volume 12, article number 2473, (2022)

Scientific Reports

Abstract

The purpose of this study was to evaluate the performance of machine learning algorithms to predict trabeculectomy surgical outcomes. Preoperative systemic, demographic and ocular data from consecutive trabeculectomy surgeries from a single academic institution between January 2014 and December 2018 were incorporated into models using random forest, support vector machine, artificial neural networks and multivariable logistic regression. Mean area under the receiver operating characteristic curve (AUC) and accuracy were used to evaluate the discrimination of each model to predict complete success of trabeculectomy surgery at 1 year. The top performing model was optimized using recursive feature selection and hyperparameter tuning. Calibration and net benefit of the final models were assessed. Among the 230 trabeculectomy surgeries performed on 184 patients, 104 (45.2%) were classified as complete success. Random forest was found to be the top performing model with an accuracy of 0.68 and AUC of 0.74 using 5-fold cross-validation to evaluate the final optimized model. These results provide evidence that machine learning models offer value in predicting trabeculectomy outcomes in patients with refractory glaucoma.

Introduction

Glaucomatous optic neuropathy is the leading cause of irreversible blindness worldwide¹. The mainstay of glaucoma treatment is lowering intraocular pressure (IOP), which reduces its occurrence and progression^2,3,4. Lowering IOP is achieved with ocular topical medications, laser therapy, or incisional surgeries. Incisional surgeries are often necessary for patients with refractory glaucoma who are at high risk for progressive vision loss. Trabeculectomy surgery remains one of the most commonly performed incisional glaucoma surgeries⁵. However, its frequency has declined over the last decade secondary to a growing armamentarium of alternative incisional glaucoma procedures with potentially improved safety profiles^6,7,8,9. The ability to quantify a patient’s risk of failure for a given glaucoma procedure would supplement the shared decision-making process between the patient and physician when determining appropriate treatment plans.

Surgical outcome studies have applied machine learning modeling to predict surgical results for patients undergoing procedures such as corneal refractive surgery, joint replacement and a variety of neurosurgical interventions^10,11,12. Yoo et al. found machine learning algorithms statistically superior to classic clinical methods for predicting the complication of corneal ectasia following refractive surgery¹⁰. Their random forest model had the highest prediction performance of the commonly used machine learning algorithms, with an area under the receiver operating characteristic curve (AUC) of 0.967 on an external validation set. Oermann et al. evaluated several machine learning algorithms to predict morbidity and mortality following stereotactic radiosurgery for cerebral arteriovenous malformation¹¹. Their logistic regression model (average AUC 0.71) outperformed existing clinical systems (average AUC 0.63) for predicting poor surgical outcomes at all post-operative time points out to 8 years. Merali et al. applied machine learning to predict quality of life metrics following surgery to treat degenerative cervical myelopathy¹². Their best performing model utilized a random forest algorithm incorporating neurological exam findings and systemic comorbidities to predict quality of life scores with an AUC of 0.71 at 1 year. These studies highlight the objective of using machine learning to provide outcome predictions at an individual level and advance the field of precision medicine. Like most surgeries, failure of trabeculectomy procedures arises from a complex interaction between many factors. Machine learning may be best suited to model complex non-linear and conditional relationships while generating individual patient-level predictions^13,14. The objective of this study was to evaluate machine learning models in their ability to predict real world trabeculectomy outcomes using readily available preoperative patient demographic, ocular and systemic health data.

Results

Of the 296 consecutive trabeculectomy procedures performed, 230 procedures in 184 patients met our criteria and were included in the analysis. At 1 year, 104 eyes (45.2%) were classified as complete successes and 126 (54.8%) as surgical failures. A total of 35 preoperative parameters were collected for model input: 3 demographic parameters, 15 systemic health parameters and 17 ocular parameters. Six dummy variables were used to transform preoperative IOP magnitude and number of topical glaucoma medications into grouped categorical features. Continuous features included age, body mass index (BMI), preoperative visual acuity (VA) and central corneal thickness (CCT). No feature was found to have a significant positive correlation with any other, as shown in Fig. 1. The systemic, demographic and ocular (SDO) dataset consisted of 39 features and the demographic and ocular (DO) dataset of 24 features. Tables 1, 2 and 3 show baseline characteristics of the surgical success and failure groups. On univariate analysis, no statistically significant differences were observed in demographic and ocular features between the success and failure groups. A history of myocardial infarction (MI) was the only systemic health feature with a statistically significant difference between groups on univariate analysis (P = 0.045).

Table 1 Univariate analysis of demographic features recorded in the electronic health record system.


Table 2 Univariate analysis of systemic features recorded in the electronic health record system.


Table 3 Univariate analysis of ocular features recorded in the electronic health record system.


The performance of the four predictive models, evaluated with 5-fold cross-validation, is shown in Table 4 for the DO dataset and Table 5 for the SDO dataset. Random forest (RF) provided the highest accuracy, with values of 0.64 and 0.65 for the DO and SDO datasets, respectively. The average receiver operating characteristic curves for each model are shown in Fig. 2 for both DO and SDO datasets. Random forest also showed the highest mean area under the receiver operating characteristic curve (AUC), with values of 0.64 and 0.68 for the DO and SDO datasets, respectively. The RF model had the lowest sensitivity and highest specificity compared to support vector machine (SVM), logistic regression (LR) and the artificial neural network (ANN).
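
As a minimal sketch of how a mean AUC over validation folds can be computed, the following Python snippet uses the rank interpretation of AUC: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The function names `roc_auc` and `mean_auc` are illustrative assumptions and are not taken from the study's source code.

```python
def roc_auc(labels, scores):
    """AUC as P(score of a positive > score of a negative); ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def mean_auc(fold_results):
    """Average the AUC over folds; each fold is a (labels, scores) pair."""
    return sum(roc_auc(y, s) for y, s in fold_results) / len(fold_results)
```

The pairwise loop is quadratic in the number of cases, which is acceptable for a cohort of a few hundred eyes; a rank-based implementation would be preferable for much larger datasets.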

Table 6 lists the relative contribution of various predictor features from the SDO dataset for the LR model. Features associated with significantly increased risk of trabeculectomy failure in the LR model were use of preoperative statin therapy (OR = 0.74, P = 0.045), preoperative topical prostaglandin analogue (PGA) therapy (OR = 0.52, P = 0.041), a history of MI (OR = 0.32, P = 0.032) and male gender (OR = 0.31, P = 0.023). White race (OR = 2.88, P = 0.046) was significantly associated with trabeculectomy success in the LR model.

The random forest model was chosen for further optimization because of its greater accuracy and AUC on initial evaluation of the models. Feature selection using recursive feature elimination was applied to all 39 features from the SDO dataset and all 24 features from the DO dataset. This process resulted in 19 features for the DO dataset and 20 features for the SDO dataset, as shown in Fig. 3. After feature selection, hyperparameter tuning was performed on the RF model using a grid search scheme varying “mtry” (the number of features randomly sampled as candidates at each split) and “number of trees”. The final optimized random forest model had 500 trees with an “mtry” of 2.
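
The grid search described above can be sketched generically in Python as an exhaustive loop over every combination of candidate values. This is only an illustration: the candidate grid, the `grid_search` helper and the `toy_score` function are hypothetical. In practice `score_fn` would train a random forest with the given settings and return its cross-validated accuracy; here it is a synthetic stand-in so that the snippet runs on its own.

```python
from itertools import product


def grid_search(param_grid, score_fn):
    """Evaluate score_fn on every parameter combination and keep the best."""
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = score_fn(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_score(mtry, ntree):
    # Synthetic stand-in for cross-validated accuracy (NOT study data).
    return 1.0 / (1.0 + abs(mtry - 3)) + ntree / 10000.0


# Hypothetical candidate grid; the values searched in the study are not given.
grid = {"mtry": [2, 3, 4, 5], "ntree": [100, 250, 500]}
best_params, best_score = grid_search(grid, toy_score)
```

Because every combination is evaluated, the cost grows with the product of the grid sizes, which is manageable for two hyperparameters.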

Table 4 Comparison of predictive models trained using the demographic and ocular (DO) dataset.


Table 5 Comparison of predictive models trained using the systemic, demographic and ocular (SDO) dataset.


Table 6 Relative contribution of various features in the multivariate logistic regression model predicting outcomes of trabeculectomy surgical intervention.


The performance of the optimized RF model was evaluated using 5-fold cross validation on the DO and SDO datasets. The model predicted trabeculectomy surgical outcomes with an accuracy of 0.67 and 0.68 and with a mean AUC of 0.68 and 0.74 for the DO and SDO datasets, respectively. Additional discrimination metrics such as sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) for the two models are listed in Table 7. The calibration curves for the DO and SDO random forest models are shown in Fig. 4 with corresponding slopes and intercepts. The decision curve analysis (DCA) plots for the final DO and SDO random forest models are shown in Fig. 5. Figure 6 shows the most important predictive features of the optimized RF model from the SDO dataset.
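
For readers less familiar with these discrimination metrics, the sketch below derives accuracy, sensitivity, specificity, PPV and NPV from binary labels and predictions. The function name and the convention that 1 denotes the positive class are assumptions made for illustration.

```python
def discrimination_metrics(y_true, y_pred):
    """Confusion-matrix metrics for binary labels (1 = positive class)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    def ratio(num, den):
        # Return NaN instead of raising when a denominator is empty.
        return num / den if den else float("nan")

    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),  # true positive rate
        "specificity": ratio(tn, tn + fp),  # true negative rate
        "ppv": ratio(tp, tp + fp),          # positive predictive value
        "npv": ratio(tn, tn + fn),          # negative predictive value
    }
```

Unlike sensitivity and specificity, PPV and NPV depend on class prevalence, so they should be read alongside the 45.2% complete success rate in this cohort.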

Table 7 Comparison of the optimized random forest models trained on the demographic and ocular (DO) features and systemic, demographic and ocular (SDO) features.


Discussion

To our knowledge, this is the first study to use machine learning algorithms to predict trabeculectomy outcomes. In this retrospective study, we developed and compared different machine learning models utilizing preoperative patient data available in electronic health records (EHR) to predict 1-year complete success of consecutive trabeculectomy surgeries. The datasets included features from traditional ocular and demographic variables in addition to readily accessible systemic health data. The RF model marginally outperformed LR, ANN and SVM models using AUC and accuracy as evaluation metrics. Models trained with the full SDO dataset moderately outperformed models trained with the DO dataset in discrimination and net benefit. The performance of our models supports the hypothesis that machine learning models using preoperative features, including systemic health data, have the potential to aid physicians and patients in surgical decision making.

Several studies have investigated the influence of patient and ocular features on trabeculectomy surgical outcomes^15,16,17,18,19,20,21. These studies have identified younger age, black race, previous ophthalmic procedures, worse preoperative VA, preoperative IOP magnitude, greater number of preoperative IOP lowering topical medications and a history of diabetes mellitus as risk factors for trabeculectomy failure. In our logistic regression model, male gender, the use of preoperative PGA drops, history of MI and use of statin therapy were associated with a significantly increased risk of trabeculectomy failure, while white race was associated with trabeculectomy success at 1 year. Although more difficult to interpret than LR models, our RF model identified age, preoperative VA, use of angiotensin II receptor blockers (ARBs), history of MI, CCT and BMI as the most important features in predicting trabeculectomy surgical outcomes using mean decrease in accuracy (MDA). Using mean decrease in impurity (MDI), the most important features were CCT, age, BMI and preoperative VA. The RF model provides new insights into potential factors that may influence trabeculectomy outcomes.

Our optimized RF model had an accuracy of 0.68 and AUC of 0.74 using the SDO dataset. We considered this a reasonable outcome given the complexity of the physiology and the small sample size. In a study using machine learning algorithms to predict complications in deep brain stimulation surgery, a gradient boosting algorithm predicted surgical complications with an accuracy of 0.66 and AUC of 0.58 when the model was applied to the original dataset²². Lei et al. applied machine learning to predict acute kidney injury after aortic arch surgery with an AUC of 0.71 using an RF model²³. Rahman et al. predicted recurrence after esophageal cancer surgery with an AUC of 0.805 using an RF model²⁴. Our classes were relatively balanced (the complete success class was 45.2% of the total sample) compared to other surgical outcome studies using machine learning to predict rarer surgical outcome events^22,24,25,26. It is important to note that we chose a clinically stringent definition of surgical failure to maintain more balanced classes. Our qualified success rate, which includes complete successes and eyes that met the definition of complete success but were also on supplemental medical therapy to lower IOP, was 71.3%. Another strength of the current study was the complete feature data for each patient, obviating the common practice of generating synthetic values for missing data^27,28.

The RF model moderately outperformed the SVM, LR and ANN models. Other studies have shown similar performance results for classification tasks using machine learning models on healthcare datasets^12,29. This is likely due to the intrinsic ability of these models to learn complex non-linear relationships that may be missed by LR models. The poorer performance of the ANN is suspected to be related to the small sample size available for training, since ANNs usually require larger training datasets than RF or SVM. The improved accuracy and AUC of RF compared to SVM is likely due to the ability of RF models to avoid over-fitting on datasets with low sample-to-feature ratios¹². Our dataset has an approximately 5:1 sample-to-feature ratio, which can limit the convergence of SVM to a local minimum. A low sample-to-feature ratio is a common barrier to machine learning in the medical field, and thus RF can be a good choice for classification tasks such as the one considered in this paper.

We evaluated the “weak calibration” of the final DO and SDO random forest models³⁰. The slopes of the calibration curves were 1.20 and 0.89 for the SDO and DO random forest models, respectively. The calibration slope evaluates the spread of the estimated risks; these values indicate that the DO model risk estimates are slightly too extreme (i.e., too high for eyes that are high risk and too low for eyes at low risk) and the SDO estimates too moderate. The intercepts of the calibration curves were −0.12 and −0.13 for the SDO and DO models, respectively. The calibration intercepts, which assess calibration-in-the-large, suggest that both the SDO and DO models overestimate risk somewhat. The DCA provides complementary model information which may help in the decision making process of whether to proceed with trabeculectomy surgery or try alternative treatment strategies. The DCA of the SDO trained model showed a positive net benefit compared to treat-all or treat-none schemes across a threshold probability range from 0.06 to 0.89.
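
The net benefit underlying a decision curve can be sketched as follows, assuming the standard formulation in which net benefit at threshold probability p_t equals TP/n − (FP/n) · p_t/(1 − p_t), where a case counts as "treated" when its predicted risk is at least p_t. The function names and the coding of the event of interest as 1 are assumptions for illustration, not the study's implementation.

```python
def net_benefit(y_true, risk, threshold):
    """Net benefit of acting on predicted risk >= threshold (event coded 1)."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    tp = sum(1 for y, r in zip(y_true, risk) if r >= threshold and y == 1)
    fp = sum(1 for y, r in zip(y_true, risk) if r >= threshold and y == 0)
    weight = threshold / (1.0 - threshold)  # harm of a false positive
    return tp / n - weight * fp / n


def net_benefit_treat_all(y_true, threshold):
    """Reference strategy: act on every case regardless of predicted risk."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)
```

The treat-none strategy has a net benefit of 0 at every threshold, so a model is useful at a given threshold only if its net benefit exceeds both 0 and the treat-all value.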

There were several limitations to the current study. First, the small sample size, with a white to black ratio of approximately 10 to 1, collected retrospectively from a single tertiary teaching hospital, limits the generalization of the model to other populations. Prospective evaluation of the model on other datasets is necessary to determine if the model translates into benefits for future patients. Second, 21.9% of patients were lost to follow up at 1 year, which likely introduces bias into this retrospective cohort. As many of these patients were referred back to their local eye care provider, it is quite possible that the drop out population had a higher complete success rate if not referred back to our institution for further management. This may partially explain the relatively lower percentage of complete success (45.2%) in our cohort compared to previous trabeculectomy outcome studies, in addition to our liberal inclusion of eyes with all glaucoma subtypes, previous ocular surgery, and the high proportion with a preoperative IOP less than 18 mmHg (31.3%)⁶. Third, our model was designed for single layer prediction of success or failure with no ability to predict the cause of failure (i.e., hypotony vs lack of sufficient IOP lowering). Future work will evaluate the ability of stacking machine learning approaches to further predict the cause of trabeculectomy failure³¹. Finally, our study included 230 trabeculectomy surgeries performed on 184 patients, with 46 patients receiving bilateral trabeculectomies, which may have led to some data leakage. We therefore also analyzed the performance of the optimized RF model by considering only one trabeculectomy surgery from each patient. The accuracy and AUC for the SDO features in this subset were 0.66 and 0.70, respectively, a small reduction compared to the full dataset.
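
The one-surgery-per-patient sensitivity analysis can be sketched in Python as a simple de-duplication step. The record layout, the `patient_id` field name and the choice to keep the first surgery encountered are assumptions; the text does not state which surgery was retained for patients with bilateral procedures.

```python
def first_surgery_per_patient(records, patient_key="patient_id"):
    """Keep only the first record seen for each patient, preserving order."""
    seen = set()
    kept = []
    for rec in records:
        pid = rec[patient_key]
        if pid not in seen:
            seen.add(pid)
            kept.append(rec)
    return kept
```

An alternative that retains all eyes is to assign both eyes of a patient to the same cross-validation fold, so that no patient contributes to training and testing at the same time.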

Despite these limitations, we believe this study is an important initial step in evaluating machine learning models to predict glaucoma surgical outcomes. We have shown that machine learning models offer value in predicting trabeculectomy success and that the integration of systemic health data in addition to standard ophthalmic and demographic data can improve model performance. As surgical options in glaucoma expand, predictive models have the potential to improve patient care and aid in the surgical decision making process. Future work will focus on utilizing these algorithms with a larger dataset, such as those that can be provided by the Sight Outcomes Research Collaborative (SOURCE) Ophthalmology Data Repository³².

Methods

Patient population

Patient data was obtained retrospectively from consecutive adult patients undergoing trabeculectomy or trabeculectomy and cataract extraction with intraocular lens implantation from January 2014 to January 2018 at the West Virginia University (WVU) Eye Institute. Approval from the WVU Institutional Review Board through the office of human research protections to collect patient data with a waiver of informed consent was obtained prior to data collection. All research adhered to the tenets of the Declaration of Helsinki and was compliant with the Health Insurance Portability and Accountability Act. Data was collected for each patient via chart review from the hospital electronic health record EPIC (Epic Systems, Verona, WI) by fellowship trained glaucoma surgeons (JP, SG, and BM). The features and outcomes data were then reviewed for completeness and accuracy (JP), and organized within a secure tabular data sheet. No outliers were removed from the dataset and no missing-value management was required. Data collected included preoperative systemic health data, preoperative and postoperative ocular data and demographic data. All patients were 18 years or older with primary open-angle, pseudoexfoliation, pigment dispersion, juvenile and chronic primary angle closure glaucoma. Exclusion criteria included patients under 18 years of age, those who received an Express shunt (Alcon, Fort Worth, Texas, USA) and patients with less than 1 year of follow up at our institution.

Trabeculectomy surgical technique

All patient surgeries were completed by or with the guidance of a fellowship trained glaucoma surgeon at our institution’s outpatient surgical center. All trabeculectomies were performed in a similar manner. A subconjunctival injection of 1% lidocaine mixed with 40 mcg of mitomycin C (0.2 mg/mL) was given superiorly prior to conjunctival incision. A fornix-based conjunctival flap, partial thickness rectangular scleral flap and sclerostomy were created. A peripheral iridectomy was performed in all phakic eyes and in pseudophakic eyes with any iris prolapse. The scleral flap was closed tightly with interrupted 10-0 nylon sutures to allow flow only with supraphysiologic IOP. The conjunctiva was reapposed with 8-0 polyglactin interrupted wing sutures. All patients received antibiotic drop prophylaxis for 1 week and steroid drops for a minimum of 4 weeks postoperatively. Selective laser suture lysis was performed in the postoperative period for IOP titration.

Baseline features, outcome classification and class balancing

Criteria for selecting input features for the models included existing evidence in the literature suggesting a relationship between the feature and the surgical outcome, clinical domain knowledge and availability of the feature in the dataset. Baseline features included patient demographic and systemic health data (e.g., co-morbidities, chronic medications, smoking status, etc.). Preoperative ocular data included prior ocular history, current glaucoma drops and relevant exam data at the appointment in which the decision was made to proceed with trabeculectomy surgery. A full list of preoperative features is shown in Tables 1, 2 and 3. The primary outcome classification was surgical failure of trabeculectomy at the 1 year postoperative visit. Surgical failure was defined as IOP \(> 21\) mmHg or \(< 5\) mmHg at two consecutive visits after 3 months, less than a 20% IOP reduction at two consecutive visits after 3 months, a need for reoperation for glaucoma or loss of light perception vision. Eyes which had not failed by the above criteria and were not receiving supplemental medical therapy to lower IOP were considered complete successes and the remaining eyes were considered failures. Postoperative manipulations of the trabeculectomy site (e.g., needling revisions) were allowed in the success class if the patient did not require a return to the operating room. Postoperative manipulations were not included as features given the goal of the model to aid in preoperative surgical decision making. For predictive modeling, balanced outcome classes (labels) are important to avoid biased learning by the models. To handle class imbalance, up-sampling was carried out to equalize the frequency of the underrepresented class³⁹.
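
Up-sampling of the underrepresented class can be sketched as random sampling with replacement until all classes reach the size of the largest one. The sketch below is a plain Python illustration under that assumption; the function name and the fixed seed are hypothetical, and the exact routine used in the study is not specified in the text.

```python
import random


def upsample_minority(rows, labels, seed=0):
    """Resample smaller classes with replacement up to the largest class size."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # Draw the missing number of rows at random from the same class.
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels
```

With 104 successes and 126 failures, this would yield 126 rows per class. When combined with cross-validation, resampling is generally applied to the training folds only, so that duplicated rows cannot appear in both the training and test data.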

Validation and performance analysis of predictive models

Cross validation is primarily used to estimate the performance of the predictive models and to avoid over fitting. We used a k-fold cross-validation scheme with \(k=5\)^40,41. The data set is divided into k folds of approximately equal size. \(k-1\) folds are used to train the predictive model and the trained model is tested on the remaining fold. This procedure is repeated k times so that each fold serves once as the test set. The overall predictive performance is determined by aggregating the performance over all validation folds. We evaluated four metrics to analyze predictive performance: accuracy, sensitivity, specificity and AUC. The model with the best performance was selected for further recursive feature selection and hyper-parameter tuning. Improvement in surgical outcome prediction accuracy was used as a measure to select the optimal features from each dataset scenario. The set of features that produced the highest accuracy was selected for model training/testing and the remaining features were eliminated. Following feature selection, the best performing model in terms of accuracy was chosen from the entire grid of generated models. Calibration of the final SDO and DO random forest models was analyzed using a calibration plot³⁰. The calibration slope, with an optimal value of 1, was calculated to assess whether predictions were precise or too extreme. Calibration-in-the-large, or the y-intercept of the calibration plot, which indicates the degree to which predictions are systematically too low or too high and has an optimal value of 0, was also calculated. A decision curve analysis (DCA) was used to evaluate the clinical usefulness of the final SDO and DO random forest models by calculating their net benefit across a range of clinical threshold probabilities^42,43. An advantage of DCA is that it incorporates preferences (patient and physician), represented as the threshold probability of choosing or opting out of a treatment, across a range of probabilities. Ranked feature importance using the mean decrease in accuracy (MDA) and mean decrease in impurity (MDI or Gini importance) methods was performed on this final model. All source code is available for public use on Github at https://github.com/HasanulbannaR/ML_Trab.git.
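
The k-fold procedure described above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the study's implementation: the helper names, the shuffling seed and the `fit_and_score` callback (a user-supplied function that trains on the training indices and returns a score on the test indices) are all hypothetical.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and deal them into k folds of near-equal size."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate(n_samples, fit_and_score, k=5, seed=0):
    """Train on k-1 folds, test on the held-out fold, and average the scores."""
    folds = k_fold_indices(n_samples, k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        scores.append(fit_and_score(train_idx, test_idx))
    return sum(scores) / len(scores)
```

For 230 surgeries and k = 5, each fold holds 46 cases, so every model is trained on 184 cases and tested on the remaining 46.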

References

Tham, Y.-C. et al. Global prevalence of glaucoma and projections of glaucoma burden through 2040: A systematic review and meta-analysis. Ophthalmology 121, 2081–2090 (2014).
Kass, M. A. et al. The ocular hypertension treatment study: A randomized trial determines that topical ocular hypotensive medication delays or prevents the onset of primary open-angle glaucoma. Arch. Ophthalmol. 120, 701–713 (2002).
Heijl, A. et al. Reduction of intraocular pressure and glaucoma progression: Results from the early manifest glaucoma trial. Arch. Ophthalmol. 120, 1268–1279 (2002).
Group, C. N.-T. G. S. et al. Comparison of glaucomatous progression between untreated patients with normal-tension glaucoma and patients with therapeutically reduced intraocular pressures. Am. J. Ophthalmol. 126, 487–497 (1998).
Rathi, S., Andrews, C. A., Greenfield, D. S. & Stein, J. D. Trends in glaucoma surgeries performed by glaucoma subspecialists versus nonspecialists on medicare beneficiaries from 2008–2016. Ophthalmology (2020).
Gedde, S. J. et al. Treatment outcomes in the tube versus trabeculectomy (tvt) study after five years of follow-up. Am. J. Ophthalmol. 153, 789–803 (2012).
Ramulu, P. Y., Corcoran, K. J., Corcoran, S. L. & Robin, A. L. Utilization of various glaucoma surgeries and procedures in medicare beneficiaries from 1995 to 2004. Ophthalmology 114, 2265–2270 (2007).
Chen, P. P., Yamamoto, T., Sawada, A., Parrish, R. 2nd. & Kitazawa, Y. Use of antifibrosis agents and glaucoma drainage devices in the American and Japanese glaucoma societies. J. Glaucoma 6, 192–196 (1997).
Joshi, A. B. et al. 2002 survey of the American glaucoma society: Practice preferences for glaucoma surgery and antifibrotic use. J. Glaucoma 14, 172–174 (2005).
Yoo, T. K. et al. Adopting machine learning to automatically identify candidate patients for corneal refractive surgery. NPJ Digit. Med. 2, 1–9 (2019).
Oermann, E. K. et al. Using a machine learning approach to predict outcomes after radiosurgery for cerebral arteriovenous malformations. Sci. Rep. 6, 1–12 (2016).
Merali, Z. G., Witiw, C. D., Badhiwala, J. H., Wilson, J. R. & Fehlings, M. G. Using a machine learning approach to predict outcome after surgery for degenerative cervical myelopathy. PLoS ONE 14, e0215133 (2019).
Joshi, R. S., Haddad, A. F., Lau, D. & Ames, C. P. Artificial intelligence for adult spinal deformity. Neurospine 16, 686 (2019).
Shi, H.-Y., Hwang, S.-L., Lee, K.-T. & Lin, C.-L. In-hospital mortality after traumatic brain injury surgery: A nationwide population-based comparison of mortality predictors used in artificial neural network and logistic regression models. J. Neurosurg. 118, 746–752 (2013).
Issa de Fendi, L., Cena de Oliveira, T., Bigheti Pereira, C., Pereira Bigheti, C. & Viani, G. A. Additive effect of risk factors for trabeculectomy failure in glaucoma patients: A risk-group from a cohort study. J. Glaucoma 25, e879–e883 (2016).
Chiu, H.-I., Su, H.-I., Ko, Y.-C. & Liu, C. J.-L. Outcomes and risk factors for failure after trabeculectomy in taiwanese patients: medical chart reviews from 2006 to 2017. Br. J. Ophthalmol. (2020).
Landers, J., Martin, K., Sarkies, N., Bourne, R. & Watson, P. A twenty-year follow-up study of trabeculectomy: Risk factors and outcomes. Ophthalmology 119, 694–702 (2012).
Edmunds, B., Bunce, C. V., Thompson, J. R., Salmon, J. F. & Wormald, R. P. Factors associated with success in first-time trabeculectomy for patients at low risk of failure with chronic open-angle glaucoma. Ophthalmology 111, 97–103 (2004).
Fontana, H., Nouri-Mahdavi, K., Lumba, J., Ralli, M. & Caprioli, J. Trabeculectomy with mitomycin c: Outcomes and risk factors for failure in phakic open-angle glaucoma. Ophthalmology 113, 930–936 (2006).
investigators, A. et al. The advanced glaucoma intervention study (agis): 12. baseline risk factors for sustained loss of visual field and visual acuity in patients with advanced glaucoma. Am. J. Ophthalmol. 134, 499–512 (2002).
Group, C.-T. S. et al. A phase iii study of subconjunctival human anti-transforming growth factor \(\beta\)2 monoclonal antibody (cat-152) to prevent scarring after first-time trabeculectomy. Ophthalmology 114, 1822–1830 (2007).
Farrokhi, F. et al. Investigating risk factors and predicting complications in deep brain stimulation surgery with machine learning algorithms. World Neurosurg. 134, e325–e338 (2020).
Lei, G., Wang, G., Zhang, C., Chen, Y. & Yang, X. Using machine learning to predict acute kidney injury after aortic arch surgery. J. Cardiothorac. Vasc. Anesth. 34, 3321–3328 (2020).
Rahman, S. A. et al. Machine learning to predict early recurrence after oesophageal cancer surgery. J. Br. Surg. 107, 1042–1052 (2020).
Lu, S. et al. Machine-learning-assisted prediction of surgical outcomes in patients undergoing gastrectomy. Chin. J. Cancer Res. 31, 797 (2019).
Jalali, A. et al. Deep learning for improved risk prediction in surgical outcomes. Sci. Rep. 10, 1–13 (2020).
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Zhang, H. Imbalanced Binary Classification On Hospital Readmission Data With Missing Values. Ph.D. thesis, UCLA (2018).
Couronné, R., Probst, P. & Boulesteix, A.-L. Random forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinform. 19, 1–14 (2018).
Van Calster, B., McLernon, D. J., Van Smeden, M., Wynants, L. & Steyerberg, E. W. Calibration: The achilles heel of predictive analytics. BMC Med. 17, 1–7 (2019).
Hasan, M. M. et al. Hlppred-fuse: Improved and robust prediction of hemolytic peptide and its activity by fusing multiple feature representation. Bioinformatics 36, 3350–3356 (2020).
Bommakanti, N. K. et al. Application of the sight outcomes research collaborative ophthalmology data repository for triaging patients with glaucoma and clinic appointments during pandemics such as covid-19. JAMA Ophthalmol. 138, 974–980 (2020).
Longadge, R. & Dongre, S. Class imbalance problem in data mining review. arXiv preprint arXiv:1305.1707 (2013).
Azim, R. et al. A decision tree based approach for microgrid islanding detection. In 2015 IEEE Power Energy Society Innovative Smart Grid Technologies Conference (ISGT), 1–5 (2015).
Han, J., Pei, J. & Kamber, M. Data mining: Concepts and techniques (Elsevier, 2011).
Liaw, A. & Wiener, M. Classification and regression by randomForest. R News 2, 18–22 (2002). https://CRAN.R-project.org/doc/Rnews/.
Jordan, A. et al. On discriminative vs. generative classifiers: A comparison of logistic regression and naive Bayes. Adv. Neural. Inf. Process. Syst. 14, 841 (2002).
Kuhn, M. Building predictive models in R using the caret package. J. Statist. Softw. 28, 1–26 (2008).
Meyer, D. Misc functions of the department of statistics, probability theory group. J. Statist. Softw. 28, 1–26 (2008). https://www.jstatsoft.org/v028/i05.
Hasan, M. M. et al. NeuroPred-FRL: An interpretable prediction model for identifying neuropeptide using feature representation learning. Brief. Bioinform. (2021).
Hasan, M. M. et al. Meta-i6mA: An interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework. Brief. Bioinform. 22, bbaa202 (2021).
Vickers, A. J. & Elkin, E. B. Decision curve analysis: A novel method for evaluating prediction models. Med. Decis. Mak. 26, 565–574 (2006).
Vickers, A. J., van Calster, B. & Steyerberg, E. W. A simple, step-by-step guide to interpreting decision curve analysis. Diagn. Progn. Res. 3, 1–8 (2019).

Author information

Authors and Affiliations

Department of Ophthalmology and Visual Sciences, West Virginia University School of Medicine, Morgantown, WV, 26506, USA
Hasan Ul Banna, Ahmed Zanabli, Brian McMillan, Maria Lehmann, Sumeet Gupta, Michael Gerbo & Joel Palko

Contributions

J.P. conceived the experiment; J.P., A.Z., B.M., M.L., S.G. and M.G. collected and organized data; H.U., A.Z. and J.P. analysed the results. All authors reviewed the manuscript.

Corresponding author

Correspondence to Joel Palko.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Banna, H.U., Zanabli, A., McMillan, B. et al. Evaluation of machine learning algorithms for trabeculectomy outcome prediction in patients with glaucoma. Sci Rep 12, 2473 (2022). https://doi.org/10.1038/s41598-022-06438-7

Received: 07 April 2021
Accepted: 18 January 2022
Published: 15 February 2022
DOI: https://doi.org/10.1038/s41598-022-06438-7
Springer Nature Limited
