Combining Machine Learning with Metabolomic and Embryologic Data Improves Embryo Implantation Prediction

Cheredath, Aswathi; Uppangala, Shubhashree; C. S, Asha; Jijo, Ameya; R, Vani Lakshmi; Kumar, Pratap; Joseph, David; G.A, Nagana Gowda; Kalthur, Guruprasad; Adiga, Satish Kumar

doi:10.1007/s43032-022-01071-1

Combining Machine Learning with Metabolomic and Embryologic Data Improves Embryo Implantation Prediction

Embryology: Original Article
Open access
Published: 12 September 2022

Volume 30, pages 984–994, (2023)
Cite this article

Download PDF

You have full access to this open access article

Reproductive Sciences Aims and scope Submit manuscript

Combining Machine Learning with Metabolomic and Embryologic Data Improves Embryo Implantation Prediction

Download PDF

Aswathi Cheredath¹,
Shubhashree Uppangala²,
Asha C. S³,
Ameya Jijo¹,
Vani Lakshmi R⁴,
Pratap Kumar⁵,
David Joseph⁶,
Nagana Gowda G.A⁷,
Guruprasad Kalthur⁸ &
…
Satish Kumar Adiga ORCID: orcid.org/0000-0002-2897-4697¹

2777 Accesses
1 Altmetric
Explore all metrics

Abstract

This study investigated whether combining metabolomic and embryologic data with machine learning (ML) models improve the prediction of embryo implantation potential. In this prospective cohort study, infertile couples (n=56) undergoing day-5 single blastocyst transfer between February 2019 and August 2021 were included. After day-5 single blastocyst transfer, spent culture medium (SCM) was subjected to metabolite analysis using nuclear magnetic resonance (NMR) spectroscopy. Derived metabolite levels and embryologic parameters between successfully implanted and failed groups were incorporated into ML models to explore their predictive potential regarding embryo implantation. The SCM of blastocysts that resulted in successful embryo implantation had significantly lower pyruvate (p<0.05) and threonine (p<0.05) levels compared to medium control but not compared to SCM related to embryos that failed to implant. Notably, the prediction accuracy increased when classical ML algorithms were combined with metabolomic and embryologic data. Specifically, the custom artificial neural network (ANN) model with regularized parameters for metabolomic data provided 100% accuracy, indicating the efficiency in predicting implantation potential. Hence, combining ML models (specifically, custom ANN) with metabolomic and embryologic data improves the prediction of embryo implantation potential. The approach could potentially be used to derive clinical benefits for patients in real-time.

Development of a Novel Non-invasive Metabolomics Assay to Predict Implantation Potential of Human Embryos

Article 04 June 2024

Prediction model for day 3 embryo implantation potential based on metabolites in spent embryo culture medium

Article Open access 08 June 2023

Non-invasive metabolomic profiling of embryo culture media and morphology grading to predict implantation outcome in frozen-thawed embryo transfer cycles

Article 10 October 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Embryo morphology is independent of many factors that play crucial roles in embryo viability [1,2,3,4,5]. Despite its known limitations, assessing embryo morphology remains the standard approach for embryo quality assessment [6, 7]. To overcome these limitations, new techniques such as time-lapse imaging, metabolomics, and preimplantation genetic testing for aneuploidy (PGT-A) are being evaluated as alternative approaches for predicting embryo implantation potential [6].

Biomarkers derived from a metabolomics approach have shown contradictory results regarding predicting embryo viability and pregnancy outcomes [8,9,10,11,12,13]. Further, there is no conclusive evidence that embryo metabolomic data alone can significantly improve the prediction of assisted reproductive technology (ART) outcomes [8, 14]. Hence, there is a continued search for tools that can accurately assess embryo implantation potential alone or in conjunction with other non-invasive methods.

Artificial intelligence (AI)–based models outdo human learning and decision-making even with limited sample sizes [15, 16]. In ART, AI-based analysis combined with patient characteristics, embryo morphokinetics, or embryo microscopic image analysis has been used to predict implantation and pregnancy outcomes [17,18,19,20,21]. The combination of “omics” technology and machine learning (ML) has been suggested to be able to improve ART outcome prediction [22]. A recent study demonstrated that combining a deep learning model with day-3 metabolite profiles predicted blastocyst development [23]. However, we believe that an accurate prediction of implantation potential has a higher clinical value than that of blastulation. Therefore, our approach in this study was to explore the possibility of incorporating metabolomic profiles of human blastocyst spent culture medium (SCM) and embryologic data into ML models to enhance the accuracy of embryo implantation prediction in patients undergoing single blastocyst transfer cycles.

Materials and Methods

Patient Selection

This prospective study included 56 couples undergoing ART at a university infertility clinic between February 2019 to August 2021. The study was initiated after obtaining approval from the Institutional Ethics Committee (Ref. 429/2019). Written informed consent was obtained from all study participants. Patients fulfilling the following criteria were included in this study: (i) women less than 35 years of age having regular menstrual cycles; (ii) no medical history of surgery or any abnormalities diagnosed related to reproductive organs; (iii) absence of conditions such as endometriosis, adenomyosis, tubal abnormalities, uterine myoma, and other metabolic/endocrinological diseases, such as hypo/hyperthyroidism or hyperprolactinemia; (iv) the male partners with semen characteristics above the WHO 2010 reference range. In addition, only couples undergoing intracytoplasmic sperm injection followed by day-5 single embryo transfer were included in this study. Patient information, including demographic characteristics and data from routine clinical investigations, is presented in Table 1.

Table 1 Patient’s demographics and clinical characteristics

Full size table

Controlled Ovarian Stimulation (COS) and Oocyte Aspiration

An antagonist protocol was used for COS. Briefly, recombinant follicle-stimulating hormone (rFSH; Gonal F®; Merck Biopharma), with a dose ranging from 225 to 450 IU/day based on age, was administered from the second day of the menstrual cycle, and anti-Müllerian hormone (AMH) level and antral follicular count (AFC) were assessed. Subsequently, rFSH dose adjustment (either increase or decrease) was conducted based on the ovarian response until the day before human chorionic gonadotropin (hCG) administration. Pituitary downregulation was achieved by administering a gonadotropin-releasing hormone (GnRH) antagonist (Citrotide® 0.25 mg; Merck Biopharma) from day 5 of COS. Recombinant hCG (Ovitrelle® 250 mg; Merck Biopharma) was used to trigger the final oocyte maturation when at least four follicles reached a mean diameter of 18 mm. Oocyte cumulus complexes were collected via the ultrasound-guided transvaginal route, rinsed, and placed in ONESTEP medium (#V-OSM-20; Vitromed GmbH, Germany) at 37°C in 6% CO₂ for 2–3 h until enzymatic denudation.

Fertilization and Embryo Evaluation

Intracytoplasmic sperm injection was used to fertilize mature (metaphase II) oocytes. Injected oocytes were then washed and cultured individually in a 30-μL droplet of ONESTEP medium overlaid with oil (#V-OIL-P100; Vitromed GmbH, Germany) at 37°C, 6% CO₂, and 5% O₂ in a MIRI® Multiroom incubator (ESCO Medical, Singapore). Fertilization was assessed at 16–18 h after the intracytoplasmic sperm injection. Embryos were evaluated on day 3 and day 5 of development as per the European Society of Human Reproduction and Embryology (ESHRE) consensus [24]. On day 5, only one top-quality blastocyst (3, 1, 1 or 4, 1, 1) was selected for transfer. If the fresh transfer was not performed, embryos were cryopreserved by vitrification for subsequent transfer.

SCM samples from transferred/frozen blastocysts (n=56) along with medium control samples (droplets of ONESTEP medium without an embryo) (n=44) were carefully collected without oil contamination, and 25 μL of each was placed into a labeled sterile cryovial, snap-frozen in liquid nitrogen, and stored at −80°C until nuclear magnetic resonance (NMR) spectroscopic analysis.

NMR Sample Preparation and Analysis

A dilution solution was prepared using D₂O (deuterium oxide), with TSP (sodium salt of 2,2,3,3 tetradeutero-3-(trimethylsilyl propionate) as the standard reference compound; 0.05 g TSP/mL D₂O was diluted by a factor of 10 using D₂O. After thawing the SCM and medium control samples at room temperature, 25 μL was diluted with a 10 μL dilution solution. The mixture was then transferred to 1.7-mm NMR tubes. Thus, all the metabolites present in the samples were diluted up to 1.4 times with the dilution solution.

NMR experiments were carried out using a Bruker 800-MHz AVANCE III NMR spectrometer (Bruker Biospin Ag, Fällanden, Switzerland) equipped with a 1.7-mm cryo-probe at 298 K. One-dimensional (1D) ¹H NMR spectra were obtained using the Carr-Purcell-Meiboom-Gill (CPMG) pulse sequence. A CPMG 180° pulse train for a duration of 12 ms was used to suppress residual protein signals from the media. Each spectrum was obtained using 9615-Hz spectral width, 5-s relaxation delay, 16-k time domain points, 4 dummy scans, and 256 transients. The time domain data (free induction decay) were apodized with a shifted sine bell window function (SSB = 2) and zero-filled to 65536 points prior to Fourier transformation. TopSpin v3.6.2 (Bruker) was used for NMR data acquisition and processing.

A total of 100 1D ¹H spectra were acquired, comprising spectra related to the SCM of the embryos (n=56) and medium control samples (n=44). Based on the human metabolome database [25, 26], 13 metabolite peaks were identified: nine amino acid metabolites (leucine, Leu; isoleucine, Ile; valine, Val; methionine, Met; threonine, Thr; lysine, Lys; tyrosine, Tyr; histidine, His; phenylalanine, Phe) and four carbohydrate and metabolic intermediates (pyruvate, Pyr; lactate, Lac; citrate, Cit; glucose, Glc). Relative concentrations of the identified metabolites were then determined by normalizing the metabolite peak integrals to the peak integral of the internal standard, TSP. Further region-wise integration was performed with “intser” in TopSpin v3.6.2; each spectrum was divided into 30 integral regions.

ML Model Training and Testing Procedures

A flowchart of the ML model training and testing procedures is shown in Fig. 1. In order to compare the performance of classical ML programs, several well-known ML algorithms were considered. Nearest neighbors, linear support vector machine (SVM), radial basis function (RBF) SVM, gaussian process, decision tree, random forest, neural net, AdaBoost, and naïve Bayes were used and then compared to custom artificial neural network (ANN)–based binary classification models. As the above classical ML models have an overfitting issue, a custom ANN model was incorporated to provide a better prediction with weight regularization. The samples were randomly divided into two groups: the training set constituted 80% samples (which was used to train the models to predict embryo implantation potential) and the testing set constituted 20% samples (which was used to check and validate the performance of the models).

Input and Output Data

Prediction models were constructed using three sets of data: (i) SCM metabolites identified by NMR spectroscopy; (ii) oocyte and embryologic characteristics such as number of matured oocytes retrieved, maturation rate, fertilization rate, number of nucleolar precursor bodies (NPBs) observed in the zygote, number of embryos progressed to day 3, blastocyst rate and quality (on day 5), and the grade of the embryo preferred for the transfer (on day 3 and day 5); and (iii) various combinations of metabolites and oocyte/embryologic characteristics (selecting metabolites based on their roles in different metabolic pathways). Further, each combination involved oocyte and embryologic characteristics along with the following combination of metabolites: combination 1, Glc, Pyr, and Lac; combination 2, Glc, Pyr, and Cit; combination 3, Phe and Tyr; combination 4, Pyr, Cit, Lys, and Thr; combination 5, Glc, Pyr, Thr, Met, and Ile; and combination 6, Glc, Pyr, Cit, Ile, Leu, and Val. The exact parameters involved in each dataset are given in supplementary Table 1. The output data comprised the implantation potential of the individually transferred blastocysts. The input data were preprocessed and transformed to the same scale. The features involved both numeric and nonnumeric data. Nonnumeric data were converted to numeric data and then normalized to obtain values in a similar range.

Data Classification Using Custom ANN

The custom ANN was built using a sequential model. The variables were first initialized, after which layers were added using the dense functionality, forming the layout of the model. Subsequently, procedures involving a loss function, an Adam optimizer, and metrics (to assess model performance) were conducted. Data on 56 SCM samples were used, with 44 (80%) being used to train the model and 12 (20%) being used to test it. The model was trained using the training data for 50 epochs. Epochs refer to the number of times that the custom. ANN goes through the training data. The model parameters are noted in Supplementary Table 2. The first layer consisted of 30 or 50 neurons, with a rectified linear unit (ReLU) as the activation function followed by a single neuron with a sigmoid activation function. Adam optimizer was used with a learning rate of 0.001 with binary cross-entropy as the loss function. The classical ML model performance was assessed using several metrics, including confusion matrix, receiver operating characteristic (ROC) curve, area under the ROC curve (AUC), and accuracy, whereas accuracy and loss curves are employed in custom ANN for measuring the performance. Typically, the cross-entropy loss is used as loss function for binary classification problems involving ANN models in which the predicted output probability is compared to the actual output. The computed score penalizes the probability-based on the distance from the actual value. The logarithmic penalty yields a small value for a small difference and a large value for a large difference. The objective function involves minimizing the cross-entropy loss, and smaller values represent a better model. A perfect model has a cross-entropy loss of zero. Cross-entropy for a binary or two-class prediction problem is calculated as the mean cross-entropy across all examples. The custom ANN model was run with a batch size of 8 and a total number of 50 epochs. A similar procedure was conducted for each dataset (i.e., the metabolites, embryologic, and combination datasets).

Software

Data analysis was implemented using https://colab.research.google.com, with TensorFlow, Keras, Sklearn, and NumPy library available in Python v3.7. The plots were created using the Matplotlib library.

Statistical Analysis

The participants’ demographic and clinical data are presented as mean ± standard error of the Mean (SEM). Statistical differences in metabolite levels between SCM and medium control samples were assessed by independent-sample t tests. Statistical differences in metabolite levels among SCM samples from blastocysts that resulted in successful embryo implantation, SCM samples from blastocysts that resulted in embryos that failed to implant, and medium control samples were assessed by repeated-measure analysis of variance (ANOVA) followed by post hoc Tukey’s tests in Jamovi v1.8.1 [27]. Principal component (PC) analysis was carried out in CRAN R v4.0 [28], to explore metabolic differences based on 30 integral regions of 1D ¹H spectra from samples in the three groups. A two-dimensional bi-plot visualized the first two PCs (PC₁ and PC₂), which accounted for 99.61% of the variability in the data. The level of significance was set at <0.05 throughout the study.

Results

Patient Characteristics and Embryo Implantation Outcomes

This prospective study included 56 infertile couples who underwent a single day-5 blastocyst transfer during their ART cycle. Patient demographic and clinical characteristics are summarized in Table 1. Notably, only one top-quality day-5 blastocyst was used for transfer. The endometrial thickness in patients was 10.06 ±0.32 mm. In cases involving frozen embryo transfer cycles, patients were followed up until frozen embryo transfer. Implantation was considered successful when the beta hCG level was >100 mIU/mL on day 14 post embryo transfer. Out of the 56 patients, 23 had successful embryo implantation, and 33 had embryos that failed to implant. The implantation rate was 41%.

Variation in Relative Levels of Metabolites in SCM

To understand metabolite utilization by the blastocysts, metabolite levels were compared (i) between SCM and medium control samples, (ii) between SCM samples from successfully implanted embryos and medium control samples and between SCM samples from embryos that failed to implant and medium control samples, and (iii) between SCM samples from successfully implanted embryos and SCM samples from embryos that failed to implant. Supplementary Fig. 1 depicts a representative 1D ¹H NMR spectrum of ONESTEP medium with peak assignment. Significant reductions in the pyruvate (p<0.001) and threonine (p<0.002) levels were observed in SCM samples relative to medium control samples (Table 2), indicating that the embryos utilized the metabolites from the culture media. Although similar trends were observed in other metabolites, the differences were not significant. Further, there was a significant difference in the pyruvate level (relative to medium control) for SCM from both successfully implanted embryos (p<0.05) and embryos that failed to implant (p<0.001), and in the threonine level for SCM from successfully implanted embryos (p<0.05). Of note, statistical significance was not demonstrated in relative metabolite levels between the successful and failed implantation groups (Fig. 2A and Table 2).

Table 2 Comparison of the relative concentration of metabolites (normalized to TSP) across the study groups with the medium control

Full size table

To explore the differences in the unidentified metabolites in the NMR profiles, each spectrum was divided into 30 integral regions. PC analysis of the 30 integral regions (based on 100 samples from 56 patients) was used to explore the variance among the three groups. Fig. 2B shows the resulting two-dimensional PC bi-plot of PC₁ vs PC₂, with overlap** data points from three groups which accounted for 99.61% of the variability in the data. There were no identifiable differences in SCM metabolites (relative to medium control) between the implanted and failed embryos (Fig. 2B). Overall, using only SCM metabolite levels determined by NMR spectroscopy did not successfully discriminate among embryos based on their implantation potential.

Use of ML Models in Predicting Embryo Implantation Potential

Initially, classical ML models (nearest neighbors, linear SVM, RBF SVM, gaussian process, decision tree, random forest, neural net, AdaBoost, and naïve Bayes) alone or in conjunction with metabolomic data and/or embryologic data were used to predict the implantation potential of the embryos. Naïve Bayes, AdaBoost, and decision tree performed well when using the metabolite dataset and provided 100% accuracy even with a small dataset (Table 3). Decision tree, random forest, neural net, AdaBoost, and naïve Bayes provided 100% accuracy when using the embryologic data collected from 56 patients (Table 3). However, when combining the metabolomic and embryologic data, the prediction accuracy of all the classical ML models increased, with accuracies of 80–100%. Notably, the combination 3 and 5 datasets provided 100% accuracy in all ML models assessed. The performance of ML model was evaluated based on a confusion matrix, ROC curve, and precision-recall curve (Fig. 3A–C). The confusion matrix provides the details of false positive, false negative, true positive and true negative values. A good classifier is expected to produce a higher true positive and true negative. The classical ML model such as random forest demonstrated poor performance when metabolite data was used (Fig. 3A). In addition, ROC plots the true positive rate against the false-positive rate. For a good classifier, the ROC curve stays away from a linear line. In the sample shown for the traditional random forest model, a poor ROC curve indicates the poor classification of metabolites data (Fig. 3B). Further, the precision-recall rate measures the precision versus recall. The curve shows that random forest has poor capability in classifying the metabolite data (Fig. 3B).

Table 3 Accuracy of classical ML-based algorithms for different combination of features

Full size table

The custom ANN was also compared with the above state-of-the-art classical ML methods (nearest neighbors, linear SVM, RBF SVM, gaussian process, decision tree, random forest, neural net, AdaBoost, and naïve Bayes). Metabolite data from the NMR peaks (corresponding to 13 metabolites obtained from 56 SCM samples) were used as input data in the custom ANN model. When tested with the training data of 44 and testing data of 12 with the batch size of 8, and number of epochs of 50, the number of neurons present in the first layer was 50, and second layer was 1 with sigmoid activated function. This model had an accuracy of 100% even with a small dataset at lower epochs (Fig. 4A) and a loss of 0.0059 (Fig. 4B). Hence, custom ANN would provide good accuracy if a large dataset was available. Using the similar approach, involving custom ANN and the embryologic dataset (with the training data of 44 and testing data of 12 with the batch size of 8, and number of epochs of 50), the number of neurons present in the first layer was 30 and second layer was 1 with sigmoid activated function produced an accuracy of 91.67% for the testing dataset (Fig. 4C) and a loss of 0.1125 (Fig. 4D). The promising results suggest that custom ANN is very efficient in predicting implantation outcomes based on metabolomic or embryologic data and any of the combinations assessed.

Discussion

The lack of conclusive evidence on the value of metabolomic biomarkers for predicting ART outcomes prompted us to combine ML models with conventionally used embryological data along with NMR-identified metabolite levels. For the first time, this study incorporated data generated from SCM metabolite analysis into ML models. Interestingly, the data from this study suggests that when classical ML models are used, incorporating both metabolomic and embryologic data significantly improves the prediction accuracy compared to metabolomic data alone. Further, it is clear from our results that custom ANN models predicted the embryo implantation potential with 100% accuracy when utilizing metabolomic features.

The embryo quality and endometrial receptivity are two major determining factors in the embryo implantation process. Our study included only morphologically superior (top-graded) blastocyst on day 5. Since endometrial thickness can influence ART outcome [29, 30], we ensured that the endometrial thickness in our study subjects was comparable between positive (9.67±0.48mm) and negative implantation (10.29±0.44mm; p>0.05). In addition, women with adenomyosis and huge uterine myoma were excluded from the study as it can influence the implantation process. Thus, we believe that both embryo and endometrial factors that can influence the embryo implantation process were controlled in our experimental settings.

Metabolomics approaches have been shown to have the potential for identifying biomarkers related to embryo development and thus improving the outcomes of ART cycles [31]. Several NMR-based studies have demonstrated associations between SCM metabolites and implantation/pregnancy outcomes [11, 32, 33]. Specifically, metabolites such as pyruvate, glucose, glutamate, and amino acid turnover have been suggested as biomarkers of embryo development, implantation potential, and clinical pregnancy [11, 12, 33]. Conversely, other studies using NMR spectroscopy as an analytical tool have failed to demonstrate any associations between SCM metabolites and embryo implantation potential [9, 34, 35]. Although pyruvate and threonine levels were significantly altered in SCM from successfully implanted embryos relative to the medium control, we could not establish significant differences between the successful and failed implantation groups. This is in agreement with the recent reports that metabolomics approaches alone could not efficiently enhance ART outcome prediction [8, 14]. Technical variations in SCM sampling, processing, contamination, and analytical complexity are known to affect the results [6, 14, 36, 37]. The differences in the composition of the commercial embryo culture media, culture conditions like oxygen level, culture medium volume, embryonic developmental stage, and the sex of the embryo can also lead to inconsistencies in metabolomic-based studies [6]. Hence, there is a need to combine metabolomics approaches with other approaches to improve the predictive value.

AI-based analysis for predicting ART-related pregnancy outcomes is gaining in popularity. ML is a subtype of AI-based analysis where computer-based algorithms are used to understand the pattern present in a complex set of data and help with prediction. Several ML algorithms such as decision tree, random forest, SVM, and naïve Bayes classifier are being used in reproductive medicine; as reviewed by Wang et al. [38], external validation of van Loendersloot’s model using clinical data alone led to 64.0% accuracy [39]; whereas a naïve Bayes model with embryologic data led to 80.4% accuracy [21]. Age of the female partner, number of embryos formed, and serum E2 level on the day of trigger were identified as the best features to predict outcomes [40], although the overall accuracy was below 85%. Oocyte and embryologic characteristics such as oocyte maturity, fertilization rate, number of nuclear precursor bodies (NPBs), embryo progression to day 3, blastocyst rate and quality (on day 5), and the grade of the embryo preferred for transfer (on day 3 and day 5) were also analyzed as these parameters have demonstrated as predictive factors of embryo development and implantation potential using conventional or AI based analysis [40,41,42,43,44,45,46].

Incorporation of AI-based analysis was recently recommended for improving the efficacy of embryo implantation potential prediction by omics-based approaches [22]. A recent study has incorporated proteomic profile of euploid blastocysts and their morphology in AI-based prediction of embryo implantation potential [47]. However, there are no studies exploring the combination of ML models and NMR-derived metabolite data for predicting implantation potential. Recently, a deep learning model combined with Raman profiles generated from day-3 embryos was used to predict blastocyst development [23]. In line with the earlier results utilizing ML models for patient characteristics to predict embryo implantation potential [48,49,50], certain ML models (such as nearest neighbors, RBF SVM, decision tree, random forest, and neural net) provided accuracies of 50–67% (moderate accuracies) when metabolomic data alone was used. Moreover, when using embryologic data alone, nearest neighbors, RBF SVM, and decision tree provided accuracies of 50–67%. Although we observed 100% accuracy when combining most of the classical ML models with both metabolomic and embryologic data, the results could not be substantiated with the current small dataset, as classical ML models have an overfitting issue. Hence, a custom ANN model was employed to overcome this data issue with the regularization method. ANN models have the ability to model nonlinear and complex data, they can more effectively infer unseen data, and dropout helps to overcome the overfitting issue. Classical ML models were initially used and then compared to the advanced ANN models, which provided more than 90% accuracy for both metabolomic (100%) and embryologic (92%) data with a small sample size. Hence, ANN could be used with complex data to accurately predict outcomes in real time. In addition, ML models should be tested with a large dataset.

The strength of this study is that only day-5 blastocyst transfer cycles were used to assess our combined approach to predicting embryo implantation potential. Even after using a high-resolution (800 MHz) NMR spectrometer equipped with a cryogenically cooled micro-coil (1.7 mm) probe to profile SCM metabolites, it was not possible to obtain a differential metabolite signature between successful and failed implantation groups. Further, classical ML models have an overfitting problem, which may exaggerate the prediction when a small sample size is used, whereas ANN can overcome this issue with an added regularization to the loss function

Conclusion

The observations made in this study open up the possibility of integrating multiple datasets with ML models to improve the prediction of embryo implantation potential. Combining ML models (specifically ANN models) with metabolomic and embryologic data may improve the prediction of embryo implantation potential. This approach should be tested in large and diverse datasets and it potentially could be used to derive clinical benefits for patients in real time.

Data Availability

The data and material that support the findings of this study are available from the corresponding author upon request.

References

Niakan KK, Han J, Pedersen RA, Simon C, Pera RA. Human pre-implantation embryo development. Development. 2012;139:829–41.
Article CAS PubMed PubMed Central Google Scholar
Vanneste E, Voet T, Le CC, et al. Chromosome instability is common in human cleavage-stage embryos. Nat Med. 2009;15:577–83.
Article CAS PubMed Google Scholar
Baart EB, Martini E, van den Berg I, et al. Preimplantation genetic screening reveals a high incidence of aneuploidy and mosaicism in embryos from young women undergoing IVF. Hum Reprod. 2006;21:223–33.
Article CAS PubMed Google Scholar
Gardner DK, Lane M, Stevens J, Schoolcraft WB. Non-invasive assessment of human embryo nutrient consumption as a measure of developmental potential. Fertil Steril. 2001;76:1175–80.
Article CAS PubMed Google Scholar
Gardner DK, Lane M, Stevens J, Schlenker T, Schoolcraft WB. Blastocyst score affects implantation and pregnancy outcome: towards a single blastocyst transfer. Fertil Steril. 2000;73:1155–8.
Article CAS PubMed Google Scholar
Gardner DK, Meseguer M, Rubio C, Treff NR. Diagnosis of human pre-implantation embryo viability. Hum Reprod Update. 2015;21:727–47.
Article CAS PubMed Google Scholar
Cruz M, Garrido N, Herrero J, Perez-Cano I, Munoz M, Meseguer M. Timing of cell division in human cleavage-stage embryos is linked with blastocyst formation and quality. Reprod Biomed Online. 2012;25:371–81.
Article PubMed Google Scholar
Siristatidis CS, Sertedaki E, Vaidakis D, Varounis C, Trivella M. Metabolomics for improving pregnancy outcomes in women undergoing assisted reproductive technologies. Cochrane Database Syst Rev. 2018;3:CD011872.
PubMed Google Scholar
Kirkegaard K, Svane AS, Nielsen JS, Hindkjær JJ, Nielsen NC, Ingerslev HJ. Nuclear magnetic resonance metabolomic profiling of day 3 and 5 embryo culture medium does not predict pregnancy outcome in good prognosis patients: a prospective cohort study on single transferred embryos. Hum Reprod. 2014;29:2413–20.
Article CAS PubMed Google Scholar
Vergouw CG, Heymans MW, Hardarson T, et al. No evidence that embryo selection by near-infrared spectroscopy in addition to morphology is able to improve live birth rates: results from an individual patient data meta-analysis. Hum Reprod. 2014;29:455–61.
Article CAS PubMed Google Scholar
Pudakalakatti SM, Uppangala S, D'Souza F, et al. NMR studies of pre-implantation embryo metabolism in human assisted reproductive techniques: a new biomarker for assessment of embryo implantation potential. NMR Biomed. 2013;26:20–7.
Article CAS PubMed Google Scholar
Gardner DK, Wale PL, Collins R, Lane M. Glucose consumption of single post-compaction human embryos is predictive of embryo sex and live birth outcome. Hum Reprod. 2011;26:1981–6.
Article CAS PubMed Google Scholar
Seli E, Vergouw CG, Morita H, et al. Non-invasive metabolomic profiling as an adjunct to morphology for non-invasive embryo assessment in women undergoing single embryo transfer. Fertil Steril. 2010;94:535–42.
Article PubMed Google Scholar
Siristatidis C, Dafopoulos K, Papapanou M, et al. Why has metabolomics So Far Not managed to efficiently contribute to the improvement of assisted reproduction outcomes? The answer through a review of the best available current evidence. Diagnostics (Basel). 2021;11:1602.
Article PubMed Google Scholar
Alakwaa FM, Chaudhary K, Garmire LX. Deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data. J Proteome Res. 2018;17:337–47.
Article CAS PubMed Google Scholar
Russell SJ, Norvig P. Artificial intelligence: a modern approach: Pearson education; 2003. p. 1132.
Google Scholar
Coticchio G, Fiorentino G, Nicora G, et al. Cytoplasmic movements of the early human embryo: imaging and artificial intelligence to predict blastocyst development. Reprod Biomed Online. 2021;42:521–8.
Article PubMed Google Scholar
Feyeux M, Reignier A, Mocaer M, et al. Development of automated annotation software for human embryo morphokinetics. Hum Reprod. 2020;35:557–64.
Article CAS PubMed Google Scholar
VerMilyea M, Hall JMM, Diakiw SM, et al. Development of an artificial intelligence-based assessment model for prediction of embryo viability using static images captured by optical light microscopy during IVF. Hum Reprod. 2020;35:770–84.
Article CAS PubMed PubMed Central Google Scholar
Zaninovic N, Rosenwaks Z. Artificial intelligence in human in vitro fertilization and embryology. Fertil Steril. 2020;114:914–20.
Article CAS PubMed Google Scholar
Uyar A, Bener A, Ciray HN. Predictive modeling of implantation outcome in an in vitro fertilization setting: an application of machine learning methods. Med Decis Making. 2015;35:714–25.
Article PubMed Google Scholar
Siristatidis C, Stavros S, Drakeley A, et al. Omics and artificial intelligence to improve in vitro fertilization (IVF) success: a proposed protocol. Diagnostics (Basel). 2021;11:743.
Article CAS PubMed Google Scholar
Zheng W, Zhang S, Gu Y, et al. Non-invasive metabolomic profiling of embryo culture medium using Raman spectroscopy with deep learning model predicts the blastocyst development potential of embryos. Front Physiol. 2021;12:777259.
Article PubMed PubMed Central Google Scholar
Alpha Scientists in Reproductive Medicine and ESHRE Special Interest Group of Embryology. The Istanbul consensus workshop on embryo assessment: proceedings of an expert meeting. Hum Reprod. 2011;26:1270–83.
Wishart DS, Jewison T, Guo AC, et al. HMDB 3.0--The human metabolome database in 2013. Nucleic Acids Res. 2013;41:D801–7.
Article CAS PubMed Google Scholar
Wishart DS, Feunang YD, Marcu A, et al. HMDB 4.0: the human metabolome database for 2018. Nucleic Acids Res. 2018;46:D608–17.
Article CAS PubMed Google Scholar
The jamovi project. 2021. Jamovi version 1.8. Computer software. https://www.jamovi.org Accessed on 12 Sep 2021.
R Core Team. R: A language and environment for statistical computing. Computer software version 4.0. R Foundation for Statistical Computing. 2021. https://cran.r-project.org. Accessed on 12 Sep 2021.
Richter KS, Bugge KR, Bromer JG, Levy MJ. Relationship between endometrial thickness and embryo implantation, based on 1,294 cycles of in vitro fertilization with transfer of two blastocyst-stage embryos. Fertil Steril. 2007;87:53–9.
Article PubMed Google Scholar
Bu Z, Sun Y. The impact of endometrial thickness on the Day of human chorionic gonadotrophin (hCG) administration on ongoing pregnancy rate in patients with different ovarian response. PLoS One. 2015;10:e0145703.
Article PubMed PubMed Central Google Scholar
Bracewell-Milnes T, Saso S, Abdalla H, et al. Metabolomics as a tool to identify biomarkers to predict and improve outcomes in reproductive medicine: a systematic review. Hum Reprod Update. 2017;23:723–36.
Article CAS PubMed Google Scholar
Wallace M, Cottell E, Cullinane J, McAuliffe FM, Wingfield M, Brennan L. ¹H NMR based metabolic profiling of day 2 spent embryo media correlates with implantation potential. Syst. Biol. Reprod. Med. 2014;60:58–63.
Article CAS PubMed Google Scholar
Seli E, Botros L, Sakkas D, Burns DH. Non-invasive metabolomic profiling of embryo culture media using proton nuclear magnetic resonance correlates with reproductive potential of embryos in women undergoing in vitro fertilization. Fertil Steril. 2008;90:2183–9.
Article PubMed Google Scholar
Nadal-Desbarats L, Veau S, Blasco H, et al. Is NMR metabolic profiling of spent embryo culture media useful to assist in vitro human embryo selection? Magn Reson Mater Phys Biol Med. 2013;26:193–202.
Article Google Scholar
Rinaudo P, Shen S, Hua J, et al. (1)H NMR based profiling of spent culture media cannot predict success of implantation for day 3 human embryos. J Assist Reprod Genet. 2012;29:1435–42.
Article PubMed PubMed Central Google Scholar
Hernández-Vargas P, Muñoz M, Domínguez F. Identifying biomarkers for predicting successful embryo implantation: applying single to multi-OMICs to improve reproductive outcomes. Hum Reprod Update. 2020;26:264–301.
Article PubMed Google Scholar
Asampille G, Cheredath A, Joseph D, Adiga SK, Atreya HS. The utility of nuclear magnetic resonance spectroscopy in assisted reproduction. Open Biol. 2020;10:200092.
Article CAS PubMed PubMed Central Google Scholar
Wang R, Pan W, ** L, et al. Artificial intelligence in reproductive medicine. Reproduction. 2019;158:R139–54.
Article CAS PubMed PubMed Central Google Scholar
Sarais V, Reschini M, Busnelli A, Biancardi R, Paffoni A, Somigliana E. Predicting the success of IVF: external validation of the van Loendersloot's model. Hum Reprod. 2016;31:1245–52.
Article PubMed Google Scholar
Hafiz P, Nematollahi M, Boostani R, Namavar Jahromi B. Predicting implantation outcome of in vitro fertilization and intracytoplasmic sperm injection using data mining techniques. Int J Fertil Steril. 2017;11:184–90.
PubMed PubMed Central Google Scholar
Rosen MP, Shen S, Rinaudo PF, Huddleston HG, McCulloch CE, Cedars MI. Fertilization rate is an independent predictor of implantation rate. Fertil Steril. 2010;94:1328–33.
Article PubMed Google Scholar
Yih MC, Spandorfer SD, Rosenwaks Z. Egg production predicts a doubling of in vitro fertilization pregnancy rates even within defined age and ovarian reserve categories. Fertil Steril. 2005;83:24–9.
Article PubMed Google Scholar
Borini A, Lagalla C, Cattoli M, et al. Predictive factors for embryo implantation potential. Reprod Biomed Online. 2005;10:653–68.
Article PubMed Google Scholar
Sjöblom P, Menezes J, Cummins L, Mathiyalagan B, Costello MF. Prediction of embryo developmental potential and pregnancy based on early stage morphological characteristics. Fertil Steril. 2006;86:848–61.
Article PubMed Google Scholar
Van den Abbeel E, Balaban B, Ziebe S, Lundin K, Cuesta MJ, Klein BM, Helmgaard L, Arce JC. Association between blastocyst morphology and outcome of single-blastocyst transfer. Reprod Biomed Online. 2013;27:353–61.
Article PubMed Google Scholar
Parrella A, Irani M, Keating D, Chow S, Rosenwaks Z, Palermo GD. High proportion of immature oocytes in a cohort reduces fertilization, embryo development, pregnancy and live birth rates following ICSI. Reprod Biomed Online. 2019;39:580–7.
Article PubMed Google Scholar
Bori L, Dominguez F, Fernandez EI, et al. An artificial intelligence model based on the proteomic profile of euploid embryos and blastocyst morphology: a preliminary study. Reprod Biomed Online. 2021;42:340–50.
Article CAS PubMed Google Scholar
Goyal A, Kuchana M, Ayyagari KPR. Machine learning predicts live-birth occurrence before in-vitro fertilization treatment. Sci Rep. 2020;10:20925.
Article CAS PubMed PubMed Central Google Scholar
Qiu J, Li P, Dong M, **n X, Tan J. Personalized prediction of live birth prior to the first in vitro fertilization treatment: a machine learning method. J Transl Med. 2019;17:317.
Article PubMed PubMed Central Google Scholar
Kaufmann SJ, Eastaugh JL, Snowden S, Smye SW, Sharma V. The application of neural networks in predicting the outcome of in-vitro fertilization. Hum Reprod. 1997;12:1454–7.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This study is dedicated to the memory of our late colleague, NMR scientist Prof. Hanudatta S. Atreya. The facilities provided by the NMR Research Centre at the Indian Institute of Science (IISc) are gratefully acknowledged. AC and AJ acknowledge the Dr. TMA Pai Structured PhD Fellowship from the Manipal Academy of Higher Education (MAHE).

Funding

Open access funding provided by Manipal Academy of Higher Education, Manipal

Author information

Aswathi Cheredath and Shubhashree Uppangala have contributed equally to this work and share first authorship.

Authors and Affiliations

Division of Clinical Embryology, Department of Reproductive Science, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, 576 104, India
Aswathi Cheredath, Ameya Jijo & Satish Kumar Adiga
Division of Reproductive Genetics, Department of Reproductive Science, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, 576 104, India
Shubhashree Uppangala
Department of Mechatronics Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, 576 104, India
Asha C. S
Department of Data Science, Prasanna School of Public Health, Manipal Academy of Higher Education, Manipal, 576 104, India
Vani Lakshmi R
Department of Reproductive Medicine and Surgery, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, 576 104, India
Pratap Kumar
NMR Research Centre, Indian Institute of Science, Bangalore, 560 012, India
David Joseph
Northwest Metabolomics Research Center, Mitochondria and Metabolism Center, Anesthesiology and Pain Medicine, University of Washington, Seattle, WA, USA
Nagana Gowda G.A
Division of Reproductive Biology, Department of Reproductive Science, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, 576 104, India
Guruprasad Kalthur

Authors

Aswathi Cheredath
View author publications
You can also search for this author in PubMed Google Scholar
Shubhashree Uppangala
View author publications
You can also search for this author in PubMed Google Scholar
Asha C. S
View author publications
You can also search for this author in PubMed Google Scholar
Ameya Jijo
View author publications
You can also search for this author in PubMed Google Scholar
Vani Lakshmi R
View author publications
You can also search for this author in PubMed Google Scholar
Pratap Kumar
View author publications
You can also search for this author in PubMed Google Scholar
David Joseph
View author publications
You can also search for this author in PubMed Google Scholar
Nagana Gowda G.A
View author publications
You can also search for this author in PubMed Google Scholar
Guruprasad Kalthur
View author publications
You can also search for this author in PubMed Google Scholar
Satish Kumar Adiga
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AC, SU, DJ: acquisition of data or analysis. SU, VL, AJ, ACS: analyzed the data. NGG, PK, GK: revised it critically for important intellectual content. SKA: conceived and designed the study. SU, ACS, SKA: wrote the paper. AC is the guarantor of this work, had full access to all the data, and takes responsibility for the integrity of the data and the accuracy of the data analysis. All authors have given final approval for publication.

Corresponding author

Correspondence to Satish Kumar Adiga.

Ethics declarations

Ethics Approval

The study was approved by the Institutional Ethics Committee (IEC) of the Kasturba Hospital of Manipal Academy of Higher Education and informed written consent was obtained from all patients

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table 1

(DOCX 14 kb)

Supplementary Table 2

(DOCX 12 kb)

Supplementary Fig. 1

Representative one-dimensional ¹H NMR spectrum of ONESTEP embryo culture medium used in the study. The figure shows the assignment of peaks for different metabolites. The x-axis represents the chemical shift in parts per million. (PNG 79 kb)

High resolution image (TIF 151 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cheredath, A., Uppangala, S., C. S, A. et al. Combining Machine Learning with Metabolomic and Embryologic Data Improves Embryo Implantation Prediction. Reprod. Sci. 30, 984–994 (2023). https://doi.org/10.1007/s43032-022-01071-1

Download citation

Received: 19 May 2022
Accepted: 23 August 2022
Published: 12 September 2022
Issue Date: March 2023
DOI: https://doi.org/10.1007/s43032-022-01071-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Combining Machine Learning with Metabolomic and Embryologic Data Improves Embryo Implantation Prediction

Abstract

Similar content being viewed by others

Development of a Novel Non-invasive Metabolomics Assay to Predict Implantation Potential of Human Embryos

Prediction model for day 3 embryo implantation potential based on metabolites in spent embryo culture medium

Non-invasive metabolomic profiling of embryo culture media and morphology grading to predict implantation outcome in frozen-thawed embryo transfer cycles

Introduction

Materials and Methods

Patient Selection

Controlled Ovarian Stimulation (COS) and Oocyte Aspiration

Fertilization and Embryo Evaluation

NMR Sample Preparation and Analysis

ML Model Training and Testing Procedures

Input and Output Data

Data Classification Using Custom ANN

Software

Statistical Analysis

Results

Patient Characteristics and Embryo Implantation Outcomes

Variation in Relative Levels of Metabolites in SCM

Use of ML Models in Predicting Embryo Implantation Potential

Discussion

Conclusion

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics Approval

Competing Interests

Additional information

Publisher’s Note

Supplementary information

Supplementary Table 1

Supplementary Table 2

Supplementary Fig. 1

High resolution image (TIF 151 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation