Abstract
To assess their roles in breast cancer diagnostics, we aimed to compare plasma cell-free DNA (cfDNA) levels with the circulating metabolome in a large breast screening cohort of women recalled for mammography, including healthy women and women with mammographically detected breast diseases, ductal carcinoma in situ and invasive breast cancer: the Breast Screening and Monitoring Study (BSMS). In 999 women, plasma was analyzed by nuclear magnetic resonance (NMR) and Ultra-Performance Liquid Chromatography-Mass Spectrometry (UPLC-MS) and then processed to isolate and quantify total cfDNA. NMR and UPLC-MS results were compared with data for 186 healthy women derived from the AIRWAVE cohort. Results showed no significant differences between groups for any metabolite, whereas invasive cancers had significantly higher plasma cfDNA levels than all other groups. After stratification, supervised OPLS-DA analysis and total cfDNA concentration showed high discrimination accuracy between invasive cancers and the disease/medication-free subjects. Furthermore, comparison of OPLS-DA data for invasive breast cancers with the AIRWAVE cohort showed similar discrimination between breast cancers and healthy controls. This is the first report of agreement between metabolomics and plasma cfDNA levels for discriminating breast cancer from healthy subjects in a true screening population. It also emphasizes the importance of sample standardization. Follow-on studies will involve analysis of candidate features in a larger validation series as well as comparing results with serial plasma samples taken at the next routine screening mammography appointment. The findings here help establish the role of plasma analysis in the diagnosis of breast cancer in a large real-world cohort.
Introduction
Worldwide, breast cancer (BC) is the most frequent cause of death among women after lung cancer [1]. Current diagnosis is largely based on physical examination, mammographic and other imaging, and histopathological assessment of tissue biopsy, complemented by blood tests for the detection of specific antigens and/or proteins [2, 3]. Early diagnosis significantly increases long-term survival rates [4]. However, more sensitive and breast cancer-specific biomarkers are required for early detection of aggressive disease.
Use of cfDNA was first described over 60 years ago [5]. Elevated levels are seen in cancer in part due to reduced DNase activity [6,7,8]. Elevated levels of cfDNA in plasma have been suggested for the diagnosis of breast cancers and qualitative tests have demonstrated increased cfDNA integrity/size [9,10,11]. However, elevated levels of cfDNA are also sometimes observed in benign breast disease [12], reducing its specificity for cancer. Certain patterns in cfDNA (e.g. mutations, loss of heterozygosity (LOH), hypermethylation) have the potential to provide specific markers and have also been investigated [13,14,15]. We have previously described that patient-specific circulating tumor DNA (ctDNA) analysis can detect early evidence of progression up to 2 years ahead of imaging [16].
Altered metabolism is one of the key hallmarks of cancer. The development of sensitive, reproducible and robust bioanalytical tools such as NMR and mass spectrometry (MS) techniques has allowed us to explore its role [17, 18] in conjunction with other new methods. We have previously shown that metabonomics identifies excess energy expenditure pathways perturbed during chemotherapy for breast cancer [19] and have suggested new therapeutic approaches that focus on metabolism [20]. Either individually or grouped as a metabolomic profile, detection of metabolites can be carried out in the same plasma samples as cfDNA analysis. We have thus explored the potential of using both cfDNA and the metabolome together, in a large cohort of women recalled for mammography at Imperial College Healthcare NHS Trust, including healthy women and women with early mammographically detected breast cancer. We also compared results to a second independent series of healthy controls from the AIRWAVE study. Together the use of cfDNA and metabolomics, when used as a translational research tool, can provide a link between the laboratory and clinic.
Results
The demographics and clinical metadata of the 1185 individuals analyzed in this study, comprising 999 from the BSMS study and 186 female individuals recruited from AIRWAVE (AW II), are reported in Supplementary Table 1.
NMR spectroscopy
In the BSMS cohort OPLS-DA of plasma 1H-NMR global profiling data (1D-NOESY and CPMG) between patients diagnosed with invasive breast cancer and cancer-free subjects, did not show significant discrimination (Table 1, Fig. 1a, b). Similar non-significant discrimination was found between groups for the comparison between benign vs. in situ, invasive cancer vs. benign, invasive cancer vs. in situ and cancer-free vs. all breast cancer groups. Similar results, with poor discrimination accuracy (<60%, Table 1) between all studied groups (Supplementary Fig. 2) were obtained for OPLS-DA modeling of the plasma NMR targeted data (19 metabolites and 112 lipoproteins).
Taking advantage of NMR data reproducibility between spectrometers and spectra collection centers [21], we also compared invasive cancer patients with data generated as part of the AIRWAVE study, comprising an independent cohort of female healthy individuals (n = 186). In particular, the targeted datasets from both studies (i.e. the absolute concentration values of 19 metabolites and 112 plasma lipoproteins) were employed and used to build the corresponding MVA models. Initially, unsupervised Principal Component Analysis (PCA) was performed on diseases-free and healthy AIRWAVE individuals’ datasets from both studies to test the feasibility of coupling the two independent datasets. PCA score plot (Supplementary Fig. 3a) from the 19 metabolites concentrations showed a perfect classification between healthy AIRWAVE versus BSMS diseases-free individuals. Further examination of loadings plots (Supplementary Fig. 3b) revealed that glucose and lactic acid concentrations were significantly different between the 2 study cohorts, where glucose and lactic acid values were higher and lower, respectively, in BSMS diseases-free individuals (Supplementary Fig. 3c, d). This could be attributed to the sample collection time points, nutritional habits and/or physical exercise between individuals from each cohort, amongst possible factors. Nevertheless, glucose and lactic acid were removed from both datasets, and the new PCA results indicated an overlap without any significant classification trends between BSMS and AIRWAVE samples, allowing us to employ them for further supervised MVA analyses. It should be noted that the lipoproteins datasets were highly overlapped for both studies (Supplementary Fig. 3e) and they were employed for further analyses as such.
The supervised OPLS-DA analysis of the 17 metabolites dataset (excluding glucose and lactic acid) for BSMS patients with invasive breast cancer versus the AIRWAVE healthy subjects showed high classification accuracy (Table 1) of the two groups (Supplementary Fig. 4a, b) and one-way ANOVA calculated p-values after Benjamini-Hochberg correction [22] indicated citric acid, acetic acid, leucine, histidine, glycine, glutamine, pyruvic acid and creatinine as discriminative biomarkers (Supplementary Fig. 4c). The same analysis for the 112 plasma lipoproteins provided a good classification of invasive cancer patients versus healthy AW subjects (Table 1, Supplementary Fig. 5) and 17 lipoprotein classes appeared to significantly change (p < 0.05) between the 2 classes (Supplementary Table 2). Following the same strategy, OPLS-DA models were constructed for the comparison between benign vs healthy (AIRWAVE) (Supplementary Fig. 6a) and in situ vs healthy (AW) (Supplementary Fig. 6b) and their performance is summarized in Table 1. Results indicated again high classification accuracies for the benign vs. healthy (AW) and in situ vs healthy (AW) models based upon the 17 metabolites concentration datasets. The produced loadings from the models suggested several metabolites as potential biomarkers, such as pyruvic acid, citric acid, leucine, histidine, glycine, glutamine and creatinine (Supplementary Fig. 6c, d). It is noteworthy that although the mean age of BSMS breast cancer and AW subjects were significantly different (Supplementary Table 1), Pearson correlation analysis of all plasma metabolites concentrations with subjects’ age indicated an insignificant contribution of age to the measured values (Supplementary Fig. 7) in the present datasets.
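The Benjamini-Hochberg step applied to the ANOVA p-values above can be illustrated with a minimal sketch. This is not the authors' code (the study used MATLAB and the "fdr_bh" function); the function name `benjamini_hochberg` and the p-values below are hypothetical and chosen only to show how the adjustment works.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical raw p-values for four metabolites (illustrative, not study data).
raw = {"citric acid": 0.001, "leucine": 0.02, "alanine": 0.04, "valine": 0.30}
adj = benjamini_hochberg(list(raw.values()))
significant = [name for name, q in zip(raw, adj) if q < 0.05]
print(significant)
```

In this toy case alanine has a raw p-value below 0.05 but an adjusted value above it, which is the kind of borderline feature that the correction filters out.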
UPLC–MS
Similarly, OPLS-DA showed no significant discrimination between any sample class pairings for all LC–MS assays. In particular, the statistical models based upon the lipidomic profile of plasma samples for both positive and negative ionization modes, exhibited similar discrimination accuracy between invasive cancer and cancer-free subjects (accuracy = 64%), whereas the models from the benign vs. in situ, invasive cancer vs. benign, invasive cancer vs. in situ and cancer-free vs. the rest of the types of breast cancer groups showed lower discrimination accuracy values (i.e. <60%) (Table 1, Supplementary Fig. 8). However, a moderate discrimination accuracy (AUC = 0.65 and accuracy = 76.5%) was observed between the invasive cancer and the cancer-free control group from the HILIC+ dataset. An examination of the extracted loadings data from the supervised OPLS-DA analysis showed that the most weighted HILIC+ features leading to the observed discrimination, corresponded to lidocaine, most likely explained by contamination of several plasma samples by local anesthetic during the blood sampling procedure. When we removed HILIC+ lidocaine features and repeated the MVA analysis the model showed less accuracy in discriminating the two groups (AUC = 0.62 and accuracy = 67.0%) in agreement with the lipidomic profile (Table 1 and Fig. 2a).
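Because the paragraph above reports both AUC and accuracy, a short sketch may help show how the two metrics are derived from the same set of cross-validated scores. The scores, labels and the 0.5 threshold below are hypothetical assumptions, not values from the study, and the helper names `roc_auc` and `accuracy` are illustrative.

```python
def roc_auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def accuracy(scores, labels, threshold=0.5):
    """Fraction of samples classified correctly at a fixed score threshold."""
    predicted = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for p, y in zip(predicted, labels) if p == y)
    return correct / len(labels)


# Hypothetical cross-validated scores; 1 = invasive cancer, 0 = cancer-free.
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2, 0.7, 0.1]
labels = [1, 1, 1, 0, 0, 0, 1, 0]
auc = roc_auc(scores, labels)
acc = accuracy(scores, labels)
print(auc, acc)
```

AUC summarizes ranking over all thresholds, whereas accuracy depends on one chosen threshold and on class balance, so the two need not move together.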
a. Scores plot and ROC curve of the OPLS-DA analysis between Cancer-free vs. Invasive breast cancer subjects from the MS HILIC+ (HPOS) assay data. b. Scores plot and the ROC curve of the OPLS-DA analysis [MS HILIC+ (HPOS) assay] between Invasive breast cancer vs. Diseases/medication-free subjects (n = 288), where the two observed subgroups are colored differently; those predicted as Invasive Cancer are depicted as red diamonds and the rest of the Diseases/medication-free subjects are depicted as inverted yellow triangles.
Having considered lidocaine contamination of the samples, we further stratified the 614 cancer-free controls, comparing the 288 reported as having no drug intake and/or other disease with the other 326 subjects. Subsequently, we isolated this disease/medication-free group and re-evaluated all MVA analyses for both UPLC-MS and NMR data. This was undertaken to avoid any confounding in the data owing to the presence of features corresponding to drug-related compounds or to metabolites relating to other diseases that cancer-free subjects were experiencing during the blood sampling period. This OPLS-DA model for invasive cancer vs. disease/medication-free subjects indicated a slightly higher discrimination accuracy (+3%) for all UPLC–MS assays (Table 1 and Fig. 2b). When exploring the predictive ability of our models, 51 of the 288 plasma samples from the disease/medication-free healthy controls were predicted as invasive cancer with accuracy >85% based on their metabolic data (Table 1 and Fig. 3a).
a Scores plot and the ROC curve of the OPLS-DA analysis [MS HILIC+ (HPOS) assay] between Diseases/medication-free subjects subgroup 1 (n = 237) vs. Diseases/medication-free subjects subgroup 2 (n = 51), consisting of those predicted as Invasive Cancer. b The cfDNA n × Fold concentration changes between the studied groups. The n × Fold was calculated by the equation: \(n \times \mathrm{Fold} = \log_2\left(\frac{\text{median of group 1}}{\text{median of group 2}}\right)\). Moreover, one-way ANOVA coupled with t-test was performed to determine the statistically significant (p < 0.05) differences in the observed cfDNA concentration changes for each case. For each comparison, cfDNA concentration is higher in the underlined group.
However, the supervised OPLS-DA analysis of the disease/medication-free samples vs. the disease/medication-free samples predicted as invasive cancer showed high discrimination accuracy, namely 86%, 76% and 71% for HILIC+, Lipid RPC+ and Lipid RPC− MS assays, respectively (Table 1). When this group of 51 control subjects was excluded, highly predictive models were produced from the disease/medication-free (without those predicted as Invasive Cancer) vs. invasive cancer plasma samples, with accuracy values of 76%, 70% and 73% for HILIC+, Lipid RPC+ and Lipid RPC− MS assays, respectively.
Plasma cfDNA analysis
Initially, total cfDNA levels in all blood samples from BSMS were employed for multiple univariate ANOVA analyses, comparing the total cfDNA concentration between each group of subjects as for the metabolomics data (Fig. 3b). All univariate analyses of the cfDNA concentration corroborated the results obtained from the MS-based MVA models. The total cfDNA concentration was significantly higher in invasive breast cancer vs. the disease-free subjects, whereas the comparisons of cancer-free and benign samples vs. invasive cancer samples showed no significant differences (Fig. 3b). In addition, there was no significant difference in concentration between patients with invasive and in situ cancer. Of note, the 51 disease/medication-free subjects (subgroup 2) that were classified as “cancer like” by the HILIC+, Lipid RPC+ and Lipid RPC− LC–MS assays also had a significantly higher cfDNA concentration (p = 0.002) compared to the rest of the healthy controls (n = 237), whereas non-significant differences were observed vs. the invasive cancer samples. In addition, the subgroup of 237 disease-free subjects (subgroup 1) had significantly lower cfDNA concentration vs. the invasive cancer samples (Fig. 3b). Consequently, cfDNA results were in total agreement with the LC-MS metabolomics data. It should be noted that Pearson correlation analysis (r = 0.068) of measured plasma cfDNA values with subjects’ age indicated an insignificant contribution of age to the cfDNA differences between the studied groups.
As expected given their agreement, the MVA analysis of the combined cfDNA and LC–MS datasets produced superior OPLS-DA models, i.e. with higher discrimination accuracy (see MVA results of HILIC+ and cfDNA combined datasets in Supplementary Fig. 9).
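The n × Fold measure used for the cfDNA comparisons in Fig. 3b is the base-2 logarithm of the ratio of group medians. The following sketch shows that calculation; the concentrations are hypothetical values in arbitrary units, and the function name `n_fold` is an illustrative assumption rather than anything from the study's code.

```python
import math
from statistics import median


def n_fold(group_1, group_2):
    """log2 of (median of group 1 / median of group 2), as in the Fig. 3 caption."""
    return math.log2(median(group_1) / median(group_2))


# Hypothetical cfDNA concentrations in arbitrary units (not study data).
invasive = [12.0, 8.0, 20.0, 16.0, 10.0]
controls = [5.0, 6.0, 7.0, 4.0, 9.0]

fold = n_fold(invasive, controls)
print(fold)
```

A positive value means the first group has the higher median, and swapping the groups only flips the sign, which is why a log ratio is convenient for displaying pairwise comparisons side by side.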
Discussion
We report the metabolomic and cfDNA analysis of a large cohort of sequential plasma samples from 999 women attending for routine breast screening and validation with an independent cohort of 186 healthy women from the AIRWAVE study. Our main findings demonstrate the utility of cfDNA quantification here. This represents a real-world cohort, and results of this comprehensive work exemplify the challenges of establishing such a complex composite biomarker panel since the resulting accuracy of the signature derived from the UPLC-MS analysis was only moderate (AUCs between 0.62 and 0.76).
Several metabolomics studies have attempted to detect the breast cancer fingerprint in serum and plasma [1, 24], showing high accuracy in models (AUC > 0.9), which discriminate breast cancer from healthy subjects. The majority of the models described in the aforementioned studies are derived by MS plasma or tissue analyses with a maximum of 100 advanced breast cancer and 100 controls, although another NMR-based metabolomic study employing a large serum/plasma cohort succeeded in monitoring and predicting BC relapse (accuracy = 71%) and discriminating early BC from metastatic BC patients (accuracy = 85%) [25]. Here, our large cohort analysis represents a much earlier cancer stage with greater power based on the larger sample size (999 women). NMR untargeted metabolomics data were incapable of discriminating/fingerprinting any of the patient groups (Fig. 1) in this screening population. Moreover, using a targeted approach nineteen metabolites and 112 lipoproteins concentrations extracted by NMR data, were also statistically insignificant among the studied groups (Supplementary Fig. 1). It is noteworthy that many plasma metabolites quantified herein are reported to change in invasive BC (e.g. l-glutamine, l-valine, creatine etc.) [1, 24]. However, in this large cohort of early screen detected breast cancers none of these metabolites exhibited statistically significant variation in concentration (Supplementary Fig. 1). Such ‘negative data’ serves to reinforce the importance of performing screening studies in larger cohorts. Strikingly, our results are in agreement with a very recent study, where it was shown that NMR metabolomic data were multi-disease specific for patients risk stratification except from breast cancer [26]. Nevertheless, it is notable that the measured concentration of several plasma metabolites (i.e. 
creatine, histidine, valine, alanine and tyrosine) was found slightly (but not significantly) elevated in the plasma samples of women with invasive BC (Supplementary Fig. 1), which is in accordance with published literature [
Materials and methods
Patients and samples
We recruited individuals from the Breast Screening and Monitoring Study (BSMS) who were recalled from mammography. The study protocol was approved by the Riverside Research Ethics Committee (Imperial College Healthcare NHS Trust; Tissue Bank Ethics/REC reference numbers: 12/LO/2019; 13/LO/1152; R10015-16A; 07/Q0401/20) and conducted in accordance with Good Clinical Practice Guidelines and the Declaration of Helsinki. All patients gave written informed consent prior to participation and were over 18 years of age. Blood (20 ml) was taken into K2 EDTA tubes (BD Biosciences), processed to recover plasma and buffy coat within 2 h of collection, and stored at −80 °C for subsequent extraction of cfDNA and germline DNA as described previously [10]. The cohort included individuals with no breast disease, and women with biopsy-confirmed benign breast disease, carcinoma in situ and invasive breast cancer. Driven by the LC-MS multivariate analyses (see statistical methods below) as well as clinical metadata (Supplementary Table 1), we formed several subgroups of samples due to the presence of features from medication (e.g., lidocaine). Furthermore, an additional subgroup was formed from the cancer/medication-free samples that were statistically classified as invasive breast cancer with high accuracy. This was also driven by the cfDNA assay results.
A second independent control group of healthy individuals was also analyzed from women recruited from the AIRWAVE study (MREC/13/NW/0588). The AIRWAVE Health Monitoring Study was established to evaluate possible health risks associated with the use of TETRA, a digital communication system used by the police forces and other emergency services. This is an ongoing long-term observational study following up the health of police officers and staff across the United Kingdom, with the ability to monitor both cancer and non-cancer health outcomes through data linkage. A total of 53,280 participants were recruited between June 2004 and March 2015, with a response rate averaging 50% of employees in participating forces. At baseline, participants completed an enrollment questionnaire (sent via routine administration or the occupational health service), or a comprehensive health screening performed locally, or both. Screened participants have now been followed up for 7.5 years on average. Each recruited individual provided a single 7 mL EDTA blood sample for subsequent plasma isolation and storage at −80 °C. This cohort was used for the validation of the cancer/medication-free group, aiming at testing its NMR-based model robustness/predictive accuracy, and as an external (independent) cancer/medication-free cohort versus invasive cancer samples for the detection of any biomarkers.
Ultra-performance liquid chromatography-mass spectrometry (UPLC-MS) − 1H Nuclear Magnetic Resonance (NMR) spectroscopy
Plasma samples for UPLC-MS and NMR analyses were prepared and data acquired as published previously [33,34,35]. For UPLC-MS, lipophilic analytes were separated by reversed-phase chromatography (lipid RPC) and hydrophilic analytes (e.g., polar and charged metabolites) by hydrophilic interaction liquid chromatography (HILIC). MS positive and negative electrospray ionization modes produced lipid positive and negative (lipid RPC+ and lipid RPC− respectively) and HILIC positive (HILIC+) datasets. Solution 1H-NMR spectra of all samples were acquired using a Bruker IVDr 600 MHz spectrometer (Bruker BioSpin) operating at 14.1 T. Further details about the quality control of both UPLC-MS and NMR data, metabolite quantification and experimental procedures can be found in the supplementary materials.
Extraction and quantitation of plasma cfDNA
Cell-free DNA was isolated from 4 ml of blood plasma with the MagMAX Cell-free DNA Isolation Kit (Thermo Fisher Scientific) on the Kingfisher Flex instrument (Thermo Fisher Scientific) using the MagMAX cfDNA-4mL-Flex.bdz protocol and processed according to the manufacturer’s instructions.
Statistical analyses – multivariate/univariate statistics
Multivariate statistical (MVA) models, specifically Orthogonal Partial Least Squares–Discriminant Analysis (OPLS-DA) of NMR and UPLC-MS metabolomics data and clinical metadata, were generated between study participants with invasive cancer (n = 105), in situ (n = 40) and benign breast disease (n = 214), and imaging or biopsy confirmed cancer-free controls (n = 614). Modeling was performed in MATLAB (MathWorks, version R2019b), using the PLS_Toolbox version 8.7.1 (2019) (Eigenvector Research, Inc., Manson, WA, USA 98831; software available at http://www.eigenvector.com). All multivariate statistical models and their metrics were produced after cross-validation. Any correlation of metabolomics/cfDNA data with subjects’ age/height/weight (see Supplementary Table 1) was assessed by refitting each multivariate model after adding each variable into the model and calculating its accuracy. For all studied groups, age/height/weight did not appear as statistically significant variables. Variable loadings data (i.e., metabolites’ LC–MS/NMR features) and Variable Importance in Projection (VIP) scores from each multivariate OPLS-DA model were used to initially evaluate any significant feature (i.e., any metabolite that could drive the classification between studied groups). VIP scores estimate the importance of each variable in the projection used in a PLS model and are often used for variable selection. A variable with a VIP score close to or greater than 1 (one) can be considered important in a given model. Variables with VIP scores significantly less than 1 (one) are less important and might be good candidates for exclusion from the model [36].
Nevertheless, each variable’s statistical significance (i.e. metabolite and lipoprotein concentrations) was further tested by univariate (ANOVA) analyses via built-in MATLAB functions (https://uk.mathworks.com/help/stats/one-way-anova.html). Any reported p-values were corrected for false discovery rate (FDR) by applying the Benjamini-Hochberg correction [22] using the “fdr_bh” function (https://www.mathworks.com/matlabcentral/fileexchange/27418-fdr_bh).
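For readers unfamiliar with the VIP threshold of 1 used in the statistical methods, the commonly cited definition for a PLS model is sketched below. This is the standard textbook form and is an assumption about what the toolbox computes; the exact variant used for OPLS-DA in PLS_Toolbox may differ. Here \(p\) is the number of variables, \(A\) the number of components, \(w_{ja}\) the weight of variable \(j\) on component \(a\), and \(SS_a\) the sum of squares of the response explained by component \(a\).

```latex
\mathrm{VIP}_j = \sqrt{\frac{p \sum_{a=1}^{A} SS_a \left( w_{ja} / \lVert \mathbf{w}_a \rVert \right)^2}{\sum_{a=1}^{A} SS_a}},
\qquad
\sum_{j=1}^{p} \mathrm{VIP}_j^2 = p .
```

Because the squared VIP scores average to 1 across all variables, a score near or above 1 marks a variable that contributes at least as much as the average variable, which motivates the cut-off described in the text.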
Data availability
The datasets generated and/or analyzed during the current study are not publicly available due to individuals’ privacy reasons but are available from PGT and JAS on reasonable request and formal legal agreement.
Code availability
Any software/code employed in the study is available to download from the links reported in the Methods section: “Statistical analyses – Multivariate/Univariate statistics”.
References
Jové M, Collado R, Quiles JL, Ramírez-Tortosa MC, Sol J, Ruiz-Sanjuan M, et al. A plasma metabolomic signature discloses human breast cancer. Oncotarget. 2017;8:19522–33.
Weigelt B, Horlings H, Kreike B, Hayes M, Hauptmann M, Wessels L, et al. Refinement of breast cancer classification by molecular characterization of histological special types. J Pathol. 2008;216:141–50.
Shaw JA, Stebbing J. Circulating free DNA in the management of breast cancer. Ann Transl Med. 2014;2:16–8.
Saadatmand S, Bretveld R, Siesling S, Tilanus-Linthorst MMA. Influence of tumour stage at breast cancer detection on survival in modern times: Population based study in 173 797 patients. BMJ. 2015;351:h4901.
Mandel P, Metais P. [Nuclear Acids In Human Blood Plasma]. C R Seances Soc Biol Fil. 1948;142:241–3.
Leon SA, Shapiro B, Sklaroff DM, Yaros MJ. Free DNA in the serum of cancer patients and the effect of therapy. Cancer Res. 1977;37:646–50.
Stroun M, Anker P, Lyautey J, Lederrey C, Maurice PA. Isolation and characterization of DNA from the plasma of cancer patients. Eur J Cancer Clin Oncol. 1987;23:707–12.
Cherepanova AV, Tamkovich SN, Bryzgunova OE, Vlassov VV, Laktionov PP. Deoxyribonuclease activity and circulating DNA concentration in blood plasma of patients with prostate tumors. Ann N Y Acad Sci. 2008;1137:218–21.
Huang ZH, Li LH, Hua D. Quantitative analysis of plasma circulating DNA at diagnosis and during follow-up of breast cancer patients. Cancer Lett. 2006;243:64–70.
Page K, Guttery DS, Zahra N, Primrose L, Elshaw SR, Pringle JH, et al. Influence of plasma processing on recovery and analysis of circulating nucleic acids. PLoS ONE. 2013;8:e77963.
Wang BG, Huang H-Y, Chen Y-C, Bristow RE, Kassauei K, Cheng C-C, et al. Increased plasma DNA integrity in cancer patients. Cancer Res. 2003;63:3966–68.
Zanetti-Dällenbach R, Wight E, Fan AX-C, Lapaire O, Hahn S, Holzgreve W, et al. Positive correlation of cell-free DNA in plasma/serum in patients with malignant and benign breast disease. Anticancer Res. 2008;28:921–5.
Chen XQI, Bonnefoi H, Diebold-Berger S, Lyautey J, Lederrey C, Faltin-Traub E, et al. Detecting tumor-related alterations in plasma or serum DNA of patients diagnosed with breast cancer. Clin Cancer Res. 1999;5:2297–303.
Shaw JA, Smith BM, Walsh T, Johnson S, Primrose L, Slade MJ, et al. Microsatellite alterations in plasma DNA of primary breast cancer patients. Clin Cancer Res. 2000;6:1119–24.
Silva JM, Dominguez G, Garcia JM, Gonzalez R, Villanueva MJ, Navarro F, et al. Presence of tumor DNA in plasma of breast cancer patients. Cancer Res. 1999;59:3251–56.
Coombes C, Page K, Salari R, Hastings RK, Armstrong AC, Ahmed S, et al. Personalized detection of circulating tumor DNA antedates breast cancer metastatic recurrence. Clin Cancer Res. 2019;25:4255–63.
Čuperlović-Culf M. 8 - NMR metabolomics application by cancer type. In: Čuperlović-Culf M, editor. NMR metabolomics in cancer research. Woodhead Publishing; 2013. p. 385–412. https://www.sciencedirect.com/book/9781907568848/nmr-metabolomics-in-cancer-research.
Liesenfeld DB, Habermann N, Owen RW, Scalbert A, Ulrich CM. Review of mass spectrometry–based metabolomics in cancer research. Cancer Epidemiol Biomark Prev. 2013;22:2182–201.
Keun HC, Sidhu J, Pchejetski D, Lewis JS, Marconell H, Patterson M, et al. Serum molecular signatures of weight change during early breast cancer chemotherapy. Clin Cancer Res. 2009;15:6716–23.
Stebbing J, Sharma A, North B, Athersuch TJ, Zebrowski A, Pchejetski D, et al. A metabolic phenotyping approach to understanding relationships between metabolic syndrome and breast tumour responses to chemotherapy. Ann Oncol. 2012;23:860–6.
Takis PG, Ghini V, Tenori L, Turano P, Luchinat C. Uniqueness of the NMR approach to metabolomics. TrAC Trends Anal Chem. 2019;120:115300.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57:289–300.
Fan Y, Zhou X, Xia TS, Chen Z, Li J, Liu Q, et al. Human plasma metabolomics for identifying differential metabolites and predicting molecular subtypes of breast cancer. Oncotarget. 2016;7:9925–38.
Lécuyer L, Bala AV, Deschasaux M, Bouchemal N, Triba MN, Vasson MP, et al. NMR metabolomic signatures reveal predictive plasma metabolites associated with long-term risk of developing breast cancer. Int J Epidemiol. 2018;47:484–94.
Hart CD, Vignoli A, Tenori L, Uy GL, van To T, Adebamowo C, et al. Serum metabolomic profiles identify ER-positive early breast cancer patients at increased risk of disease recurrence in a multicenter population. Clin Cancer Res. 2017;23:1422–31.
Buergel T, Steinfeldt J, Ruyoga G, Pietzner M, Bizzarri D, Vojinovic D, et al. Metabolomic profiles predict individual multidisease outcomes. Nat Med. 2022;28:2309–20.
More TH, RoyChoudhury S, Christie J, Taunk K, Mane A, Santra MK, et al. Metabolomic alterations in invasive ductal carcinoma of breast: a comprehensive metabolomic study using tissue and serum samples. Oncotarget. 2018;9:2678–96.
Burns JS, Manda G. Metabolic pathways of the Warburg effect in health and disease: perspectives of choice, chain or chance. Int J Mol Sci. 2017;18:2755.
Liberti MV, Locasale JW. The Warburg effect: how does it benefit cancer cells? Trends Biochem Sci. 2016;41:211–8.
Martin LJ, Melnichouk O, Huszti E, Connelly PW, Greenberg CV, Minkin S, et al. Serum lipids, lipoproteins, and risk of breast cancer: a nested case-control study using multiple time points. J Natl Cancer Inst. 2015;107:djv032.
Fernandez-Garcia D, Hills A, Page K, Hastings RK, Toghill B, Goddard KS, et al. Plasma cell-free DNA (cfDNA) as a predictive and prognostic marker in patients with metastatic breast cancer. Breast Cancer Res. 2019;21:149.
Teo YV, Capri M, Morsiani C, Pizza G, Faria AMC, Franceschi C, et al. Cell-free DNA as a biomarker of aging. Aging Cell. 2019;18:e12890.
Lewis MR, Pearce JTM, Spagou K, Green M, Dona AC, Yuen AHY, et al. Development and application of ultra-performance liquid chromatography-TOF MS for precision large scale urinary metabolic phenotyping. Anal Chem. 2016;88:9004–13.
Izzi-Engbeaya C, Comninos AN, Clarke SA, Jomard A, Yang L, Jones S, et al. The effects of kisspeptin on β-cell function, serum metabolites and appetite in humans. Diabetes Obes Metab. 2018;20:2800–10.
Dona AC, Jiménez B, Schäfer H, Humpfer E, Spraul M, Lewis MR, et al. Precision high-throughput proton NMR spectroscopy of human urine, serum, and plasma for large-scale metabolic phenotyping. Anal Chem. 2014;86:9887–94.
Chong I-G, Jun C-H. Performance of some variable selection methods when multicollinearity is present. Chemom Intell Lab Syst. 2005;78:103–12.
Acknowledgements
This study was made possible with funding from Cancer Research-UK (C14315/A23464) and a grant from Action Against Cancer. This work was supported by the Medical Research Council and National Institute for Health Research [grant number MC_PC_12025] and infrastructure support was provided by the National Institute for Health Research (NIHR) Imperial Biomedical Research Centre (BRC). We thank the patients who provided samples. We are grateful to Anna Rita Boyden, Olivia Ogle and Alison Hills for their help with patient recruitment and sample collection for the study.
Author information
Authors and Affiliations
Contributions
All authors contributed to the collection, design and analysis of data. JS and JAS conceived the study design and conception. JS, PGT and JAS wrote the final manuscript and PGT performed/designed all statistical analyses. CJS, LM and MRL helped with metabolomic data analyses. KP, DG, DFG and LP performed the cfDNA work and descriptive statistics and KG helped with the patient cohort. All authors approved the final submitted manuscript.
Corresponding author
Ethics declarations
Competing interests
JS, DG, and JAS have had and/or have significant roles at Oncogene and played no role in the review process. JS other conflicts can be found at https://www.nature.com/onc/editors (none are relevant here). The rest of the authors have no conflict of interest to disclose.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Stebbing, J., Takis, P.G., Sands, C.J. et al. Comparison of phenomics and cfDNA in a large breast screening population: the Breast Screening and Monitoring Study (BSMS). Oncogene 42, 825–832 (2023). https://doi.org/10.1038/s41388-023-02591-z
- Springer Nature Limited