Introduction

The incidence of the digestive system neoplasm is rising yearly worldwide. These tumors including colorectal cancer, stomach cancer, liver cancer, etc. have been associated high risk of morbidity and mortality based on the Global Cancer Statistics 20181. Thus, early detection and timely treatment are integral for a favorable prognosis and long-term survival. At present, tumor markers have been widely used for tumor screening and diagnosis, monitoring treatment response, and surveillance of tumor recurrence after treatment. In clinical practice, markers commonly used to screen and diagnose tumors of the digestive system are mostly derived from blood, including carbohydrate antigen (CA50, 199, 242, 724), alpha-fetoprotein (AFP), carcinoembryonic antigen (CEA), pepsinogen, etc. However, the sensitivity and specificity of a single tumor marker are suboptimal, and it is often necessary to combine several tumor markers for the screening and early diagnosis of tumors2,3,4,5. Of course, there are also circulating tumor DNA and methylated genes from blood, urine or feces that are still being evaluated. Although these have shown good performance in the diagnosis of tumors of the digestive system, they have not been widely used in clinical practice due to immaturity in technology and high cost. Nevertheless, endoscopic examination with biopsy remains the gold standard for the diagnosis of gastric, esophageal, and colorectal cancers, while endoscopic ultrasonography and other imaging examinations can be used for the diagnosis of liver and pancreatic cancer. However, endoscopic screening and diagnostic approaches have limitations, including patient discomfort and potential major complications and harms. Therefore, an effective noninvasive screening tool for malignancies of the digestive system malignancies is desired.

Breath analysis is recognized as a simple and non-invasive method for screening and monitoring pathology or disease. Nevertheless, studies have shown that volatile organic compounds (VOCs) from urine also demonstrated good performance in the screening of cancer, in particular, special phenotypes of VOCs in urine could be used for the screening of prostate and gastrointestinal cancers6. Contrary to the conventional biomarker studies subjected to the health status of the individuals, the analysis of VOCs reflects the fingerprint characteristics of individuals that represent an instance of individualized medicine7. Moreover, a variety of sensor technologies are now being applied in analyzing the patterns of VOCs, such as eNoses and nanomaterials. As technology advances and clinical research progresses, VOCs analysis does not only enable early detection of cancer, but also the monitoring of the response to cancer therapy and detection of disease recurrence early when secondary treatments are most effective8.

VOCs are organic compounds with relatively low molecular weight and high vapor pressure. Cancer-originated VOCs have been frequently detected in feces, urine, blood, skin, sweat, and gases exhaled from cancer patients. These have been produced by tumor cells, which can reflect the disease9,10,11. Therefore, it is vital to explore the differences in VOCs released by varying cancer types to identify specific representative VOCs to each tumor that can be used as a diagnostic tool. For instance, the study by Kumar et al.12 has shown significantly lower concentrations of several specific volatile compounds in non-tumor individuals than those with tumors.

VOCs reflect changes in pathology and metabolic processes13,14. These specific VOCs have been considered to be the results of an imbalance between the systemic manifestations of oxidative stress, metabolic abnormalities, or reactive oxygen and the ability of the biological system to detoxify or repair damages15,16,17. In particular, the gut microbiome plays a key role in diseases of the digestive system, including colorectal cancer and inflammatory bowel disease. Changes in the composition of gut bacterial flora affect the fermentation products and the forms of volatile organic compounds (VOCs). Given that different diseases are associated with specific metabolomics that reflects cell metabolism, providing a new way of thinking for the diagnosis of diseases. In fact, in inflammatory bowel disease, changes in the gut microbiome are associated with colonic inflammation and can influence VOCs production. Also, some bacterial communities have been implicated in the development of colorectal cancer18,19,20,21. At the cellular level, changes in the production of VOCs have been associated with oxidation of polyunsaturated acids in the cell membrane as a result of genetic and/or protein mutations within tumor cells and the increased relative incidence of reactive oxygen species within cancer cells22,23. These cancer-related volatile organic compounds are released from the affected tissues into the feces or bloodstream and eventually excreted either through breath or through urine or feces14. Therefore, VOCs, as a comparatively new and non-invasive biomarker, provides a promising and attractive option for the screening of tumors in the digestive system.

Several studies have shown that VOCs analysis is valuable in the diagnosis of tumors of the digestive system24,25,26. However, these studies have some shortcomings, with some only analyzing VOCs in exhaled gas, while others had too a small sample size. In our meta-analysis, VOCs from all sources and tumors of the digestive tract were summarized and analyzed. This would ascertain the diagnostic efficacy of VOCs for tumors of the digestive system.

Materials and methods

Search strategy

A comprehensive, systematic electronic literature search of PubMed and Web of Science was conducted to identify all relevant papers from inception to 7th December 2020. Search terms were as follows: volatile organic compounds or VOCs and esophageal neoplasm or esophageal cancer or gastric cancer or gastric neoplasm or liver cancer or liver neoplasm or hepatic neoplasm or pancreas cancer or pancreas neoplasm or colorectal cancer or colorectal neoplasm. All relevant literature was also screened for other possible studies.

Study selections

All articles were reviewed by two independent investigators according to the inclusion and exclusion criteria. The inclusion criteria were: (1) VOCs were used to detect patients with neoplasm of the digestive system; (2) the control group consisted of patients with benign diseases of the digestive system and/or healthy individuals; (3) patients did not undergo any therapy; (4) extracted data could be used to measure true-positive (tp), false-positive (fp), false-negative (fn), and true-negative (tn) values. Studies/articles were excluded for the following criteria: (1) studies not within the field of our research; (2) non-human or animal study; (3) non-English literature; (4) non-diagnostic study; (5) studies on the neoplasm of not the digestive system; (6) data from the study were insufficient to constitute 2 × 2 tables; (7) review articles, editorials, case reports, conference proceedings, or letters.

Data extraction

For each study included in our meta-analysis, the following data were extracted: first author, country, year of publication, number of participants, mean age of participants, percentage of males, cancer type, the source of VOCs, analytical platforms, and the control group. Also, each investigator registered and counted the numbers of tp, fp, tn, and fn. (Based on the criteria of diagnostic tests, we used the pathological results as the gold standard and then classified the results as true positive (tp), false positive (fp), true negative (tn) and false negative (fn) according to the degree of matching between the results obtained by pathology and VOCs analysis. Sensitivity was calculated as tp/(tp + fn), while specificity was calculated as tn/(tn + fp). Correspondingly, four basic data could be extracted from the known sensitivity and specificity. Any disagreements between the investigators were assessed by a third reviewer and resolved by mutual agreement.

Quality assessment

To assess the quality of the included studies, the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool27 was used, which was an evidence-based quality assessment tool for systematic reviews of diagnostic accuracy studies that included four domains: patient selection, index test, reference standard, and flow and timing.

Statistical analysis

Heterogeneities between the studies were evaluated by measuring the P-value of the Cochran-Q test and I-square statistic28. Studies were considered homogenous if P > 0.05 and I2 ≤ 50%, and we performed only pooled sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR). If P < 0.05 or I2 ≥ 50%, indicating statistically significant heterogeneity between the studies, further meta-regression and subgroup analysis were carried out to determine the possible sources of heterogeneity.

Sensitivity, specificity, accuracy, positive and negative predictive values, PLR, NLR, and DOR of VOCs in the diagnosis of digestive system neoplasm were obtained from the individual study, and forest plots were used to calculate and graphically display pooling of the data. The SROC curve was displayed to obtain the optimal diagnostic efficiency for VOCs, and the area under the summary ROC curve (AUC) was calculated. The publication bias was presented by using the Deeks’ funnel plot asymmetry test. Meta-analysis was performed using STATA15.0 (StataCorp, College Station, TX, USA) with the MIDAS module, while RevMan 5.4 (Revman, the Cochrane Collaboration) was used to evaluate the quality of the included studies. A P-value of < 0.05 was considered statistically significant.

Results

Literature search

The initial search of PubMed and Web of Science databases yielded 589 articles. Of these, 168 were excluded due to repetitive publications, and 350 articles were excluded based on the inclusion/exclusion criteria after reading the titles and abstracts: 263 did not associate VOCs as tools for the diagnosis of neoplasm of the digestive system; 46 were review articles; 28 were conference abstracts; 2 were letters; 10 were not based on human studies; 1 was non-English literature. A total of 35 articles were excluded after reading the full text: 14 were not of diagnostic research; 5 were research of non-digestive tract neoplasm; 16 articles contained insufficient data to form 2 × 2 tables. Finally, a total of 36 studies were included in our meta-analysis (Fig. 1).

Figure 1
figure 1

Flow diagram of the study selection process.

Study characteristics

The 3612,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63 included studies comprised of a total of 39 datasets that investigated 5 cancer types: 3 on liver cancer, 16 on colorectal cancer, 10 on gastric cancer, 4 on esophageal cancer, and 6 on pancreatic cancer. Overall, our meta-analysis was performed based on 1712 cases and 3215 controls from 11 different countries. As for the origin of the VOCs measurements, 4 datasets measured VOCs patterns in feces, 22 (from 21 studies) in exhaled breaths, 8 (from 7 studies) in urine, 1 in blood, 2 (from 1 study) in saliva, 1 in bowel gas, and 1 in bile.

There were 2 datasets that used the electronic nose as the analytical platform, 16 datasets (from 15 studies) used gas chromatography coupled with mass spectrometry (GC–MS), 5 datasets used field asymmetric ion mobility spectrometer (FAIMS), 8 datasets (from 7 studies) used selected ion flow tube mass spectrometer (SIFT-MS), and 8 studies used other analytical platforms: 2 datasets used silicon nanowire field-effect transistor (SINW-FET), 1 dataset used proton transfer reaction mass spectrometer (PTR-MS), 1 dataset used ion–molecule reaction mass spectrometry (IMR-MS), 1 dataset used single-photon ionization mass spectrometry (SPI-MS), 1 dataset used field asymmetric ion mobility spectrometer–mass spectrometry (FAIMS–MS) and 2 datasets used sensor (Table 1).

Table 1 Major characteristics of included studies.

Risk of bias and quality assessment

There was no apparent publication bias between the included studies, given that the funnel chart was well-proportioned with a P-value of greater than 0.39 (see Supplemental Fig. S1). The analysis using the QUADAS-2 tool showed a low risk of bias and moderate to a high quality of the included studies (see Supplemental Fig. S2).

Diagnostic effect

Diagnostic accuracy was estimated using several indicators, including sensitivity, specificity, PLR, NLR, and DOR. The pooled sensitivity and specificity of VOCs for diagnosing neoplasm of the digestive system were 0.87 (95% CI 0.83–0.90) and 0.86 (95% CI 0.82–0.89), respectively (Fig. 2), while the pooled PLR and NLR were 6.18 (95% CI 4.68–8.17) and 0.15 (95% CI 0.12–0.20), respectively (Fig. 3). Furthermore, the pooled DOR (see Supplemental Fig. S3) and AUC (Fig. 4) were 40.61 (95% CI 24.77–66.57) and 0.93 (95% CI 0.90–0.95), respectively.

Figure 2
figure 2

Forest plots of pooled sensitivity and specificity.

Figure 3
figure 3

Forest plots of pooled positive likelihood radio and negative likelihood ratio.

Figure 4
figure 4

SROC curve of VOCs for the diagnosis of digestive system cancer. Abbreviations: The numbers in the circles represent the studies included in the paper. The eighth study corresponds to reference12, and the remaining studies1,2,3,4,5,6,7,8,9,10,11,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36 correspond to reference27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61. VOCs: Volatile organic compounds.

Heterogeneity

Heterogeneity existed in the pooled specificity (I2 = 71.04%, P = 0.00), as well as the pooled results of sensitivity (I2 = 87.23%, P = 0.00) (Fig. 2).

Meta-regression and subgroup analysis

Univariate meta-regression and subgroup analyses were performed to investigate potential sources of heterogeneity between studies. The forest plots (see Supplemental Fig. S4) of univariate meta-regression indicated that race, sample size, analytical platforms (MS, ISM, Sensor), VOCs source (exhaled breath, feces, urine, others), and the control groups (healthy individuals and non-cancer/benign disease cohort) could be the sources of the heterogeneity. Factors including race, analytical platform, and VOCs source were included in the subgroup analysis (Table 2). Both the sensitivity and specificity in Asian (0.88, 95% CI 0.82–0.92; and 0.91, 95% CI 0.81–0.96, respectively) were higher than that in European and American (0.86, 95% CI 0.82–0.90; and 0.84, 95% CI 0.80–0.88, respectively). With regards to the source of VOCs, the sensitivity and specificity (0.87, 95% CI 0.82–0.91; and 0.87, 95% CI 0.83–0.91, respectively) were the highest when VOCs were derived from exhaled breaths in the screening for tumors of the digestive system tumors. For the analytical platform, the sensitivity and specificity (0.89, 95% CI 0.85–0.92; and 0.88, 95% CI 0.84–0.91, respectively) were the highest when VOCs were analyzed with MS.

Table 2 Subgroup analysis of diagnostic effect.

Discussion

VOCs as biomarkers have been explored in recent years and are regarded as a new frontier in cancer diagnosis. It has great potentials in develo** into a rapid, noninvasive, and inexpensive cancer diagnostic tool64,65. A recent meta-analysis has demonstrated a sensitivity of 0.79 and specificity of 0.89 in diagnosing cancer using VOCs analysis24. Also, in diagnosing colorectal cancer, another meta-analysis has revealed the sensitivity and specificity of VOCs being 0.82 and 0.79, respectively26. Compared with the study by Zhou et al.26 that included studies of a single tumor, our meta-analysis included clinical studies of five different tumor types of the digestive system. Also, contrary to the study by Hanna et al.24, the sources of VOCs in our study were not limited to exhaled gases only but also included feces, blood, and urine. Furthermore, the number of studies included in our meta-analysis was also higher.

From the retrieved data of the included studies, we performed calculations on the sensitivity, specificity, PLR, NLR, and DOR to estimate the diagnostic accuracy of VOCs as a screening tool for tumors of the digestive system. From our analysis, the pooled sensitivity and specificity were 0.87 and 0.86, respectively. Moreover, the overall diagnostic performance was assessed using the SROC curve, which revealed an AUC of 0.93, indicating excellent diagnostic performance. The DOR represented a single measure of test accuracy, which in our analysis, the DOR was 40.61 (DOR > 10), indicating great discriminatory test capability. Furthermore, the likelihood ratio and post-test probability indicated the risk of tumors of the digestive system when tested positive or negative. Our analysis revealed a PLR of 6, indicating that patients with digestive system tumors were six times more likely to test positive than healthy individuals. Coupled with the NLR of 0.15, VOCs analysis represented a promising method to diagnose the neoplasm of the digestive system.

To identify potential sources of heterogeneity, univariate meta-regression and meta-analysis were performed, which revealed that factors including race, sample size, source of VOCs, analytical platforms, and types of the control group might be sources of heterogeneity in our meta-analysis. Further analysis demonstrated higher specificity for VOCs in the screening of tumors in the digestive system tumors of the Asians than the others (Europeans and Americans), suggesting that the performance of VOCs in cancer screening may vary according to race or ethnicity. Also, we found that analysis of VOCs from the exhaled breath performed better than other sources of VOCs. However, the sample size of studies investigating other sources of VOCs was smaller compared with those of VOCs from exhaled breath. For the analytical platform, the use of MS (mass spectrometry) to analyze VOCs had the highest sensitivity and specificity compared with other platforms. However, the sample size of studies utilizing MS was also much larger than that of other analytical platforms. In our study, the performance of VOCs as a screening tool for different tumors was not analyzed, given that our objective was to explore VOCs analysis as a rapid and non-invasive screening tool for tumors of the digestive system, with its applicability as a primary population screening rather than as a diagnostic tool for individual tumor types.

There were several reasons why this meta-analysis did not conduct threshold effect analysis: (1) different studies used different instruments for analyzing organic matter; (2) different instruments detected different types of organic matter; (3) different studies had different ways of setting the threshold value of organic compounds. Therefore, it was difficult to analyze the diagnostic ability of specific organic compounds for tumors. However, this did not affect the results that VOCs have good diagnostic ability in the digestive system.

Compared with other tumor markers currently applied in clinical practice, VOCs have demonstrated superior diagnostic ability in screening for gastric cancer and colorectal cancer. Serum pepsinogen is a widely used biomarker with good diagnostic efficacy for gastric cancer screening, with a meta-analysis demonstrating a sensitivity of 0.69 and a specificity of 0.73, while the sensitivity and specificity could be improved further to 0.70 and 0.79 respectively when applying the ratio of PG I to PG II concentrations66. Besides, the combination of CA199 with CA242 has demonstrated a sensitivity of 0.90 and a specificity of 0.7667. From our analyses, the sensitivity (0.90) and specificity (0.91) of VOCs for gastric cancer, and the sensitivity (0.84) and specificity (0.82) of VOCs for colorectal cancer exhibited superior performance to conventional biomarkers in screening for neoplasm.

There were limitations to our study. The included studies were mostly of case–control and cross-sectional nature, with few prospective longitudinal studies to associate VOCs with disease severity. Second, only articles in the English language had been selected, which invariably resulted in selection bias. Third, the cancer types, and analytical platforms of tumor-associated VOCs also varied between the studies. Therefore, we did not analyze the performance of varying VOCs in screening for different tumors of the digestive system.

Therefore, further studies are warranted to not only innovate and combine advanced technologies in the sampling, detection, and analysis of VOCs but also to standardize these methods. Moreover, it is vital to explore the differences of VOCs between varying types and stages of tumors. Clinical trials would then be required for the validation of results before widespread clinical application.

Conclusion

The pooled results of our meta-analysis have confirmed the difference in the VOCs analysis between patients with tumors of the digestive system and healthy individuals, which sheds light on VOCs analysis as a promising novel screening tool for early detection of tumors.