Background

Incidence of esophageal adenocarcinoma (EAC) has risen markedly in many western countries. Most cases can be attributed to known risk factors, such as symptomatic gastroesophageal reflux (sGERD), central obesity, cigarette smoking and family history [1]. Nevertheless, the relative rarity of the cancer, combined with the cost and invasiveness of upper endoscopy for identifying early cancers and high-risk pre-cancers (e.g., Barrett’s esophagus (BE) with dysplasia or genomic abnormalities) [2, 3] make it challenging to define effective screening and surveillance strategies [4].

A key barrier has been identifying those most likely to benefit from endoscopy or newer non-endoscopic tissue sampling methods [5, 6]. Clinical practice guidelines vary by country and professional society regarding criteria for initial screening for BE or EAC, as well as the definition of BE [7, 8]. Furthermore, none consider the strong effect of age on EAC incidence except in defining a fixed age threshold, and all tend to treat the remaining risk factors as equally important. For example, 2016 American College of Gastroenterology (ACG) guidelines [7] suggest that screening may be considered among men with sGERD plus two or more other specified risk factors for BE or EAC. However, this qualitative approach excludes the approximately 47% of all EAC that present in persons without significant sGERD, who may be at increased risk due to other factors, and does not take advantage of known quantitative relationships (i.e., strength of association and dose-response) between EAC incidence and sGERD, smoking and obesity, for example [9].

To address this barrier, an online Interactive and Contextual Risk Calculator (IC-RISC™; https://ic-risc.esocan.org) was developed to take advantage of existing knowledge from observational studies and clinical trials to estimate more precisely an individual’s absolute risk of develo** EAC over a ten-year period, and to convey this estimate in the context of risk of dying from other cancers or from common causes such as injury, stroke or heart disease. Using this calculator, we compared the risk distribution between EAC cases and population-based controls from six studies in the Barrett’s and Esophageal Adenocarcinoma Consortium (BEACON; https://beacon.esocan.org) to contrast discrimination metrics of current guidelines vs. more stringent thresholds that might be used.

Implementation

Information required to calculate absolute risk of EAC, adjusting for competing mortality, includes i) incidence and mortality rates of EAC; ii) all-cause mortality rates; and iii) relative risks (RR), 95% confidence intervals (CI) and prevalence for each risk factor. These data are input from three spreadsheet files. Given the wide variation in risk of EAC by demographic factors, the population incidence and mortality rates are age-, sex- and race-specific, rather than being model-based. Similarly, since EAC incidence is substantially higher among persons with BE, and the constellation and strength of risk factors that predict EAC are quite different from the general population, input parameters are stratified by diagnosed BE status (Additional file 1: Figure S1 and Tables S1 and S2).

Fig. 1
figure 1

Risk calculator tab. In example a) A 60-year-old white male with moderate physical activity, non-use of NSAIDs or statins, no family history of BE or EAC, weekly – daily reflux symptoms, a body mass index (BMI) of 28 (“overweight”), who never smoked cigarettes, and has not been screened (BE status unknown) is estimated to have a 10-year risk of develo** EAC of 5.7 per 1000, or 1 in 175 people of similar characteristics. This is higher than the 10-year risk of dying from colon cancer and stroke for a 60-year-old white male, but lower than that from injury or heart disease. In example b) this same individual has undergone an upper endoscopy and found to have a visible Barrett’s segment length of 5 cm, but no evidence of dysplasia. His 10-year risk is now estimated to be 34.0 per 1000 (1 in 29 people) which is approximately equal to his risk of dying from heart disease

Incidence and mortality

The mean incidence and mortality rates by sex and race (white, black) in 5-year age groups (ages 40–84, and 85+) were obtained from the SEER database of 18 cancer registries for the years 2010–2015 [10]. Overall and cause-specific mortality rates by sex and race in 5-year age groups for the year 2014 were obtained from National Center for Health Statistics [11]. Estimated annual EAC incidence among persons with BE varies greatly depending on cohort definition and study design, but recent reports range from 0.19–0.41% among men and women [12,13,14]. In our calculations we used an incidence of 0.31% per year for white men (including those with dysplasia), reported from a large (n = 8, 929) population-based cohort of persons with BE [15]. We applied this figure to the 60–64 year age group of white males, a typical mean age of diagnosis [15,16,17]. Incidence was assumed to vary from this figure by a factor of 1.04 per year of age [18, 19]. EAC incidence among white women with BE was estimated to be 0.4 times the rate of white men [15]. There is little information on black males with BE; assuming that a portion of their approximately four-fold overall lower risk of EAC can be attributed to lower risk at the BE stage, we estimated their EAC incidence at 0.75 times that of white males. For ease of use in calculations, incidence and mortality rates were modeled using a third-degree polynomial (Additional file 1: Figure S2).

Risk factors

Risk factors for EAC were identified from the literature. In the general population, sGERD, obesity, cigarette smoking, and family history of BE or EAC were the major factors associated with increased risk (Additional file 1: Table S1) [20,21,22,23]. Conversely, physical activity, use of non-steroidal anti-inflammatory drugs (NSAIDs) and statin drugs have been associated with decreased risk in observational studies and randomized clinical trials [5, 24,25,26,27,28,29,30,31]. Estimates of strength of association (e.g., RR), 95% CI, and prevalence were taken in most instances from large meta-analyses of pooled individual data from population-based case-control studies (noted by bold type in Additional file 1: Tables S1 and S2).

In the setting of BE, clinical (Barrett’s segment length), histopathologic (high-grade and confirmed low-grade dysplasia) and molecular/genomic abnormalities were the dominant predictors of progression (Additional file 1: Table S2) [3, 12, 14, 16,17,18,19, 26, 32,33,34,35,36,37,38,39,40,41] with smoking and body mass index (BMI) playing more minor roles [18, 19, 33, 38]. In contrast, NSAIDs and statins exhibited inverse associations with EAC similar to the general population, based on a clinical trial and a cohort study for NSAIDs [29, 34] and a meta-analysis of 11 studies for statins [24]. Little is known regarding physical activity and family history and progression risk in BE, so these factors were omitted. Regarding sGERD, the RR used in the calculator for progression to EAC is estimated as one half of the trend coefficient for the general population. (For additional details see Additional file 1).

Estimation of absolute risk

Absolute risk of EAC over 10 years, adjusting for competing risk of death, was calculated according to methods described by Hsu, et al., and others [42,43,44]. First, the baseline hazard rate, defined as the hazard rate for individuals whose risk factors are at the lowest risk level, was calculated from the age-, sex- and race-specific EAC incidence rates and the population attributable risk calculated using the risk factors and prevalence from Additional file 1: Table S1. Second, the relative risk for an individual was calculated as the product of the relative risks attributed to each risk factor. Finally, the baseline hazard rate, relative risk, and risk of death from competing causes were combined over 10 years to yield adjusted absolute risk of EAC.

95% CIs were calculated from standard errors using the delta method, thus accounting for both direct effects of uncertainty in RR estimates, and effects on the population attributable fraction. IC-RISC™ is written in the R programming language [45] using the Shiny [1 illustrates inputs and 10-year risk for an individual who has not been screened (BE status unknown) (Fig. 1a), and one who has been diagnosed with BE (Fig. 1b) (See legend for details.) Mortality rates for selected causes of death are displayed in the center of the thermometer; these are specific for age, sex and race (black/white), but do not consider non-demographic risk factors.

The landing page is the “About” tab describing the purpose and background. Tabs are available to calculate BMI and categorize usual physical activity, which are not usually known. The “Risk factors” tab includes three options for displaying the input data used: a graph (Additional file 1: Figure S1, of the RR estimates and 95% CIs for each factor, stratified by the presence of diagnosed BE (unknown/negative vs. positive); and detailed tables (Additional file 1: Tables S1 and S2, containing all the RR data with supporting references. Reliable risk factor data specific for black females and other demographic groups of either sex are not currently available. The “More information” tab contains an option for viewing the age-, race- and sex-specific incidence and mortality rates used in the calculator (stratified by BE status) and all-cause mortality (Additional file 1: Figure S2) Additional menu options include contacts, version history, and licensing information.

Figure 2 illustrates the distribution of 10-year risk estimates for EAC cases and controls using data from population-based BEACON studies, along with estimates of specificity, sensitivity and positive predictive value (ppv) according to increasingly more stringent thresholds for further action. The area under the associated ROC curve (not shown) is 0.81 (95% CI = 0.79–0.83.) When applied to the general population, as in Fig. 2, the calculator provides insights into the performance of current practice guidelines (two examples of which are represented by vertical solid lines, see legend) [7], which sacrifice a significant amount of specificity in order to capture 79–86% of eventual cases. A small increase in action threshold, for example to 2 per 1000, would potentially eliminate almost half of procedures or tests (reducing 40% false positives to 20%), while only reducing sensitivity to 66%; whereas an increase to 4 per 1000, would further reduce false positives to 12%, with a sensitivity of 49%. Unfortunately, given the rarity of EAC, only with higher thresholds of 8 per 100,000 or more does the ppv surpass 1%.

It is notable that absolute risks of greater than 2.0 are readily reached by persons not included in current guidelines. For example, a 65-year-old white female ex-smoker with frequent (“> daily”) reflux symptoms, no family history, and a BMI of 31 has an estimated risk of 4.4 per 1000. Similarly, a 60-year-old non-smoking white male without reflux symptoms (“rarely”), a BMI of 31 (obese category I) and one first degree relative with BE/EAC has an estimated risk of 4.8 per 1000. Finally, a 65-year-old ex-smoking black male with occasional reflux symptoms (“< weekly”), no family history, and a BMI of 36 (obese category II) has an estimated risk of 2.1 per 1000.

Given that almost half of EAC cases in the general population do not report significant symptoms of reflux [9], we investigated the performance of the risk calculator in the subset of the cases (46%) and controls (70%) from the BEACON studies who reported sGERD only occasionally (“< weekly”) or rarely (Additional file 1: Figure S3) The overall discriminatory ability in this subset was similar to that in the entire dataset, though slightly more modest, with an area under the ROC curve of 0.78 (95% CI = 0.75–0.81.) While a specificity of 95% was reached at a lower risk threshold than in the entire dataset (about 4 vs. 8 per 1000 over ten years), the sensitivity at the lower threshold in this low-sGERD subset was only half (25%) that of the entire dataset (50%), and the ppv did not rise above 0.5%.

Discussion

This educational tool was developed to facilitate shared decision-making between a health provider and patient regarding: i) how personal risk of EAC fits into the “bigger picture” of health and disease, ii) whether preventive actions are indicated to possibly reduce risk of EAC and other conditions, and iii) whether additional tests and procedures might be warranted to identify and manage those with higher risk profiles [54].

A strength of IC-RISC™ is the robust data on which it is based. Incidence rates of EAC were based on the SEER 18 registries, which cover more than a quarter of the US population, and RR and prevalence estimates for the general population calculations came from large individual pooling efforts (including between 900 and 1500 cases). This contrasts with previous efforts in the general population which have both estimated associations and created risk models using relatively small individual studies, often with varying availability and comparability of potential predictors [42, 61].

Conclusions

Decisions regarding EAC prevention faced by patients and providers are plentiful. For example, they might involve preventative actions centered on lifestyle (e.g., weight loss, exercise, dietary change), chemopreventative actions (e.g., aspirin), non-invasive screening, endoscopic screening, surveillance, and treatment of high-risk Barrett’s. Informed judgment plays an important role in most of these decisions since compelling evidence does not currently exist to support hard decision thresholds. If decision thresholds did exist, they would have to account for vastly different ratios of risk to likely benefit for each option, and large inter-individual differences (in the general population as well as providers) in risk tolerance and willingness to undergo medical interventions, among many other factors. In this context, IC-RISC™ provides a simple and rapid means to graphically communicate risk of EAC in the context of other health risks, allows “what-if” scenarios regarding potential preventative actions, and can directly inform discussions regarding screening, surveillance and treatment options. It extends current qualitative clinical practice guidelines for endoscopic screening into a quantitative assessment, which brings the possibility of screening being offered to higher-risk persons not currently targeted (e.g., males without sGERD and females), while also potentially identifying lower-risk persons who might otherwise be targeted for screening as unlikely to benefit. It also may be useful in defining higher-risk persons for intervention trials. The calculator would benefit from prospective testing and application to existing well-characterized cohort studies to refine the estimates and facilitate harmonizing existing risk calculators so that a uniform message can be developed for clinical use. Until such time, IC-RISC™ can be used to inform joint decision-making rather than to indicate specific preventative actions or thresholds.

Availability and requirements

Project name: IC-RISC.

Project home page: Software - https://github.com/FredHutch/IC-RISC-Working

Application – https://ic-risc.esocan.org

Operating system: Platform independent.

Programming language: R/Shiny.

License: Academic use - 2-Clause BSD; Non-academic - License needed.