Introduction

Advanced gastric cancers (AGC) display several chromosomal and genetic abnormalities. Notably, up to 37 % of AGCs show amplification of the genes related to receptor tyrosine kinase/Ras signaling pathways [1, 2]; this has been corroborated by the recently proposed molecular classification by The Cancer Genome Atlas [3]. One major challenge is therefore to implement strategies that select patients according to the intrinsic molecular characteristics of their cancer so that targeted agents can be optimized to improve overall survival (OS).

In 1986, some gastric cancers were found to exhibit amplified human epidermal growth factor receptor 2 (HER2) [4]. In the following decade, it was estimated that between 9 and 36 % of all tumors overexpress or amplify HER2 depending on location and histopathological subtype [57]. Initial clinical trials indicated that trastuzumab, an anti-HER2-targeted agent, possesses considerable activity against HER2-positive AGC [8, 9].

The ToGa phase III trial evaluated HER2 amplification using fluorescence in situ hybridization (FISH) and HER2 overexpression by immunohistochemistry (IHC) in 3807 patients with AGC or adenocarcinoma of the gastroesophageal junction. The results established a clear benefit of trastuzumab in terms of OS [hazard ratio (HR) 0.74, P = 0.0046]. The subgroup with HER2 expression of IHC3+ or IHC2+/FISH+ obtained the greatest survival benefit (16 vs. 11.8 months) [10]. Trastuzumab not only improves OS but also helps to preserve patient quality of life [11]. Thus, the introduction of trastuzumab into the frontline treatment of HER2-positive AGC ushered in the age of targeted therapy and personalized medicine for this pathology.

Based on these data, the European Medicines Agency approved trastuzumab in combination with chemotherapy to treat HER2-positive AGC in 2010. Since then, HER2 testing has become a standard practice. However, research has paid less attention to the process of implementing HER2 determination across centers and the main barriers to its use. Novel medical technologies such as HER2 testing are gradually implemented after a certain period of time, and the speed at which this process is completed (e.g., the time taken to change the standard of care) may impact clinical outcomes in a complex manner. As such, the introduction of systematic HER2 testing could be an attractive model to exemplify the effect of predictive biomarkers for response to targeted treatments in the community. Therefore, the aim of the study reported in the present paper was to evaluate how changes in the proportions of patients with AGC who were not tested for HER2 at institutions influenced the OS of those patients during the period from 2008 to 2016.

Patients and methods

Patients

This Spanish National Cancer Registry includes cases who were consecutively recruited at 28 tertiary centers from January 2008 to January 2016. The basic eligibility criteria were that the patients were adults (≥18 years) with histologically confirmed unresectable or metastatic adenocarcinoma of the distal esophagus, gastroesophageal junction, and stomach. The patients had to have received at least one cycle of polychemotherapy (two or more drugs) in the first-line treatment, and to have been followed up for at least 3 months with adequate clinical information; an objective tumor evaluation based on computed tomography was required, and laboratory data for this period had to be available. Subjects who died early are exempted from these requirements. Patients were excluded if the information needed to complete the database collection could not be obtained, or if they were still alive at the time of collection but did not give their written consent or exercised their right to rectify and revoke. The ethics review board at each of the participating institutions approved the study, and the procedures were in accordance with the Helsinki Declaration.

Study design

The Spanish National Cancer Registry is a non-interventionist cancer registry. HER2 testing and trastuzumab or chemotherapy regimens were implemented or not, depending on each center’s clinical practice. Data were collected through the centralized web platform: http://www.agamenonstudy.com/. This software tool has filters to ensure data consistency. In addition, an independent researcher monitored data accuracy remotely, validating information and locking data for cases when they were completed.

Variables

HER2 overexpression/amplification was tested according to clinical practice protocols at each center by means of IHC or FISH [12]. All patients with biopsy scores of IHC 3+ were eligible for trastuzumab, as well as cases with biopsy scores of IHC 2+ and a positive FISH amplification result. When the HER2 status was not determined according to the usual practice, researchers were not requested to obtain a retrospective assessment. Hospitals were classified according to the proportion of patients who were not tested for HER2, and divided by quartiles into four groups of approximately the same size, representing universal, high, medium, and low HER2 testing. Reasons for not ordering HER2 testing were recorded. The primary outcome measure was OS, calculated from the beginning of the first cycle of chemotherapy.

Statistics

Survival curves were estimated using the Kaplan–Meier method. The Wilcoxon test was applied to compare the survival functions. The effect of trastuzumab was assessed using stratified Cox proportional hazards (PH) regression. When building this model, we analyzed 13 covariates using univariate statistical method screening; variables univariately predictive of survival (P < 0.1) were entered into the Cox PH regression. We excluded the covariate HER2 status to avoid multicollinearity with trastuzumab therapy. This analysis was stratified by year of diagnosis and controlled for other confounding factors. The PH assumption was determined using Schoenfeld residuals. Sensitivity analyses were performed according to the reason for not ordering HER2 testing. All P values are two-tailed. The STATA statistical package v14 (StataCorp, College Station, TX, USA) was used throughout.

Results

HER2 determination and use of trastuzumab

The registry contained 1615 patients, 1170 of whom met the eligibility criteria (Fig. 1). Patient baseline characteristics are shown in Table 1. Overall, HER2 was not tested in 22.9 % of the patients in the registry (N = 268). The rate of HER2 testing increased over time, from 58.3 % in 2008 to 92.9 % in 2016 (Fig. 2). Alleged reasons for not determining HER2 in this registry were as follows: HER2 testing (or the use of trastuzumab) was not formalized as standard for AGC in this institution in 74.3 %, date of first-line therapy prior to 2010 in 8.3 %, unknown in 6.4 %, HER2 status tested for second-line therapy in 3.7 %, insufficient tissue sampling in 3.7 %, chemotherapy initiation deemed urgent, but no subsequent HER2 assessment due to deterioration or premature death in 3.3 %, and subjects were not candidates to receive trastuzumab therapy (e.g., elderly or severe heart disease) in 0.3 %.

Fig. 1
figure 1

Flowchart of the study. Asterisk: note that the categories were not mutually exclusive

Table 1 Baseline characteristics (n = 1170)
Fig. 2
figure 2

Patients tested for HER2 and treated with trastuzumab each year. Percentages of HER2-positive patients (IHC 3+ or IHC 2+ and FISH+) and patients treated with trastuzumab with respect to the total number of patients tested for HER2 are shown

Of the 908 tested tumors, HER2 was positive in 194 of them (21.3 %) (IHC 3+ in 123, IHC 2+/FISH+ in 71). Trastuzumab was administered combined with first-line chemotherapy to 165 of these individuals. Two of these tumors, initially considered HER2-positive and treated with trastuzumab, were retested due to early progression, and finally both were classified as HER2-negative. In this series, trastuzumab associated with first-line chemotherapy was first administered in 2008. Since then, the percentage of patients with HER2-positive tumors who have received first-line trastuzumab has increased year after year (Fig. 2). The turning point occurred in 2011 when 90 % of subjects with HER2-positive tumors received trastuzumab in combination with the first-line treatment.

HER2-positive patients treated with trastuzumab exhibited better prognosis than HER2-negative patients. Median OS was 14.6 months (CI 95 %, 11.1–18.1) in trastuzumab-treated patients as opposed to 9.7 months (CI 95 %, 9.1–10.3) in HER2-negative patients, P < 0.001. In contrast, 31 patients with HER2-positive AGC did not receive trastuzumab with first-line chemotherapy (see Table S1 in the Electronic supplementary material, ESM). Reasons for not administering first-line trastuzumab in these 31 HER2-positive cases were as follows: initiation of first-line therapy before 2010 with HER2 being tested for successive lines (N = 11); contraindication due to chronic heart disease (N = 5); still not considered standard by the treating physician (N = 7); and trastuzumab administration with the second-line treatment (N = 8). Despite the small subgroup (N = 31), the median OS in HER2-positive patients who did not receive first-line trastuzumab was 14.9 months (CI 95 %, 7.7–22.1). Remarkably, 19 of these 31 patients received trastuzumab with the second- or third-line therapy.

Impact of HER2 testing on outcomes

In the univariate analysis, a positive effect on OS was observed for the IHC 2+/FISH+ subgroup, with a stratified HR of 0.71 (CI 95 %, 0.53–0.97), P = 0.032, as well as for patients with IHC 3+ status, with a stratified HR of 0.57 (CI 95 %, 0.45–0.73), P < 0.0001. In contrast, patients in whom the HER2 testing was not performed had a worse OS compared to the rest of the series: they had a stratified HR of 1.226 (CI 95 %, 1.035–1.451), P = 0.018, considering a HER2-negative status as the reference (see Table 2). However, this effect disappeared in the sensitivity analyses when subjects who died within 30 days of chemotherapy or for whom trastuzumab was contraindicated (e.g., the elderly or those with severe heart disease) were excluded.

Table 2 Overall survival by subgroups (univariate analyses)

Patients treated in the 4-year period 2012–2015 had a longer median OS, 10.4 months (CI 95 %, 9.64–11.24), as compared to patients treated during 2008–2011, with a median OS of 9.2 months (95 % CI, 8.2–10.2), P = 0.021. Otherwise, patients treated at institutions with increasing proportions of tumors that were not tested for HER2 showed a significant increase in risk of death, with a stratified HR of 1.005 (CI 95 %, 1.003–1.007), P < 0.0001. This effect remained significant after excluding either those subjects with trastuzumab contraindication, subjects with early mortality (median OS < 30 days), or those who were treated before 2011.

Table S2 in the ESM displays the proportion of the patients who were not tested for HER2 at each institution (encoded). This shows that the rate of non-testing at institutions was not homogeneous. The centers were therefore classified by quartiles based on the proportion of patients with missing HER2 data, which divided the set of measurements into four approximately equal parts (Q 1Q 4, see Table 3).

Table 3 Median overall survival (OS) and hazard ratio (HR) based on missing HER2 data quartiles

Kaplan–Meier curves and survival times for Q 1Q 4 are presented in Fig. 3 and Table 3. The median OS progressively decreased for quartiles Q 1 to Q 4: 10.67 months (95 % CI, 9.62–11.72) for Q 1, 10.28 months (95 % CI, 8.91–11.65) for Q 2, 9.98 months (95 % CI, 8.43–11.54) for Q 3, and 8.57 months (95 % CI, 7.44–10.67) for Q 4. The differences in median OS between the fourth quartile (Q 4, missing HER2 data >30 %) and the other quartiles (Q 1Q 3) were significant. Pairwise comparisons of median OS were as follows: Q 1 vs. Q 2, P = 0.642; Q 1 vs. Q 3, P = 0.381; Q 1 vs. Q 4, P = 0.001; Q 2 vs. Q 3, P = 0.190; Q 2 vs. Q 4, P = 0.001; and Q 3 vs. Q 4, P = 0.034. Differences among institutions remained significant when patients treated before 2011 were excluded.

Fig. 3
figure 3

Kaplan–Meier curves for OS by percentile. Centers are classified by quartile, based on the proportion of patients with a missing HER2 determination: Q 1 = 0 %, Q 2 = 1–3 %, Q 3 = 4–30 %, and Q 4 > 30 %. Median OS and HR based on missing HER2 data quartiles are presented in Table 3

In the stratified Cox PH regression, the proportion of the patients who were not tested for HER2 at the institutions was significantly associated with adverse outcomes (Table 4). Each 1 % increase in the patients with HER2 missing data at each hospital induced a significant approximately 0.3 % increase in the risk of death during this period of observation: HR, 1.0035 (CI 95 %, 1.0014–1.0057), P = 0.0019, with the latter being specifically stratified by year of diagnosis. For example, a hypothetical institution that tested HER2 in only 65 % of AGC patients would have an approximately 10 % increase in the risk of death in comparison with a center in which every patient is tested.

Table 4 Stratified Cox proportional hazards model for overall survival (OS)

A secondary consequence of HER2 testing in this series is the change in the pattern of chemotherapy use, with a gradual reduction in the use of triplet chemotherapy in parallel with increased HER2 testing. Thus, chemotherapy with three cytotoxic agents was used in 13 % of the HER2-positive (IHC +3) patients and for 4.2 % of the HER2-positive (IHC +2/FISH+) tumors, whereas 40 % of the HER2-negative patients and those whose HER2 status was undetermined were treated with triple chemotherapy (P < 0.0001).

Discussion

Gastric cancer is a common malignancy associated with poor prognosis. It is therefore imperative that new, more effective treatment strategies should be developed and quickly incorporated into daily clinical practice. An analysis of genomic aberrations has shown that signaling pathways for genes such as HER2, vascular endothelial growth factor (VEGF), phosphatidylinositol-4,5-bisphosphate 3-kinase/AKT/mammalian target of rapamycin (PI3K/AKT/mTOR), and hepatocyte growth factor and its receptor MET (HGF/MET) are often altered in gastric cancer [2]. Inhibiting agents that target these molecular anomalies are a very promising therapeutic option [14].

According to the ToGA study data, trastuzumab combined with chemotherapy unequivocally increases survival in HER2-positive AGC [10]. However, the literature is less clear about how these advances in knowledge trickle down into clinical practice in terms of the implementation of HER2 testing, how to overcome the barriers that impede the use of this new drug, and how this will ultimately serve to bring the benefits observed in the trial into daily clinical practice.

In 2015, we began a clinical practice registry covering 28 centers to detect areas for improvement in the management of AGC. The registry suggests that HER2 was relatively rapidly introduced as up to 11 centers began HER testing in the biennium 2008–2009. Consequently, 63 % of the cancers were tested in that time period. Because the registry includes relatively few cases from these years and brings together the main Spanish reference centers but not smaller institutions, it is difficult to know whether this truly represents clinical practice in the country during 2008–2009. Nonetheless, this result is striking, as little information had been published about the effectiveness of trastuzumab at that time [15, 16]. In addition, trastuzumab had not been approved during that period; hence, in most of these cases, it was not used as first-line therapy.

The ToGA study was published in August 2010 [10], and had a strong impact on clinical management. A few months following ToGA study publication [10], trastuzumab was commonly used with first-line chemotherapy in almost all HER2-positive tumors. Since then, the percentage of tumors tested has increased year after year, although HER2 testing did not reach 85 % until 2012, and universal testing was probably not achieved in Spain until very recently, despite the 2011 national consensus [12]. Nevertheless, we noticed that the introduction of HER2 testing did not occur synchronously nor homogeneously throughout the country. This is possibly due to the decentralization of the national health system in Spain, and it appears to have had a noticeable clinical impact at different regional institutions. To analyze this issue, each of the hospitals included in this registry was classified on the basis of the rate of missing HER2 data. Based on the four quartiles of this distribution, the centers were categorized into four groups: universal, high, medium, and low HER2 testing. Generally, prognosis worsened from the universal to the high to the medium and then to the low HER2 testing group, such that a statistically significant difference in median OS of 2.1 months was observed between the universal and low HER2 testing groups.

Two main ideas emerge from this analysis. First, many patients with AGC were missing out on a crucial genetic test to determine whether they would benefit from trastuzumab. Second, the favorable effect of widespread HER2 testing does not seem to be limited to patient selection. This implies that HER2-negative patients might also have benefited in some way from the proper implementation of HER2 testing at their center. A plausible interpretation of these data is that the percentage of HER2-tested patients behaves as a quality-of-care indicator at these centers. It is tempting to speculate that determining HER2 status in clinical practice is a challenge for the institution, which forced multidisciplinary teams to coordinate, improving the quality of care in many different ways beyond assisting in patient selection for anti-HER2 therapy [12]. As such, the policy of universal genetic testing seems to behave as a quality-of-care indicator. However, it is also necessary to keep in mind that not determining HER2 due to poor clinical status or trastuzumab contraindication can be regarded as good practice. On the other hand, centers that determined HER2 less frequently tended to supplement this practice with the more frequent use of triplet instead of doublet chemotherapy (e.g., docetaxel-based regimens). A multivariate analysis of the cases included in our registry illustrates that this practice can be beneficial, but it does not completely palliate the negative effect of leaving HER2 untested.

Our study has several limitations inherent to retrospective registries, especially those dealing with complex spatial and temporal patterns. First, we cannot exclude the possibility that other reasons that were not taken into consideration may have confounded patient prognosis. One of the most obvious concerns was that early mortality interferes competitively with HER2 testing. To control this potential bias, our analysis was stratified based on patient demise prior to 30 days. Second, as with all observational records, it is impossible to guarantee that all confounding factors, particularly time-dependent ones, have been taken into account. Third, the groups with low and intermediate levels of HER2 testing include only a few hospitals (7 and 5 of 28 centers, respectively), and it is possible that local, non-HER2 testing practices at these institutions may have influenced patient prognosis.

Despite these limitations and caveats, we consider that our findings are consistent with the fact that early introduction of HER2 testing and trastuzumab therapy were beneficial in the patients included in this registry. In contrast, centers that took longer to implement HER2 testing exhibited worse clinical outcomes. We believe that these data could be extrapolated to other biomarkers used to select patients for personalized oncology therapies. Centers should strive to incorporate these diagnostic improvements as soon as there is evidence of the benefit and cost-effectiveness of this procedure. Reviewed local and global guidelines on HER2 testing should be used to achieve this goal in a timely manner [12, 1719].