Introduction

The presence of lymph node metastases is the most important predictive factor for survival in gastric cancer in patients with potentially curative resection [1, 2]. More than one-third of patients with gastric adenocarcinoma have an unnecessarily extended lymph node dissection [3]. Several studies have shown increased perioperative morbidity and mortality with D2 versus D1 lymph node dissection [46]. Successful estimation of lymph node involvement may help to define those patients who would and those who would not benefit from an extended lymph node dissection in association with gastrectomy. For this purpose, a computer program was developed by Maruyama and his colleagues [7] to estimate the incidence of lymph node metastases, the expected prognosis, and the proportion of curable cases at surgery, based on the most significant preoperative prognostic factors. This program was later improved as the Windows-based program WinEstimate v. 2.5, using a database of 4302 primary gastric cancer patients treated at the National Cancer Center Hospital in Tokyo between 1968 and 1989 [8].

Sentinel lymph node (SLN) map** has become a standard procedure in breast cancer and malignant melanoma [9, 10]. The injection method and selection of tracers for SLN map** in gastric cancer remain controversial. Studies from Japan and from Western Europe have demonstrated a high detection rate (93.7 %) and an accuracy of 92 % for SLN biopsy in patients with gastric cancer [11]. We have evaluated, for the first time in Hungary, the validity of SLN map** using a blue dye-only method in gastric cancer [12].

In this prospective study we compared the efficacy of the Maruyama computer program (MCP) and the SLN biopsy (SNB) technique to predict the nodal status and localization of lymph node metastases in our patients.

Patients, materials, and methods

The study was conducted prospectively from February 2008 to April 2011. at the Department of Surgery of the Gyula KenézyTeaching County Hospital, Debrecen, Hungary. Exclusion criteria were gastric stump tumor, cardia cancer, distant metastases, and involvement of the surrounding organs (T4). Forty consecutive patients were evaluated with the MCP. The calculation required the following prognostic factors in every patient: age, gender, position of the tumor (upper, middle, or lower third of the stomach, anterior or posterior wall, lesser or greater curvature), Bormann’s classification or early gastric cancer classification according to the Japanese Endoscopy Society, depth of infiltration, and histological type. Based on these data, the computer model calculated the proportion of supposedly positive nodes at each lymph node location (stations 1–16) (Fig. 1).

Fig. 1
figure 1

Prediction of lymph node involvement by the Maruyama computer program in a 53-year-old female patient. The tumor histology was moderately differentiated adenocarcinoma, showing definite serosal involvement (S2), Borrmann type 3. The lesion was found in the anterior wall in the lower third of the stomach and had a maximal diameter of 15 mm

All the patients underwent open gastric resection with blue dye map** (Fig. 2) and modified D2 lymph node dissection (stations 1 and 2 in patients with total gastrectomy and stations 3–9, 11, and 12 routinely). Sampling of station 10 and compartment 3 nodes (13–16) was optional if macroscopically suspicious, so we did not routinely calculate the values of these stations. Our SLN marking method and the histological examination were described earlier [12]. The position of lymph nodes was labelled according to the Japanese classification of gastric carcinoma (JCGC) [13]. Only negative sentinel nodes were examined for micrometastases. Postoperatively, the SLNs were sectioned at 0.2-mm intervals and hematoxylin and eosin (HE) staining was performed, and immunohistochemistry examinations were performed with DAKO Monoclonal Mouse Anti-Human Cytokeratin (clone AE1/AE3; dilution 1:30; Dakocytomation; Glostrup; Denmark).

Fig. 2
figure 2

Sentinel lymph node subserosal map** with blue dye only. This photograph shows good labeling of the lymphatic vessel and lymph node station 3

To compare the probability calculations by MCP and the results of SLN map**, we had to define a cutoff level; this was done using receiver-operating characteristics (ROC) analysis [14]. This logistic regression model indicates the probability of concordance between the predicted probability and the proven diagnosis of lymph node metastasis. We estimated the sensitivity and specificity with several cutoff points of the Maruyama program for the expected percentage values obtained by the ROC analysis for each of the 12 lymph node (LN) stations. We defined the best cutoff points for every LN station and the common critical cutoff point to maximize the test validity.

For each station, we declared positivity if its Maruyama value was greater than or equal to the cutoff specific to that station. Overall positivity was then declared based on the presence of station-specific positivity at any station included in the analysis. Thus, the only way for station elimination to produce a difference is if positive stations are eliminated such that all remaining stations are negative.

Statistical analysis of equivalence between the results of MCP and SLN map** was based on calculating the ratios of test performance indicators (those of the SLN map** method divided by those of the MCP procedure). Equivalence was established when the 95 % exact confidence interval around such a ratio was fully contained within the range 0.8–1.25.

The statistical package Stata [15] was used for data handling and analysis. Confidence intervals based on robust standard errors were calculated by using the Variance-Covariance matrix of the Estimators (cluster \( \left\langle {\left. {\text{clustervar}} \right\rangle } \right. \)) option in a Poisson regression model formulated for the relative comparison of two proportions, where \( \left\langle {\left. {\text{clustervar}} \right\rangle } \right. \) denotes the subject identifier, i.e., a variable taking the same unique value in the two records (one for Maruyama-based, and another for sentinel LN-based results) for each subject. The option is made available in Stata explicitly for the analysis of datasets of a multiple-observations-per-subject structure and produces robust, rather than naïve, standard errors based on the Huber–White (sandwich) estimator of variance [16] generalized and implemented for the purpose [17].

Results

We investigated 40 consecutive patients, 21 females and 19 males with a mean age of 64.1 (range 50–80) years. The average body mass index (BMI) was 22.6 (range 17.1–27.6). Fifteen patients had signet-ring cell carcinoma, 18 patients had moderately differentiated adenocarcinoma, and 7 patients had poorly differentiated adenocarcinoma. The tumor was localized in the upper third of the stomach in 8 patients, in the middle third of the stomach in 10 patients, and in the lower third of the stomach in 22 patients. The depth of invasion was T1 in 10, T2 in 11, and T3 in 19 patients. Total gastrectomy was performed in 14 patients, and subtotal gastrectomy in 26 patients. A total of 795 lymph nodes were removed, and 19.9 lymph nodes per patient were examined on average (range 10–38 lymph nodes per patient). The mean number of blue nodes was 4.25 per patient. The frequency of lymph node labeling was evaluated in relation to the tumor’s location (Table 1). In two patients with a lower-third tumor the lymph node station 10 was labeled in an uncommon manner. Both patients had a circular tumor. The first patient had a T3, grade 3, papillary adenocarcinoma with lymphatic vessel invasion, and five lymph nodes were labeled (LN no. 3, no. 6, no. 7, no. 8, and no. 10), while the other patient had a T3, grade 3, signet-ring cell carcinoma and twelve lymph nodes were stained (LN no. 5, no. 6, no. 7, no. 8, no. 10, no. 11, and no. 12).

Table 1 Frequency of lymph node (LN) labeling in relation to tumor location

Table 2 shows the frequency of patients with dissected LN stations and metastatic involvement.

Table 2 Frequency of patients with dissected and metastatic LN stations

The best cutoff point was estimated for every LN station and a critical cutoff point of 12 % of the MCP expected percentage maximized the test validity (Table 2). In 2 patients with metastases in LN stations 9 and 12 the MCP was calculated as 0 %, so the sensitivity was zero with any cutoff level, as it was in 2 other patients with metastases in LN station 10.

The sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and accuracy of MCP were calculated with a 12 % cutoff point and the best cutoff point station by station (Table 3).

Table 3 Statistical features of MCP and SNB and the equivalency results

In 39 patients SLNs were successfully identified, resulting in a detection rate of 97.5 %. Seventeen patients (42.5 %) were histologically node-negative; in 16 patients both SLNs and non-SLNs were negative. In one patient there was no sign of any labeling; the BMI of this patient was 26.8. In 22/23 patients at least one SLN showed tumor involvement, with a false-negative rate of 1/23 (4.3 %). In this single false-negative case the tumor was T3 in extension, and grade 3 with perineural invasion histopathologically and macroscopically involved lymph nodes were found during the operation. In the sixteen SLN-negative cases micrometastases were not found in the sentinel nodes. The results of the intraoperative frozen sections (SLN) correlated completely with the postoperative pathological findings. Sensitivity was 95.7 %, and specificity was 100 %. The NPV was 94.1 % and the PPV was 100 %. The accuracy was 97.4 %.

Results of the MCP and SLN biopsies were equivalent only in sensitivity, based on the 95 % confidence interval (CI) of the ratio of performance indicators. Specificity, NPV, PPV, and accuracy with the SNB method were superior to those of MCP (Tables 3, 4).

Table 4 Ranges of 95 % confidence intervals

The MCP with the best cutoff point (station by station) had a somewhat better statistical result than that with the 12 % cutoff point; however, the area under the curve value of SNB was higher. The difference was statistically significant between SNB and MCP with the 12 % cutoff point (p = 0.0043) and between SNB and MCP with the best cutoff point (p = 0.0003). The area under the curve value of MCP with the best cutoff pointwas higher than that of MCP with the 12 % cutoff point, although the difference was not significant (p = 0.1441) (Fig. 3).

Fig. 3
figure 3

Comparison of Maruyama computer model (12% cutoff and best cutoff) and sentinel lymph node biopsy by the area under the curve (AUC) of the receiver operating characteristic

We evaluated the accuracy of MCP for the first tier lymph nodes, owing to the low value of the area under the ROC curve in stations 9, 10, and 12. However, the accuracy and the statistical features (sensitivity, specificity, PPV, NPV) did not change, because patients with false-negative second-tier lymph nodes had false-negative lymph nodes in the first tier also. The accuracy of MCP in sentinel node-positive patients was 91 % and it was only 75 % in the complete group of the forty patients.

The sensitivity of MCP in the cohort of SLN-positive patients was 91 % and the PPV was 100 %. So, these features of MCP and SNB were proven equivalent in the sentinel node-positive group. In these patients the specificity and NPV could not be defined, owing to the lack of sentinel node-negative patients.

The accuracy of the MCP for the prediction of lymph node metastasis in stations 7–12 was 50 % in sentinel node-positive patients. This accuracy increased to 72 % when the sentinel node was found in compartment 2.

There were no side-effects of the blue dye map**. The postoperative period was uneventful in all the 40 patients, without any surgical or nonsurgical complications.

Discussion

Dutch and British prospective randomized trials found higher morbidity and mortality rates in patients with gastric cancer following extended lymph node dissection when compared with findings for those who underwent D1 dissection only [4, 5]. Forty percent of Western European patients with R0 resection have an unnecessary extended lymph node dissection [3]. Preoperative diagnostic tools have low sensitivity and specificity for defining the patient subpopulations which would and which would not benefit from an extended lymph node dissection. The sensitivity, specificity, and accuracy of spiral computed tomography for the detection of pathological lymph node involvement are 73.1, 50.0, and 84.2 %, respectively [18] and endoscopic ultrasonography has an accuracy of 68.6 %, with a sensitivity and specificity of 66.7 and 73.7 %, respectively [11].

A single comparative study (SNB vs. MCP) from Germany has proven SNB to be of higher clinical relevance than the Maruyama computer model for predicting nodal status and compartimental involvement [25].

Although it is difficult to draw definitive conclusions in our prospective comparative study, owing to the small size of the series, we demonstrated a degree of reliability of MCP similar to that in the cited studies, with 91.3 % sensitivity, 64 % specificity and 80 % accuracy by the best cutoff point. The false-negative rate was 8.7 %. We detected SLNs in 97.4 % of patients with gastric cancer with 95.7 % sensitivity, and 100 % specificity using the blue dye-alone technique. The false-negative rate was 4.3 %–exactly half the false-negative rate found with MCP . We are planning to carry out reduced lymphadenectomy in SLN-negative patients in the near future, according to the internationally accepted guidelines.

As to the clinical meaning of the equivalence range (0.8–1.25 under general convention, subject to narrowing or widening if special circumstances justify) for the ratio of performance indicators in the given setting: perfect equivalence is represented by the value 1, while that part of the range greater than 1 represents the superiority of the Maruyama method over the SLN method (because all test performance indicators are formulated to express better performance with greater numerical values). The lowest point of the range represents the inferiority of the Maruyama method to an extent where it can only achieve 80 % of what SLN testing is capable of. For sensitivity, this means that, should this worst-case scenario be the actual case, 20 % of metastatic cases expected to be identified as positive through SLN testing are expected to be misclassified as negative by the Maruyama procedure. However, the point estimate is at a relative sensitivity of 0.955 (4.5 % fewer metastatic cases expected to be identified), and the confidence interval extends from 0.812 (18.8 % fewer cases identified) to 1.122 (12.2 % more cases identified; this presupposes an SLN sensitivity of 89.1 % or less, which is well within the 95 % confidence interval for that estimate). The analysis of other indicators produced no conclusive results, as the confidence intervals for the indicator ratios included values both inside and outside the equivalence range. So, the sensitivity of MCP and SNB was proven equivalent, while the specificity, NPV, PPV, and accuracy were higher with SNB.

It is generally accepted that metastases in SLNs indicate the need for a D2 lymphadenectomy. We analyzed the relevance of MCP in sentinel node-positive patients. The accuracy of MCP in sentinel node-positive patients was 16 % higher than that of SNB in our group of forty patients. The sensitivity and the PPV of MCP and SNB were proven equivalent in the sentinel node-positive group. Unfortunately, in this cohort of patients, the accuracy of MCP was low in the prediction of lymph node involvement in stations 7–12. So, it would be interesting in the future to find an appropriate technique that combines the sentinel node status and the results of MCP for determining the adequate extension of lymphadenectomy.

In summary, our comparative study showed a lower clinical impact of the MCP compared with that of SNB; however, using these two methods in a parallel fashion could be useful in preoperative decision-making for determining the appropriate extent of lymphadenectomy and individualized stage-adapted surgery in gastric cancer. The efficiency of MCP in the cohort of sentinel node-positive patients requires further evaluation.