1 Introduction

Melanoma is the leading cause of skin cancer-related deaths [1]. Timely and effective treatment is crucial for improving patients’ prognosis. Adjuvant therapy has been demonstrated to be an effective survival enhancer and is now an essential part of the treatment plan for stage III and stage IV patients after complete resection [2]. For stages I and II, even though the a relatively low risk of metastasis, there are still several patients who later suffer metastasis, which often leads to death by melanoma [3]. Currently, clinical trials have now been conducted on stage II patients and have shown effectiveness in high-risk groups [4, 5]. So, a prognostic assay would be highly beneficial in this scenario as it would enable early-stage, high-risk patients to receive treatment without subjecting all patients to the potential side effects of adjuvant treatments. Studies showed that the lesions from certain anatomical sites (cutaneous melanoma of the back) are associated with a lower 5-year overall survival (OS) rate and higher hazard ratio (HR) compared with cutaneous melanoma of other sites [6,7,8]. As the most common subtype of malignant melanoma in Asians and other non-Caucasian populations [9], acral lentiginous melanoma (ALM) originates from melanocytes on the non-hair-bearing skin and the nail unit. ALM is histologically defined since its growth pattern is characterized by a predominantly confluent single-cell (lentiginous) array of atypical melanocytes along the dermo-epidermal junction. ALM is correlated with greater Breslow depth, more advanced clinical stage at presentation, higher rates of sentinel lymph node metastasis and ulceration as well as worse prognosis [10]. Nevertheless, the prognostic significance of ALM in stage I-III patients is not well explored.

At present, most of the study results can not comprehensively evaluate the prognosis of postoperative ALM patients. In addition, owing to the distinct proportion of melanoma subtypes in Asians and Westerners, conclusions that were drawn from the Western populations still warrant further investigation and validation in a wider range of the population. Therefore, the closest approach may be reached by analyzing the data from the Surveillance, Epidemiology, and End Results (SEER) database to help provide ALM's individual risk stratification information. Among the existing assessment tools, the nomogram is a reliable and widely adopted tool to evaluate risk by integrating clinicopathologic characteristics for oncologic outcomes [11,12,13,14,15] and assist patients along with physicians in making optimal clinical decisions. Based on clinicopathologic characteristics of both patients and tumors, nomograms enable personalized computation of independent risk factors simultaneously and draw the most accurate conclusion to predict the prognosis of a particular patient. Thus, nomograms are convenient, and economical and have been proven to show better predictive ability than the traditional Tumor, Node, Metastasis (TNM) staging system in various types of tumors [16]. Nevertheless, the application of nomograms in ALM is rare.

Herein, we collected the baseline information, clinicopathologic characteristics, and follow-up data of stage I-III ALM patients from the SEER database and performed a comprehensive analysis. This study randomized 1686 patients in a 7:3 ratio to the training cohort and validation cohort. In addition, this prognostic nomogram was also validated by a multicenter external cohort from China.

2 Methods

2.1 Patient selection criteria and follow-up

The SEER database (https://seer.cancer.gov/) includes 18 population-based cancers. In this study, all the information for 2160 patients with ALM (#8744) was extracted by the SEER*Stat program (v 8.3.9.2) with diagnosis years from 2004 to 2015 according to the International Classification of Disease for Oncology (ICD‐O) version 3. Patients with the following information were included: Age recodes with < 1-year-olds, Year of diagnosis, Sex, Derived AJCC (American Joint Committee on Cancer) Stage Group, 6th ed (2004–2015), Survival months, Vital status recode, SEER cause-specific death classification. Patients were excluded if any of the data was missing or incomplete or were in stage IV (since all the patients in our cohort are within stage III). According to the standard of the World Health Organization (WHO), using the age of 60 as the, the age was divided into elderly and young groups. Finally, 1686 cases were eligible and randomly allocated in a 7:3 ratio to the training and internal validation cohorts. The training group contained 1181 patients, while 505 patients were in the internal validation group.

In the external validation cohort, patients diagnosed with acral lentiginous melanoma at Fudan University Shanghai Cancer Center (FUSCC, Shanghai, China), Tian** Medical University Cancer Institute and Hospital (TMUCIH, Tian**, China), and Zhejiang Cancer Hospital (ZJCH, Zhejiang, China) between September 1, 2009, and August 31, 2017, were identified. At least two experienced pathologists from each center evaluated all histological specimens. The following variables were also collected: age, gender, initial disease stage and primary anatomical sites, survival months, and vital status. Patient follow-up was performed using re-examination or telephone calls until death or September 31, 2019. Survival was censored at the date of the last follow-up. OS was defined as the time interval from curative resection to death/the last follow-up. Median OS was defined as the time 1/2 the patients are expected to live after curative resection to death/the last follow-up. Cancer-specific survival (CSS) was defined as the duration from the date of diagnosis until death due to acral lentiginous melanoma other than other causes/the last follow-up; Median CSS as the time 1/2 the patients are expected to live after curative resection to death due to acral lentiginous melanoma other than other causes /the last follow-up. Patients with incomplete clinicopathological characteristic data, less than 18 years old, or with a follow-up of less than 1 month after curative resection were excluded. After applying these criteria, 381 patients histologically classified as ALM were identified. This study was approved by the Institutional Review Board and Ethics Committee of the FUSCC, TMUCIH, and ZJCH. Because unidentified health data of patients were used, informed consent was waived by all the institutes mentioned above. All procedures in our study were to the corresponding guidelines and regulations.

2.2 Nomogram construction and validation

Eligible cases from the SEER database were randomly allocated in a 7:3 ratio to the training cohort (n = 1181) or internal validation cohort (n = 505) by using the createDataPartition function in the caret package. In the training group, univariate and multivariate Cox proportional hazards regression models were performed on each variable to investigate the correlation between clinicopathologic features and OS/CSS. The independent prognostic factors were assessed in a multivariate Cox proportional hazards regression with the stepwise selection of variables based on the Akaike information criterion (AIC). Then nomograms (stepwise models) were constructed to predict the probability of 2-, 4- and 6-year OS and CSS probabilities after curative resection in stage I-III ALM patients by utilizing the RMS package in R version 4.1.1 (http://www.r-project.org).

The performance of the constructed nomogram was evaluated by discrimination, calibration, and uniformity. The concordance index (C-index) was used to assess the discrimination ability. The C-index value ranges from 0.5 to 1 and is approximately equivalent to the area under the curve (AUC). C-index and AUC over 0.7 indicate that the nomogram is considered to have a moderate ability to stratify patients into different prognosis groups. Furthermore, receiver operating characteristic (ROC) curves and the corresponding AUCs at 2, 4, and 6 years were also generated to estimate the predictive accuracy. Calibration curves were used to graphically present the uniformity between the predicted probabilities of the nomogram and the observed outcome frequencies. The closer the predicted calibration curve is to the standard curve, the better the predictive ability of the nomogram.

2.3 Statistical analyses

While qualitative variables were described as frequencies and percentages, quantitative variables were described in terms of means and standard deviations (SD). Pearson’s chi-squared test and Student’s t-test were utilized to identify corresponding independent prognostic factors and calculate HRs and 95% CIs from categorical variables and continuous variables. The correlation between clinicopathologic features and prognosis index (OS and CSS) was evaluated by Cox proportional hazards regression models. Derived from the stepwise multivariate analysis, selected risk factors were integrated into the nomogram to predict the probability of 2-, 4- and 6-year OS and CSS probabilities after curative resection in stage I-III ALM patients. All statistical analyses were performed using R software (version 4.1.1, https://www.r-project.org/). The R packages used in this study include caret, survival, survival ROC rms, foreign, survival, nomogramEX, dplyr, and pec. A two-tailed P-value of less than 0.05 was considered statistically significant.

3 Results

3.1 Patient characteristics

A total of 1686 ALM patients were extracted from the SEER database from 2004 to 2015 and were randomly allocated in a 7:3 ratio to the training cohort (n = 1181) or internal validation cohort (n = 505). In addition, 381 ALM patients were enrolled as the external validation cohort from the multicenter data (including FUSCC, TMUCIH, and ZJCH) from 2009 to 2017. The flow chart is shown in Fig. 1. In the training cohort, 608 (51.48%) patients were female, and 767 (64.94%) patients were aged ≥ 60 years. In the internal validation cohort, 261 (51.68%) patients were female, and 324 (64.16%) patients were aged ≥ 60 years. In the external validation cohort, 196 (51.44%) patients were female, and 237 (62.20%) patients were aged ≥ 60 years. The demographics and clinicopathologic features of the training, internal, and external validation cohort are presented in Table 1.

Fig. 1
figure 1

Workflow of the present study

Table 1 Demographics and clinicopathologic characteristics of the training and validation cohorts

3.2 Prognostic risk factors in the training cohort

The univariate analysis indicated that age, gender, and stage were significantly associated with OS (p-value < 0.05), while age, gender, and stage were statistically associated with CSS. Based on the stepwise multivariate COX regression analysis, the following four independent risk factors were selected for the multivariate analysis by using a Cox proportional hazards regression model: age (≥ 60: HR 2.92, 95% CI 2.33–3.67 P < 0.0001), gender (male: HR 1.61, 95% CI 1.35–1.92, P < 0.0001), race (black: HR 1.37, 95% CI 1.02–1.84, P = 0.0388), and stage (IB: HR 1.99, 95% CI 1.46- 2.73, P < 0.0001; IIA: HR 2.03, 95% CI 1.42–2.89, P = 0.0001; IIB: HR 3.21, 95% CI 2.28–4.53, P < 0.0001; IIC: HR 3.46, 95% CI 2.39–5.03, P < 0.0001; IIIA: HR 3.54, 95% CI 2.29–5.45, P < 0.0001; IIIB: HR 4.49, 95% CI 3.19–6.31, P < 0.0001 and IIIC: HR 8.29, 95% CI 5.73–11.98, P < 0.0001). and age (≥ 60: HR 1.93, 95% CI 1.46–2.56, P < 0.0001), gender (male: HR 1.70, 95% CI 1.33–2.18, P < 0.0001), race (black: HR 1.22, 95% CI 0.81–1.83; P = 0.3356), site (lower: HR 1.57, 95% CI 1.12–2.21, P < 0.0001) and stage (IB: HR 3.17, 95% CI 1.68–6.01, P = 0.0004; IIA: HR 3.98, 95% CI 2.03–7.82, P = 0.0001; IIB: HR 8.11, 95% CI 4.28–15.37, P < 0.0001; IIC: HR 9.53, 95% CI 4.93–18.42, P < 0.0001; IIIA: HR 13.40, 95% CI 6.88–26.08, P < 0.0001; IIIB: HR 15.91, 95% CI 8.63–29.35, P < 0.0001 and IIIC: HR 31.74, 95% CI 16.98–59.36, P < 0.0001) were included into the Cox model (stepwise model) for CSS. The results of the univariate and multivariate analyses are presented in Tables 2 and 3.

Table 2 Selected risk factors in the training cohort for establishing the nomogram (OS) by univariate and multivariate Cox regression analysis
Table 3 Selected risk factors in the training cohort for establishing the nomogram (CSS) by univariate and multivariate Cox regression analysis

Since SEER data did not contain detailed information on treatments upon recurrence and adjuvant therapy, treatments were not incorporated into the nomogram. To assess the impact of study years on survival, we analyzed patient survival from two time periods (2004–2008 vs. 2011–2015) using Kaplan–Meier curves (Supplementary Fig. 1) and found that the survival of patients in the later period was better than that in the earlier period (prior to the widespread adoption of many effective adjuvant therapies) but the difference was not statistically significant (p = 0.29).

3.3 Prognostic nomogram for predicting OS and CSS

The prognostic nomogram to predict the 2-, 4-, and 6-year OS (Fig. 2A) and CSS (Fig. 2B) were constructed based on the above risk factors. By drawing a straight line upward to the points axis, the specific score of each corresponding independent risk factor can be obtained. The total points present the sum of the score of each factor. The corresponding predicted probability of the 2-, 4- and 6-year OS and CSS can be achieved by drawing a straight line down from the total points axis to the survival probability axes. Taking the nomogram of OS as an example, a 60-year-old (51 points) Asian (0 points) male (23 points) patient with stage IIB (56 points) ALM has a sum-point of OS equal to 130, corresponding to a predicted 2-, 4- and 6-year OS of 81%, 62%, and 48%, respectively.

Fig. 2
figure 2

Prognostic nomogram of ALM patients' 2-, 4- and 6-year OS (A) and CSS (B). By drawing a straight line upward to the points axis, the specific score of each corresponding independent risk factor can be obtained. The total points present the sum of the score of each factor. The corresponding predicted probability of the 2-, 4- and 6-year OS or CSS can be achieved by drawing a straight line down from the total points axis to the survival probability axes. Age measurement unit: year

3.4 Calibration and discrimination of the nomogram

The calibration curves show a favorable consistency between the nomogram predictions and the actual observed outcomes of the 2-, 4- and 6-year OS and CSS in the training and internal validation cohorts (Fig. 3). As indicated by the C-index values, the nomogram of OS presents good prognostic accuracy and clinical applicability (training cohort: 0.727; internal validation cohort: 0.722; external validation cohort: 0.682), which correspond to the AUC values in the ROC curve analysis (0.75, 0.77 and 0.78 in the training cohort; 0.79, 0.76 and 0.75 in the internal validation cohort at 2, 4 and 6 years respectively). For the nomogram of CSS, it achieved a favorable efficacy with a C- index of 0.791 and 0.774 in the training and internal validation cohort, respectively. The AUC of 2-,4-, and 6-year CSS were 0.84, 0.83, and 0.84 in the training set, and 0.84, 0.82, and 0.80 in the internal validation set, respectively (Fig. 4).

Fig. 3
figure 3

Calibration curves predicting the 2-, 4- and 6-year OS and CSS of patients in the training cohort (A, C), and internal validation cohort (B, D). The x-axis indicates the predicted survival probability and the y-axis indicates the actual survival probability at 2, 4, or 6 years. The blue line indicates that the prediction agrees with actuality. OS, overall survival; CSS: cancer-specific survival

Fig. 4
figure 4

ROC curves and AUCs at 2, 4, and 6 years OS and CSS in the training cohort (A, B) and the internal validation cohort (C, D) were used to estimate the prognostic accuracy of the nomogram

4 Performance of the nomogram in external validation cohorts

The nomogram of OS was then externally validated in a multicenter database. The C-index of the external validation set was 0.657. The ROC curves for predicting the 2-, 4-, and 6-year OS of stage I-III ALM patients are shown in Fig. 5A, with the AUCs of 0.67, 0.72, and 0.65, indicating that the nomogram has good discrimination. In addition, the calibration curve of the external validation cohort was shown in Fig. 5B, suggesting good consistency between the predicted situation and the actual survival of patients.

Fig. 5
figure 5

ROC curves (A) and calibration curves (B) at 2, 4, and 6 years in the external validation cohort were used to estimate the predicted performance of the nomogram

5 Discussion

ALM is found on peripheral body parts such as soles, palms, and nail beds. The incidence of melanoma is highest in white Caucasian populations, while ALM accounts for less than 10% of the cases and is one of the least common cutaneous melanoma histological subtypes [17]. Nevertheless, in populations with darker and more heavily pigmented skin where melanoma incidence rates are lower, ALM takes up to 50% of all cases with melanoma [17]. Since the etiology of ALM is largely unrelated to intermittent sun exposure, the actual incidence rates of ALM are similar for all ethnic groups [18, 19]. To date, the specific pathogenesis of ALM remains largely unknown, but mechanical injury or stress to the site may play an important role [20, 21]. Whole-genome sequencing studies have shown that ALM development is driven by structural changes and mutation signatures of unknown etiology [19].

Compared with other histological subtypes of cutaneous melanoma, ALM is more aggressive and mainly detected at a relatively advanced stage with a worse prognosis [17, 20], mainly due to late diagnosis, genetic features as well as socioeconomic factors [22,1), consistent with nomogram findings. In addition, determining the exact cause of death can be challenging, particularly in retrospective studies. Our external validation set lacks CSS data and therefore cannot validate the nomogram for CSS. Additional validation on larger and more comprehensive datasets is warranted. Finally, due to the small sample size of the external validation cohort, it is imperative to obtain additional validation cohorts with a larger sample size for the nomogram. A future model based on a larger sample size of multicenter data incorporating other clinical information progressively in the future may facilitate the prognostic value of this model.

6 Conclusions

We developed and validated nomograms to provide convenient and reliable tools for prognosis prediction for ALM patients. More future prospective studies are warranted to confirm this result.