Introduction

Head and neck squamous cell carcinoma (HNSCC) originates from various subsites of the upper aerodigestive tract, including the oral cavity, sinonasal cavity, pharynx, and larynx [1, 2]. Smoking, alcohol consumption, exposure to environmental pollutants, and infection with HPV and EB viruses are the primary risk factors for HNSCC [...]

GO and KEGG analysis

Pearson correlation analysis was performed using R (version 4.3.2) to assess the correlation between CTHRC1 expression and the expression of other genes. Subsequently, the top 500 correlated genes were selected and imported into the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (https://david.ncifcrf.gov). DAVID was used for GO analysis, covering biological process (BP), cellular component (CC), and molecular function (MF) terms. KEGG analysis was also conducted using these genes. The results of the analysis were visualized using the “ggplot2” package.
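The gene-selection step can be illustrated with a minimal sketch. It is written in Python rather than the R used in this study, and it assumes that genes are ranked by the absolute value of the Pearson coefficient, which the text does not specify. The function names and the toy expression values are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # constant vector: correlation is undefined, treated as 0 here
    return sxy / math.sqrt(sxx * syy)

def top_coexpressed(expr, target, n_top=500):
    """Rank every gene except `target` by |r| (an assumption) and keep n_top."""
    ref = expr[target]
    scored = [(gene, pearson(ref, values))
              for gene, values in expr.items() if gene != target]
    scored.sort(key=lambda item: abs(item[1]), reverse=True)
    return scored[:n_top]

# Toy matrix with hypothetical values: gene -> expression across four samples
expr = {
    "CTHRC1": [1.0, 2.0, 3.0, 4.0],
    "GENE_A": [2.0, 4.0, 6.0, 8.0],  # rises with CTHRC1
    "GENE_B": [4.0, 3.0, 2.0, 1.0],  # falls as CTHRC1 rises
    "GENE_C": [1.0, 3.0, 2.0, 4.0],  # weaker positive association
}
ranked = top_coexpressed(expr, "CTHRC1", n_top=2)
```

In a real analysis the resulting gene list would be the input submitted to DAVID.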

GSVA analysis

The associated gene sets were obtained from “AmiGO2” (https://amigo.geneontology.org/). The functional enrichment score for each sample was computed using the “GSVA” R package with default parameters. The results were visualized through a heatmap, and Pearson correlation analysis was employed to assess the correlation between CTHRC1 expression and the enrichment scores of the specified gene sets.

Construction of protein–protein interaction network

To investigate the interactions among related proteins, we constructed a protein-protein interaction (PPI) network using the STRING database (https://string-db.org/). A minimum required interaction score of 0.4 was set. The PPI network was visualized using Cytoscape software. Subsequently, GO and KEGG analyses were performed on the genes encoding these proteins, following the method described above.
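The effect of the score threshold can be sketched as follows. This is a Python illustration, not the STRING or Cytoscape workflow itself; it assumes interaction scores on a 0 to 1 scale, and the edge list and its scores are hypothetical.

```python
def filter_edges(edges, min_score=0.4):
    """Keep only interactions whose score reaches the minimum required score."""
    return [(a, b, score) for a, b, score in edges if score >= min_score]

def partners(edges, protein):
    """Return the sorted set of proteins linked to `protein` in the edge list."""
    linked = set()
    for a, b, _ in edges:
        if a == protein:
            linked.add(b)
        elif b == protein:
            linked.add(a)
    return sorted(linked)

# Hypothetical edges: (protein A, protein B, interaction score)
edges = [
    ("CTHRC1", "POSTN", 0.72),
    ("CTHRC1", "WNT5A", 0.55),
    ("CTHRC1", "GENE_X", 0.21),  # below the threshold, dropped
    ("POSTN", "COL1A1", 0.90),
]
kept = filter_edges(edges, min_score=0.4)
cthrc1_partners = partners(kept, "CTHRC1")
```

Raising `min_score` trades network size for confidence in each retained edge.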

Immune checkpoint testing

A total of 79 immune checkpoint genes were retrieved from a previous study [39]. Pearson correlation analysis was employed to examine the correlations between CTHRC1 expression and these checkpoints. A P value of less than 0.05 was considered statistically significant. Subsequently, a doughnut diagram was generated in R with default parameters to visualize the results.
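The classification of checkpoints by sign and significance can be sketched in Python. The P value here comes from a normal approximation based on the Fisher z-transform, which is an assumption made to keep the example dependency-free; it is not necessarily the test used in this study. The gene labels, coefficients, and sample size are hypothetical.

```python
import math
from statistics import NormalDist

def approx_p_value(r, n):
    """Approximate two-sided P value for H0: rho = 0 (Fisher z, normal approximation)."""
    if abs(r) >= 1.0:
        return 0.0
    z = math.atanh(r) * math.sqrt(n - 3)
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))

def classify(correlations, n, alpha=0.05):
    """Split genes into positively and negatively correlated sets with P < alpha."""
    positive, negative = [], []
    for gene, r in correlations.items():
        if approx_p_value(r, n) < alpha:
            (positive if r > 0 else negative).append(gene)
    return sorted(positive), sorted(negative)

# Hypothetical coefficients for illustration; n is an assumed sample size
correlations = {"CHK_1": 0.45, "CHK_2": -0.30, "CHK_3": 0.02, "CHK_4": -0.01}
positive, negative = classify(correlations, n=500)
```

With a sample of this size, even modest coefficients reach significance, so the sign and magnitude of R carry more information than the P value alone.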

Tumor infiltrating immune cell analysis

Tumor infiltrating immune cell analysis was performed using Cell-type Identification By Estimating Relative Subsets Of RNA Transcripts (CIBERSORT) to characterize the immune features of HNSCC. Immune scores were calculated using the Estimation of Stromal and Immune Cells in Malignant Tumor Tissues using Expression Data (ESTIMATE) algorithm to assess the presence of immune cells. Also, the correlations between CTHRC1 expression levels and the level of immune infiltration were estimated using TIMER, a comprehensive resource for analyzing immune infiltrates in gene expression profiles.

Results

CTHRC1 expression in normal tissues and tumors

TIMER indicated a significant increase in the expression of CTHRC1 across various tumors, particularly in digestive tract tumors (Fig. 1A). For HNSCC, 1815 genes were up-regulated, while 1947 genes were down-regulated. CTHRC1 ranked 89th in the up-regulated genes. Compared with the adjacent normal tissues, the expression of CTHRC1 was significantly increased in the tumor tissues with a LogFC value of 3.5 (Fig. 1B). Comparable results were observed in the GEPIA database and HPA database (Fig. 1C and E). The results were also confirmed in the GEO database (Fig. 1D).

Fig. 1

Expression analysis of CTHRC1 in different databases. A Expression of CTHRC1 in different types of human cancers in TIMER database. B Volcano plot of differentially expressed genes between tumor and normal tissue in HNSCC. C Expression of CTHRC1 in HNSCC from GEPIA database. D Expression of CTHRC1 in HNSCC, dysplasia, and normal tissue in GSE30784. E Expression of CTHRC1 in tumor and normal tissue in HPA database. (Normal: Female, age 59, ID4109. Tumor: Female, age 55, ID4420)

Higher expression of CTHRC1 was found in advanced-stage HNSCC and was related to poor prognosis

The heatmap illustrates the distribution of clinicopathological features and survival status among patients ordered by increasing CTHRC1 expression in TCGA (Fig. 2). No significant differences were observed among different age groups, genders, M stages and races. CTHRC1 expression was significantly upregulated in the clinical T4 and N3 groups (Fig. 3A and B). Further analysis revealed marked variation in CTHRC1 mRNA expression among patients with HNSCC at different clinical stages. The expression of CTHRC1 was significantly higher in clinical stage IV, whereas there was no significant difference among stages I, II, and III (Fig. 3C). Regarding pathological staging, CTHRC1 expression was higher in the T4 and N2&N3 groups (Fig. 3D and E). Furthermore, higher expression of CTHRC1 was observed in pathological stage IV (Fig. 3F). Regarding tissue or organ of origin, the CTHRC1 expression level was highest in overlapping lesions of the lip, oral cavity, and pharynx, and lowest in the tonsils; however, the differences were not statistically significant (Fig. 3G).

The Kaplan-Meier Plotter was employed for survival analysis. To assess the overall survival (OS) of patients in TCGA, 270 cases were classified into the high expression group and 251 cases into the low expression group using maximally selected rank statistics. Kaplan-Meier curves revealed that high CTHRC1 expression was associated with poorer overall survival (P = 0.0017). The median overall survival of the high and low CTHRC1 expression groups was 34.6 and 77.3 months, respectively (Fig. 3H). These results were validated in the GEO database (GSE41613), where the high expression group exhibited worse survival than the low expression group (P = 0.043). The median survival was 43.3 for the high group and 78.3 for the low group (Fig. 3I).
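How a median overall survival is read from a Kaplan-Meier curve can be shown with a minimal Python sketch of the product-limit estimator. It is an illustration only, not the Kaplan-Meier Plotter workflow; the function names are hypothetical and the follow-up times are toy values, not study data.

```python
def kaplan_meier(times, events):
    """Product-limit estimate: list of (time, survival) at each time with a death.

    events: 1 = death observed, 0 = censored.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Collect every subject whose follow-up ends at time t
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths > 0:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve

def median_survival(curve):
    """First time at which estimated survival drops to 0.5 or below, else None."""
    for t, s in curve:
        if s <= 0.5:
            return t
    return None

# Toy follow-up data in months (hypothetical)
high_curve = kaplan_meier([5, 10, 15, 20, 25, 30], [1, 1, 0, 1, 1, 0])
low_curve = kaplan_meier([12, 24, 36, 48, 60, 72], [0, 1, 0, 0, 1, 0])
median_high = median_survival(high_curve)
median_low = median_survival(low_curve)
```

Censored patients leave the risk set without lowering the curve, which is why the median can differ from the simple median of follow-up times.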

Fig. 2

Landscape of CTHRC1-related clinicopathological features in HNSCC from TCGA

Fig. 3

High expression of CTHRC1 is observed in late-stage HNSCC and predicts a poor prognosis. A-F CTHRC1 expression across different stages in HNSCC. G CTHRC1 expression across different organs or tissues. H-I Prognostic impact of CTHRC1: Kaplan-Meier survival curves in TCGA and GSE41613. Statistical significance was determined using either an unpaired t-test or one-way analysis of variance (ANOVA)

GO, KEGG and GSVA analysis

The top 500 co-expressed genes were used for GO and KEGG analysis. The GO analysis revealed that these genes were enriched in terms such as extracellular matrix organization (BP: GO:0030198), collagen fibril organization (BP: GO:0030199), extracellular matrix (CC: GO:0031012), extracellular region (CC: GO:0005576), extracellular matrix structural constituent (MF: GO:0005201), and collagen binding (MF: GO:0005518) (Fig. 4A-C). Additionally, the KEGG analysis revealed enrichment in pathways including ECM-receptor interaction, focal adhesion, protein digestion and absorption, PI3K-Akt signaling pathway, human papillomavirus infection, and hypertrophic cardiomyopathy (Fig. 4D). Furthermore, GSVA analysis was conducted to reveal the physiological processes in which CTHRC1 might be involved. The results indicated that CTHRC1 might be associated with the formation and organization of collagen and fibril, extracellular matrix, and basement membrane (Fig. 4E).

Fig. 4

CTHRC1 is closely associated with ECM-related processes. A-C Biological processes (BP), cellular components (CC), and molecular functions (MF) associated with CTHRC1 in the TCGA database. D Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of CTHRC1 in TCGA. E Correlation between CTHRC1 expression and enrichment scores of ECM-related pathways by GSVA

PPI network construction

The protein-protein interaction network for CTHRC1 was constructed using the STRING database. Thirty genes were found to be closely related to CTHRC1, many of which are involved in cancer biology, including TGFBR1, TGFBR2, TGFB1, MMP2, MMP14, TIMP1, TIM3, WNT5A, ENG, COL1A1, COL1A2, and POSTN (Fig. 5A). To explore the biological roles and pathways of the related genes, GO and KEGG analyses were performed. Similar to CTHRC1, the genes encoding these proteins were closely related to the formation and organization of collagen and fibril, extracellular matrix, and basement membrane. In addition, the KEGG results indicated that these genes have been linked to many cancers and that many classical tumor-related pathways might be involved (Fig. 5B-E).

Fig. 5

The protein-protein interaction network for CTHRC1. A Protein-protein interaction network constructed by STRING. B-E Biological processes (BP), cellular components (CC), molecular functions (MF) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of genes encoding these proteins

Relationship between CTHRC1 and immune checkpoint genes

We explored the relationship between the expression of CTHRC1 and immune checkpoint genes using Pearson correlation analysis. Based on the previous literature, 79 immune checkpoint genes were selected. Of these, 19 genes were positively correlated with CTHRC1 expression in HNSCC (R > 0, P < 0.05), while 24 genes were negatively correlated (R < 0, P < 0.05) (Table 1). The top 6 checkpoints were TNFSF4 (R = 0.474, P = 1.25 × 10^-30), CD276 (R = 0.379, P = 2.94 × 10^-19), CEACAM1 (R = -0.365, P = 7.36 × 10^-18), KIR2DL4 (R = -0.365, P = 7.36 × 10^-18), HLA-DOB (R = -0.241, P = 2.32 × 10^-8), and TNFSF18 (R = -0.208, P = 1.70 × 10^-6) (Fig. 6).

Fig. 6

Pearson correlation between CTHRC1 and immune checkpoints. The width of the band represents the R-value. Red: positive correlation, Green: negative correlation

Table 1 Pearson correlation between CTHRC1 and immune checkpoints

CTHRC1 expression and immune cell infiltration

In this study, we analyzed immune cell infiltration features using CIBERSORT, TIMER, and ESTIMATE. The CIBERSORT results revealed that 12 of the 22 immune cell types differed in abundance between the CTHRC1 high and low expression groups in TCGA, including plasma cells, T cells CD8, T cells CD4 memory activated, T cells follicular helper, NK cells activated, Monocytes, Macrophages (M0, M1, M2), Dendritic cells resting, Dendritic cells activated, and Neutrophils (Fig. 7A). In GSE30784, only 4 immune cell types differed between the groups (NK cells activated, Macrophages M0, Macrophages M1 and Macrophages M2) (Fig. 7B). Similar findings were observed in the TIMER database, with the exception of CD8 cells, which showed no significant association with CTHRC1 expression (Fig. 7C). ESTIMATE indicated that both the Stromal Score and the ESTIMATE Score correlated with CTHRC1 expression. However, no correlation was found between CTHRC1 expression and the Immune Score (Fig. 7D). Since the ESTIMATE Score is negatively correlated with tumor purity, we infer that tumor purity is negatively correlated with CTHRC1 expression. This inference contradicted the TIMER results, which showed no correlation between CTHRC1 expression and tumor purity.
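The link between the ESTIMATE Score and tumor purity can be made concrete with a short Python sketch. It assumes the cosine mapping published with the ESTIMATE method, which was derived for Affymetrix microarray data and may not transfer directly to the RNA-seq data analyzed here; the constants are quoted from that publication as best recalled and should be checked against it before reuse. The function names and input scores are hypothetical.

```python
import math

def estimate_score(stromal_score, immune_score):
    """ESTIMATE Score as the sum of the stromal and immune scores."""
    return stromal_score + immune_score

def tumor_purity(score):
    """Purity from the ESTIMATE Score via the published cosine mapping (assumed constants)."""
    return math.cos(0.6049872018 + 0.0001467884 * score)

# Hypothetical scores: a stroma-rich sample and a stroma-poor sample
rich = estimate_score(stromal_score=1500.0, immune_score=500.0)
poor = estimate_score(stromal_score=-1500.0, immune_score=500.0)
purity_rich = tumor_purity(rich)
purity_poor = tumor_purity(poor)
```

Because the cosine is decreasing over the range of typical scores, a higher Stromal Score at a fixed Immune Score lowers the estimated purity, which is the direction of the inference drawn above.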

Fig. 7

Immune infiltration analysis of CTHRC1 expression in HNSCC. A Infiltration of different immune cells in CTHRC1 high expression group and low expression group analyzed by CIBERSORT in TCGA. B Infiltration of different immune cells in CTHRC1 high expression group and low expression group in GSE30784. C Correlation of CTHRC1 expression with immune infiltration levels in TIMER. D Pearson correlation between CTHRC1 and immune score using ESTIMATE

Discussion

HNSCC is one of the most prevalent cancers globally, posing a significant threat to human health. The management of HNSCC varies depending on the stage of the disease, anatomical location, and surgical accessibility. In the early stages (stage I and stage II), smaller tumors may be resected directly. Diode lasers have gradually been adopted in surgical treatment, enhancing operational efficiency and yielding good aesthetic and functional outcomes [40, 41]. In advanced stages (stage III and stage IV), surgery remains the primary method, with removal of the primary lesion accompanied by neck lymph node dissection. Postoperative radiotherapy or chemotherapy may be administered subsequently [42, 43]. In recent years, targeted therapies, exemplified by EGFR inhibitors, and immunotherapies, such as pembrolizumab, have shown considerable potential in the treatment of advanced HNSCC [26, 44].

The aberrant expression of CTHRC1 has been documented in various cancers, including breast cancer, stomach adenocarcinoma, non-small cell lung cancer, and colon adenocarcinoma [19,20,21,22]. Understanding the expression pattern of CTHRC1 in tumors will help clarify tumor pathogenesis. Experimental research on HNSCC has indicated that N-glycosylation collaborates with the canonical Wnt pathway to induce CTHRC1 and drive HNSCC cell migration [45]. High expression of CTHRC1 may be associated with HNSCC progression and poor prognosis, helping to identify high-risk individuals. Detecting CTHRC1 may aid the early detection and diagnosis of HNSCC and improve treatment outcomes through noninvasive methods. Further research on CTHRC1 will help develop new diagnostic and therapeutic strategies. In the present study, we found that CTHRC1 expression was significantly higher in tumor tissues, consistent with Lee’s study, in which HNSCC samples showed a 12.3-fold higher expression than normal samples [23]. This suggests that elevated CTHRC1 expression may be associated with a higher risk of HNSCC, and that CTHRC1 could serve as a tumor marker to predict the risk of HNSCC occurrence.

Timely diagnosis and early surgery in the initial stages of the tumor are crucial factors affecting the prognosis of HNSCC. Delayed diagnosis and treatment of patients with HNSCC have been reported to worsen prognosis and increase morbidity and mortality [46, 47]. In this study, we observed that CTHRC1 expression tends to be up-regulated in advanced stages of HNSCC. Additionally, the OS of the high CTHRC1 expression group was significantly worse than that of the low expression group. This trend was confirmed in both the TCGA and GSE41613 datasets. Therefore, high expression of CTHRC1 may predict poor prognosis, which is consistent with the findings of Lee’s study [23]. The prognostic value of CTHRC1 has also been found in other tumors, such as colon adenocarcinoma, gastric cancer, breast cancer and kidney renal clear cell carcinoma [20, 48, 49]. Our results indicate that CTHRC1 can also serve as a marker to predict the stage and prognosis of HNSCC.

Prior studies have indicated that the TME is implicated in various pathological processes of tumors, including tumor growth, metastasis, and invasiveness [29, 30]. As a dynamic, interconnected mesh of macromolecules, the ECM is the most abundant component of the TME, offering structural support and regulating cellular behavior through mechanical and biochemical cues [50]. It regulates various cellular processes, including proliferation, differentiation, migration, invasion, and survival in cancer [50, 51]. In this study, genes related to CTHRC1 expression in HNSCC were identified using the TCGA database. POSTN, SPARC, PLPP4, GLT8D2, and THY1 were the five genes most closely co-expressed with CTHRC1, suggesting a robust link between CTHRC1 and ECM-related functions. Subsequently, we explored the physiological function and related pathways using GO, KEGG, and GSVA analyses. The results indicated that CTHRC1 and its co-expressed genes are enriched in ECM-related processes, such as extracellular matrix organization, collagen fibril organization, cell adhesion, and cell matrix adhesion. In a previous study, Zhao found through enrichment analysis that CTHRC1 was related to several pathways in gastric cancer, including extracellular matrix organization and vascular development [20]. A similar result was reported in Meng’s study of colon cancer [48]. Our results are consistent with these studies.

The protein-protein interaction network consists of proteins that interact to participate in various life processes, including biological signal transmission, gene expression regulation, energy and substance metabolism, and cell cycle regulation [52]. In this study, a protein-protein interaction network was constructed, revealing 30 related proteins, including COL3A1, TGFBR1, LTBP1, COL1A2, COL5A1, and POSTN. KEGG analysis showed that, in addition to ECM-related physiological processes, these proteins were also implicated in the initiation and progression of various cancers, as demonstrated in other studies. Zhao indicated that ECM-related genes correlate with immune cells, overall survival, and recurrence of bladder cancer [53]. Keerthi also showed, through a pan-cancer analysis, that matrisome genes affect survival across cancers [54]. Furthermore, these proteins were involved in several classical signaling pathways that play important roles in tumors, such as the Wnt signaling pathway, Hippo signaling pathway, and mTOR signaling pathway [55].

Head and neck cancer inherently exhibits immunosuppression, prompting research into the role of the immune landscape in HNSCC [56]. The advent of immune checkpoint inhibitors represents a noteworthy advance in oncological therapy. Immune checkpoint blockade (ICB) therapies targeting PD-1 and PD-L1 have been approved to treat various malignancies, with some therapeutic effect. Despite these advances, only a small percentage of HNSCC patients respond to ICB, with studies indicating a benefit for only 10–20% of HNSCC patients [26, 27]. Therefore, there is an urgent need to identify the most suitable treatment regimens and to reduce immunosuppression in non-responding patients with head and neck cancer. Immune checkpoint agents exhibit antitumor properties by reversing tumor immunosuppressive effects. This study also investigated the correlation between CTHRC1 expression and the expression of various immune checkpoint genes. Patients with higher CTHRC1 expression exhibited elevated expression of TNFSF4 and CD276. TNFSF4, a cytokine that promotes the activation and proliferation of T cells, is a well-known immune checkpoint. A previous study demonstrated that TNFSF4 could facilitate chemoresistance in lung adenocarcinoma by inhibiting the apoptosis of tumor cells [57]. The cell surface molecule CD276 serves as an immune checkpoint antigen. Elevated expression of CD276 on tumors contributes to the suppression of anti-tumor T-cell responses and correlates with a poor prognosis [58]. Consequently, the results of this study suggest that TNFSF4 and CD276 may be novel targets for HNSCC immunotherapy.

In addition to stromal cells, immune cells constitute crucial components of the TME and play a pivotal role in the onset and progression of tumors. Studies have indicated that the immune infiltration status of tumors is a crucial factor influencing tumor growth and prognosis [59]. In our study, we assessed the tumor’s immune infiltration status using ESTIMATE, TIMER, and CIBERSORT. ESTIMATE calculates immune and stromal scores by analyzing gene expression signatures specific to immune and stromal cells to predict infiltration by non-tumor cells. The results indicated no correlation between CTHRC1 and the immune score; however, CTHRC1 was correlated with the stromal score and may thereby be related to the infiltration of non-tumor cells. Although CTHRC1 was not associated with the overall immune score, CIBERSORT demonstrated a significant association between CTHRC1 and the composition of immune cells in tumors, particularly macrophages. Macrophages can polarize into either M1-like or M2-like macrophages. M1-like macrophages are pro-inflammatory and play a role in host defense by producing cytokines like IL-12 and TNF-α, and promoting Th1 responses. In contrast, M2-like macrophages are anti-inflammatory, involved in tissue repair, and secrete cytokines like IL-10 and TGF-β, promoting Th2 responses [60]. Macrophages in the tumor microenvironment are termed tumor-associated macrophages (TAMs), representing a distinctive phenotype of M2-like macrophages [61, 62]. It has been demonstrated that TAMs within the tumor microenvironment promote tumor growth, angiogenesis, and metastasis by secreting pro-tumorigenic factors and suppressing antitumor immune responses across various cancer types [63]. The study revealed a significant increase in M2 macrophages and a corresponding decrease in M1 macrophages in the CTHRC1 high-expression group. This alteration implies that CTHRC1 may be linked to an imbalance in macrophage proportions, a factor deemed crucial in tumor development, immune evasion, and subsequent metastasis and drug resistance, as has also been reported in other cancers [22, 49, 61, 64]. Taken together, these findings suggest that CTHRC1 might serve as an indicator of HNSCC prognosis through its association with immune infiltration status, particularly in relation to M2 macrophages. Immunotherapy targeting macrophage polarization may offer potential benefits for improving the treatment outcome of HNSCC patients, with CTHRC1 emerging as a promising target. However, further studies are needed to confirm the causal relationship between them.

In this study, we investigated the relationship between CTHRC1 and HNSCC, aiming to elucidate the underlying mechanisms. Nevertheless, this study has certain limitations. All the data utilized in this study were obtained from online databases; the findings lack experimental validation and may be affected by biases such as measurement error and missing data. Also, we did not define a clear cutoff to distinguish high expression from low expression. Additionally, some inconsistent results, such as the disparity between TIMER and ESTIMATE in the assessment of tumor purity, remain unexplained. Therefore, additional analyses and experiments, encompassing in vitro and in vivo validations, are necessary to validate our findings.

Conclusions

High expression of CTHRC1 predicts poor prognosis and is associated with immune infiltration in HNSCC, confirming its utility as a tumor marker for HNSCC.