Introduction

Hepatocellular carcinoma (HCC) is the third leading cause of cancer death worldwide with approximately 830,000 deaths in 2020 [1]. Even though advancements have been made in screening systems and treatment modalities, the prognosis of HCC patients is poor because a high proportion of patients had large and multiple tumors at the initial presentation [2]. The treatment strategies of HCC are stratified according to Barcelona Clinic Liver Cancer (BCLC) stages [3, 4]. Intermediate-stage HCC (BCLC-B) is comprised of patients with multinodular tumor burden, preserved liver function, and good performance status [3, 4]. Transarterial chemoembolization (TACE), which consists of injecting chemotherapeutic agents emulsified with lipiodol into the tumor feeding arteries followed by embolic agents, is the primary recommended therapy in BCLC-B based on two randomized trials [5, 6].

The recent update of the BCLC staging system in 2022 restricts the recommendation for TACE in patients who have well-defined tumor nodules, preserved portal flow, and selective vascular access and are not eligible for liver transplantation [7]. On the other hand, patients with diffuse, infiltrative, and extensive bilobar involvement should be better candidates for systemic therapy as the first-line treatment [7]. Categorization of patients in this subgroup of BCLC-B remains unclear and no cutoff point has been published. Additionally, because of the very heterogeneous nature of this population with a wide range of tumor burdens, it is necessary to subclassify BCLC-B to facilitate more appropriate treatment strategies in clinical practice.

Conventionally, the selection of treatment for HCC relies closely on the size and number of tumor nodules. The six-and-twelve (SAT) score and seven-eleven criteria (SEC) were proposed to assess tumor burden in HCC patients [8, 9]. These two prognostic scores are calculated by combining the largest diameter of the nodules and the number of tumors. The three strata of these scores include the summation of scores of ≤ 6, > 6 up to 12, and > 12 in the SAT score and ≤ 7, > 7 up to 11, and > 11 in the SEC. The higher strata predict lower patient survival outcome. More recently, the tumor burden score in HCC patients who undergo TACE was reported [10, 11]. In contrast to the SAT score and SEC, the tumor burden score integrates both tumor number and tumor size into a single continuous variable and was shown to differentiate the prognosis among HCC patients.

Furthermore, some of the predictive scores need a sophisticated calculation of tumor burden and liver function. The proposed Bolondi’s subclassification system for intermediate HCC, based on the Child–Pugh score, tumor burden, performance status, and presence of portal vein thrombosis, has been applied to Asian and European cohorts [12,13,14,15]. Recently, the hepatoma arterial-embolization prognostic (HAP) score and the modified HAP-II (mHAP-II) score that contain four variables (serum alpha-fetoprotein [AFP], tumor burden, serum albumin, and total serum bilirubin) have also been proposed with significant prognostic performance in selection of optimal candidates for TACE [16, 17].

Recently, the ALBI-TAE model was introduced for accurate prognostication and selection of HCC patients who can benefit the most from TACE [18]. This prognostic model was developed at a large-volume medical center in Taiwan. The study reported that the combination of three factors, which included the up-to-11 criteria, serum AFP, and albumin-bilirubin (ALBI) grade, was an independent predictor to estimate overall survival (OS) in BCLC-B HCC patients undergoing TACE. The four strata of the model include class A (low risk), B (intermediate risk), C (high risk), and D (very high risk). An internal validation was achieved in Taiwanese patients.

To the best of our knowledge, no study has published an external validation of the recent ALBI-TAE model in HCC patients undergoing TACE. Thus, we aimed to validate the ALBI-TAE model in Thai HCC patients with BCLC-B stage and compare this score with several established scoring systems in terms of prognostic power.

Methods

Patient population

This study was carried out in compliance with the ethical principles outlined in the Declaration of Helsinki and was approved by the institutional ethics committee (REC No. 65–269-7–1). The requirement for written informed consent for this study was waived by the Institutional Review Board, and all data were analyzed anonymously. Patients with unresectable HCC who underwent conventional TACE as the first-line treatment between January 2008 and December 2019 were included in this study. HCC was diagnosed based on histopathological results by liver biopsy or radiological characteristics by the assessment of dynamic contrast-enhanced computed tomography (CT) and magnetic resonance imaging (MRI) according to the diagnostic criteria of the American Association for the Study of Liver Diseases guidelines [3]. The inclusion criteria for our study were as follows: (i) age > 18 years; (ii) intermediate-stage HCC (BCLC-B) defined as tumor size > 3 cm, tumor number > 3, or a single tumor > 5 cm without vascular invasion or extrahepatic metastasis; and (iii) Child–Pugh class A and B. The exclusion criteria were (i) initial treatment with tumor resection; (ii) locoregional or systemic therapies before the TACE session; (iii) diffuse infiltrative tumor; (iv) concomitant malignancy; (v) history of spontaneous tumor rupture; and (vi) no evaluation using contrast-enhanced imaging.

Transarterial chemoembolization protocols

All eligible HCC patients underwent TACE using digital subtraction angiography (DSA) guidance (Allura Clarity FD20, Philips Healthcare, Eindhoven, the Netherlands) under the supervision of experienced interventional radiologists through the transfemoral route. A microcatheter (1.7-Fr to 2.4-Fr) was advanced through a 5-Fr catheter inserted into the celiac artery and contrast medium was injected into the common hepatic artery to identify the tumor-feeding arteries. We performed selective catheterization as distal as possible into the tumor-feeding branches. In the next step, we slowly administered a mixture of iodized oil (range 2‒16 mL) (Lipiodol, Guerbet) and doxorubicin hydrochloride (range 5–50 mg) (Adriamycin, Pfizer) or mitomycin (range 10‒20 mg) (Vesimycin, Naprod Life Sciences) followed by embolization of the branches using gelatin sponge particles. We finished the procedure when the tumor-feeding branch was entirely blocked and disappearance of tumor staining from DSA was observed.

Treatment evaluation and follow-up

An imaging study using 4-phase contrast-enhanced CT scan or dynamic MRI was carried out within 1 month after the initial procedure to evaluate the radiological response of the treated tumors according to the modified Response Evaluation Criteria in Solid Tumors (mRECIST) [19]. If no definite evidence of residual or recurrent tumor was seen, imaging investigation was evaluated subsequently at 3-month intervals. The decision to repeat the TACE procedure was carried out on demand based on patterns of tumor recurrence, BCLC staging, and preserved hepatic reserve.

Data collection

The following data were collected: demographic data that included age and sex; clinical history (alcohol consumption, hepatitis B or C virus carriers, diabetes mellitus, presence of ascites, and Child–Pugh class); laboratory data (serum AFP, levels of alanine transaminase, albumin, total bilirubin, and platelet count); tumor factors (size and number of tumors); imaging response within one month after the initial TACE; and complications (post embolization syndrome and liver decompensation). Postembolization syndrome was defined as a set of symptoms that includes nausea, vomiting, abdominal pain, and fever over 38 °C occurring after TACE. The development of liver decompensation was defined as the occurrence of any of the following within 4 weeks after TACE: an increase in serum bilirubin levels of ≥ 2.0 mg/dL or an increase of at least 3 times the baseline or upper limit of normal, the development of new or increasing ascites or encephalopathy. OS was calculated from the date of initial TACE received until death from any cause or the censoring date of December 31, 2021. Survival data were obtained from national statistical data provided by the Thailand civil registration database. The ALBI-TAE model, Bolondi’s subclassification for BCLC-B, HAP score, mHAP-II score, tumor burden score, SAT score, and SEC were calculated as described in the original publications [8,9,10, 12, 16,17,18] (Table 1).

Table 1 Calculations of the seven scoring models to predict TACE response for intermediate-stage HCC patients

Statistical analysis

Baseline characteristics are presented as median with interquartile range (IQR) for skewed distribution or mean ± standard deviation (SD) for normally distributed variables. Categorical variables are presented as frequency with percentages. Prognostic factors affecting survival were evaluated by univariate analysis. Subsequently, all factors with P < 0.20 in the univariate analysis were included in the multivariate analysis. Multivariate Cox regression analysis with backward selection was made to detect independent predictors of survival time. Entry criteria for selection into the final multivariate model was P < 0.05. Survival curves were estimated for each group of ALBI-TAE model using the Kaplan–Meier method and compared statistically using the log rank test. The ALBI-TAE model was validated and compared with the other scores, including Bolondi’s subclassification for BCLC-B [12], HAP score [16], mHAP-II score [17], tumor burden score [10], SAT score [8], and the SEC [9]. Head-to-head comparisons between the prognostic scores were assessed using Harrell’s C-index and prediction error curves based on the integrated Brier score (IBS). A C-Index of 0.5 indicates no predictive power and a C-Index of 1.0 implies perfect predictive ability. The prediction error was made by calculating the IBS over the study interval of 0‒60 months. All statistical analyses were achieved with R software, version 4.2.0 (R Foundation, Vienna, Austria).

Results

Baseline characteristics

The study included 480 patients who met the full set of inclusion and exclusion criteria. Their baseline patient characteristics are displayed in Table 2. The mean age was 62 years, and 72% were men. The main etiology of HCC was hepatitis B virus (54%) followed by hepatitis C virus (21%), and alcohol (17%). The Child–Pugh classes were A (66%) and B (34%). At the initial diagnosis of HCC, the most common largest tumor size was > 5 cm (60%) followed by 3–5 cm (26%). Tumor sizes ranged 1.0–21.7 cm and the mean tumor size ± SD was 7.2 ± 4.6 cm. More than half (55%) of the patients had 2–5 tumor nodules. Median values for serum alanine transaminase and platelet count were 40 (27–61) U/L and 122 × 103/mm3 (76–201), respectively. Most patients had serum AFP ≤ 200 ng/mL (63%). The median (IQR) serum albumin and total bilirubin levels were 3.4 (3.0,3.8) g/dL and 0.83 (0.54,1.39) mg/dL, respectively. Tumor response rates by mRECIST with complete response, partial response, stable disease, and progressive disease were 15% (74 patients), 41% (195 patients), 25% (118 patients), and 19% (93 patients), respectively. In addition, most patients had ALBI-TAE grade B (45%).

Table 2 Baseline patient demographic and clinical characteristics of 480 patients with intermediate-stage HCC who underwent TACE

Analysis of the prognostic factors for overall survival (OS)

In univariate analysis, size of the largest lesion, number of tumors, serum AFP level, serum albumin level, serum total bilirubin level, Child–Pugh class, treatment response to initial TACE, and ALBI-TAE model were associated with poor OS. Multivariate Cox model showed that Child–Pugh class B (hazard ratio [HR] 1.52, 95% CI 1.23–1.87; P < 0.001), treatment response to initial TACE (HR 1.65, 95% CI 1.35–2.01; P < 0.001), and ALBI-TAE model (group B vs. group A [HR 1.59, 95% CI 1.06–2.40: P = 0.026], group C vs. group A [HR 2.54, 95% CI 1.66–3.88; P < 0.001], and group D vs. group A [HR 3.73, 95% CI 2.31–6.01; P < 0.001]) were independently associated with increased mortality in these patients (Table 3).

Table 3 Univariate and multivariate analysis of prognostic factors associated with overall survival

Survival analysis

The median (IQR) OS of the entire cohort was 16.6 months (14.9,18.4 months). The 1-, 3-, and 5-year OS rates were 60%, 22%, and 11% respectively. By stratifying the ALBI-TAE model (groups A, B, C, and D), the median (IQR) OS rates were 40.80 (29.04,105.19), 20.14 (17.61,23.85), 10.58 (9.17,14.13), and 7.54 (4.37,9.33) months (Fig. 1). The 1-, 3-, and 5-year OS rates were 91%, 51%, and 40%, respectively, for ALBI-TAE A; 72%, 26%, and 12%, respectively, for ALBI-TAE B; 48%, 14%, and 5%, respectively, for ALBI-TAE C; and 27%, 9%, and 4%, respectively, for ALBI-TAE D. There were significant survival differences between the four groups (P < 0.001).

Fig. 1
figure 1

Kaplan–Meier curves of overall survival among intermediate-stage HCC patients who underwent TACE stratified by the ALBI-TAE model

Comparing the performances of the ALBI-TAE model and other scores .

Table 4 provides a detailed overview of the head-to-head comparison. Among these seven scores, the ALBI-TAE model had the highest C-index of 0.633, which suggested a better prognostic performance to discriminate survival in TACE patients, followed by the HAP score (0.629), mHAP-II score (0.624), SEC (0.578), SAT score (0.574), Bolondi’s subclassification (0.570), and tumor burden score (0.546). The IBS values for the study interval (0–60 months) were 0.152 for the ALBI-TAE model, 0.153 for the mHAP-II score, and 0.154 for the HAP score. Based on the Kaplan Meier estimates of the unstratified sample, the reference IBS was 0.169. The prediction error curves based on the IBS are shown in Fig. 2.

Table 4 Head-to-head comparison of the performance and discriminative ability of the seven models in predicting survival for intermediate-stage HCC patients who underwent TACE
Fig. 2
figure 2

Predictive error curve and integrated Brier score (IBS) for Kaplan–Meier estimates based on the ALBI-TAE model (blue), mHAP-II score (green), HAP score (red), seven-eleven-criteria (gray), six-and-twelve score (pink), Bolondi’s subclassification (purple), tumor burden score (orange), and compared with the unstratified sample (black)

Treatment response and complications stratified by the ALBI-TAE model

Treatment response and complications after TACE were evaluated based on the ALBI-TAE model (Table 5). Among the 480 patients, 269 responded well to TACE, while 211 had a poor response. Compared to ALBI-TAE group A, the chance of TACE response was significantly lower in ALBI-TAE group C (adjusted OR 0.14, 95% CI 0.06–0.34) and ALBI-TAE group D (adjusted OR 0.04, 95% CI 0.01–0.12). Patients in ALBI-TAE group B had a numerically lower chance of TACE response compared to patients in ALBI-TAE group A, although not statistically significant (adjusted OR 0.44, 95% CI 0.19–1.04). Postembolization syndrome was more common in ALBI-TAE groups B (19%), C (31%), and D (59%) compared to patients in ALBI-TAE group A (11%) with significant differences (P < 0.001). Moreover, ALBI-TAE group A had no incidence of liver decompensation, while groups B, C, and D had rates of 5%, 9%, and 29%, respectively. These differences between the groups were significant (P < 0.001).

Table 5 Treatment response and complications of intermediate-stage HCC patients who underwent TACE, stratified by the ALBI-TAE model

Discussion

Since BCLC-B HCC patients are heterogeneous in terms of tumor burden and hepatic reserve function, it remains challenging to make treatment decisions in these patients [7, 12]. We performed a head-to-head-comparison of the ALBI-TAE model and the other six scoring systems (Bolondi’s subclassification, HAP score, mHAP-II score, tumor burden score, SAT score, and SEC) in predicting HCC prognosis. We demonstrated that the ALBI-TAE model had the most predictive performance (Harrell’s C-index 0.633) in differentiating OS in BCLC-B HCC patients who underwent TACE. Moreover, the ALBI-TAE model was identified as an independent prognostic factor for survival outcome in multivariate analysis.

Compared with the other scoring systems based on tumor size and tumor number, the ALBI-TAE model, which consists of up-to-11 criteria, ALBI grade, and serum AFP level, offered a significantly higher C-index (0.633) than the SEC (0.578), SAT score (0.574), and tumor burden score (0.546). Our results confirmed the idea that tumor burden alone was insufficient to establish clear cut treatment decisions because HCC development was associated with liver cirrhosis in more than 80% of patients [4]. Most patients have two diseases that include HCC and liver cirrhosis and both impact an impaired prognosis. Currently, TACE is suggested for HCC patients with Child–Pugh class A and those with highly selected Child–Pugh class B cirrhosis [20]. As a result, a suitable selection of patients with BCLC-B HCC patients undergoing TACE remains challenging owing to different patient outcomes.

To address this issue, several scoring systems calculate the tumor burden and liver function, e.g., Bolondi’s subclassification, HAP score, and mHAP-II score [12, 16, 17]. However, our study illustrated that the ALBI-TAE model had the highest C-index among these scoring systems and represented better prognostication to differentiate survival outcome in BCLC-B HCC patients who underwent TACE. We demonstrated that the very high-risk patients in ALBI-TAE group D had a significantly higher risk of death than the high-, intermediate- and low-risk patients, which were ALBI-TAE groups C, B, and A, respectively. Therefore, the ALBI-TAE model can serve as a guide in the selection of optimal candidates for TACE.

Patients with high-risk (ALBI-TAE group C) and very high-risk (ALBI-TAE group D) hepatocellular carcinoma (HCC) have a median overall survival (OS) of 8–11 months, which is similar to patients with BCLC-C HCC receiving sorafenib treatment [21]. Furthermore, the response to TACE treatment is notably lower in ALBI-TAE group C (43%) and D (18%), respectively, and there is a high incidence of liver decompensation (9% and 29% in group C and D, respectively). Therefore, caution is advised when considering TACE as a treatment option for these patients. Instead, other alternative treatments such as radioembolization, targeted drugs, or immunotherapy should be considered rather than adhering to TACE until the disease progresses. These options may provide additional benefits and have the potential to improve outcomes for these patients. On the other hand, in the low-risk ALBI-TAE group A patients with an expected OS of 40.8 months, superselective TACE with achievement of portal vein visualization and circumferential safety margin is recommended to minimize local tumor recurrence and prolong survival outcome [22, 23]. However, if patients are eligible, other treatment modalities with a curative purpose, such as liver transplantation, resection, or ablation, should be done to reach a prognosis outcome comparable to BCLC-A patients. In addition, in the ALBI-TAE group B patients with an expected median OS of 20.1 months, TACE remains mandatory. These findings are consistent with a previous large systematic review that included 10,108 patients who underwent TACE [24].

The overall median OS results in our study were lower than the original Taiwan study (16.6 vs. 21.3 months) [18]. Also, by stratifying the ALBI-TAE model into groups A, B, C, and D, the median OS results in our study were lower than the original study with the exception of group D (our study: 40.8, 20.1, 10.6, and 7.5 months vs. original study: 65.9, 30.2, 17.4, and 6.0 months, respectively) [18]. The lower survival rates in our study were possibly due to the higher tumor burden and a lower proportion of Child–Pugh class A patients. The mean ± SD tumor size of the eligible patients in our study was 7.2 ± 4.6 cm compared with 6.7 ± 3.7 cm in the original Taiwan study. Notably, 60% of the patients in our study had large tumors > 5 cm. Moreover, a lower ratio of Child–Pugh class A patients were observed in our study in contrast to the original study (66% vs. 86%), which possibly led to the lower survival rates.

This study has a few noteworthy strengths. First, to the best of our knowledge, this was the first external validation study of the ALBI-TAE model in predicting the outcome of BCLC-B HCC patients undergoing TACE. Second, this study is a real-world evaluation in a heterogeneous population with various etiologies of HCC, i.e. hepatitis B, C, and alcohol, and relatively high tumor burdens. Third, our cohort compared seven scoring systems in patients who underwent conventional TACE in the entire cohort, which may be a potential benefit of this study.

Our study has a few notable limitations. First, this was a retrospective cohort study at a single medical center in Southeast Asia that should be validated in a prospective trial. Second, we did not analyze the expertise of the operators or the level of selective catheterization that might impact the survival outcome. Third, although the radiological response evaluation was recognized as an independent prognostic factor for survival in the multivariate analysis, it was not employed in the scoring system. The purpose of our study was to compare the ALBI-TAE model with six other scoring systems for efficient guidance in patient selection for TACE at the time of diagnosis before initiation of treatment.

Conclusions

We demonstrated that the ALBI-TAE model displayed excellent differentiation in OS in patients who underwent TACE. Among the seven scoring systems, the ALBI-TAE model had the best predictive ability with the highest Harrell’s C-index in prognostication of survival outcome. The ALBI-TAE model is a simple and valuable prognostic tool to identify BCLC-B HCC patients who are suitable to receive TACE as the initial treatment.