FormalPara Key Points for Decision Makers

This is the first model to incorporate comorbidities and other important considerations for individuals living with NASH.

The model provides a standardized framework that could provide the basis of further cost-effectiveness analyses that can be updated as relevant data are made available.

1 Introduction

Nonalcoholic fatty liver disease (NAFLD) is a chronic liver condition, characterized by > 5% hepatic steatosis that is unrelated to alcohol intake [1]. Based on histology, NAFLD can be categorized as nonalcoholic fatty liver or nonalcoholic steatohepatitis (NASH) [1]. While nonalcoholic fatty liver presents with little or no inflammation or liver cell damage, NASH presents with hepatocellular injury that can progress to hepatic fibrosis, cirrhosis, liver failure, and hepatocellular carcinoma (HCC) [1, 2].

NASH affects an estimated 3–5% of the population globally [2], with prevalence estimates varying between countries [3]. In the USA, for example, the estimated prevalence was 5.3% in 2016, and is predicted to rise to 7.6% by 2030 [3]. In addition to hepatic morbidity, NAFLD and NASH are strongly associated with comorbidities including type 2 diabetes (T2D), obesity, and cardiovascular disease (CVD), and it is increasingly recognized that management strategies will need to deliver holistic, individualized care [1, 2, 4, 5].

There are currently no pharmacologic agents approved for the treatment of NASH [6]. Current management recommendations focus on lifestyle modifications, including increased physical activity and a hypocaloric diet to promote weight loss [1, 7]. There is, therefore, an unmet medical need for effective and safe pharmacological treatments. Several investigational drugs are undergoing clinical trial evaluation, including those that may offer treatment benefits that extend beyond hepatic outcomes [8].

Cost-effectiveness analyses are often required to assess the value of new versus current treatments to determine whether those treatments should be adopted and reimbursed [9]. However, a recent systematic literature review that assessed 16 unique health economic models used in cost-effectiveness analyses in NAFLD or NASH populations found that the included models were heterogeneous in scope and associated with limitations [10]. Key limitations across all models included a lack of NASH-specific data and the consistent omission of comorbidities that frequently co-exist with NASH, such as CVD, T2D, and obesity. Other limitations included: assumptions of linear fibrosis progression rates based on progression rates of patients with mild/no fibrosis (F0, F1)—the lack of consistent disease progression data is also highlighted in another systematic literature review [11]; sourcing data for costs, utilities, and transition probabilities for late-stage complications from other liver disease areas; and a lack of data linking surrogate outcomes (such as steatosis) to hard clinical endpoints (e.g., disease progression). Furthermore, most identified models were product specific (e.g., for diagnostic/staging techniques, or for surgical, lifestyle, or unlicensed pharmacologic interventions), highlighting an unmet need for nonintervention-specific models.

Following publication of the systematic literature review by Johansen et al. [10], the Institute for Clinical and Economic Review (ICER) published an update of its 2016 evaluation of obeticholic acid for the treatment of NASH [12, 13]. The key structural difference to previous models is the inclusion of cardiovascular risk, which was achieved by splitting the population into those with or without prior cardiovascular events. Also, while the earlier ICER model was a patient-level simulation (PLS), the updated model followed a cohort approach. To the authors’ knowledge, no other models for the evaluation of pharmacological interventions in NASH have been developed. There remains an unmet need for high-quality, transparent, validated, nonintervention-specific health economic models of NASH, that can capture both hepatic and nonhepatic outcomes, to assess the cost-effectiveness of treatments for NASH in development [10].

We report here on the development and cross-validation of a de novo cost-effectiveness model (CEM) framework for estimating the cost-effectiveness of treatments for NASH that incorporates extra-hepatic disease states and outcomes alongside hepatic components. This is a novel framework which aims to capture the natural history of NASH and its sequalae, including the impact on comorbidities.

2 Methods

2.1 Development of the CEM Framework

2.1.1 CEM Structure

Overview To account for the effect of pharmacologic agents in clinical development that influence hepatic and non-hepatic outcomes, there was a need for a de novo NASH CEM that addressed the complex interaction of comorbidities commonly associated with NASH (such as CVD, T2D, and obesity), how comorbidities and NASH overlap with respect to outcome risks and quality of life (QoL) decrements, the effect of common treatment across these comorbidities and the heterogeneity of patient characteristics and patient pathways (e.g., dynamic treatment duration). Full details can be found in the technical report (Supplementary Appendix).


Key disease states and clinical outcomes Figure 1 shows an overview of the structure for the CEM. In alignment with findings from Johansen et al. [10], both liver fibrosis stages and NASH histology were included. Liver disease was modeled as progression through fibrosis stages F0 to F4, with F4 also referred to as compensated cirrhosis. Late-stage liver disease outcomes included decompensated cirrhosis (DCC), HCC, and liver transplantation. Key outcomes with QoL and cost implications were included for obesity, T2D, and CVD. Risk equations for these comorbidities were parameterized from the literature. Composite outcomes were used for obesity and T2D, details of which are provided in the Supplementary Information.

Fig. 1
figure 1

Causal relationship of key disease states and clinical outcomes. CVD cardiovascular disease, MI myocardial infarction, NASH nonalcoholic steatohepatitis, T2D type 2 diabetes. Green dashed line indicates effect is specific to treatment and may directly affect outcomes other than fibrosis. Red dashed line indicates that no source with quantifiable relationship is identified, and the relationship is not modeled. aNASH histology is not modeled; included as a placeholder. bComorbidities effects will be optionally and independently included or excluded from analyses

2.1.2 CEM Implementation

To understand the effect of the structural differences seen among prior models, e.g., the 2016 and 2020 ICER models, this CEM includes both PLS and cohort-level Markov (CLM) models in the same framework. The PLS and CLM models were both implemented in the discretely integrated condition-event (DICE) framework [14], in a single common Microsoft Excel® workbook. The binary PLS conditions (i.e., presence or absence) representing health states were replaced by continuous conditions representing the proportions of the cohort in the different health states. Key similarities and differences between the PLS and CLM models are presented in Supplementary Table 1.

2.1.3 Target Population and Patient Profiles

The starting point of the CEM comprises the specification of patient profiles, which are sets of values for baseline characteristics of the individual patients representing the modeled populations or the characteristics of the modeled cohort. Patient profiles included in this CEM were based on study NN9931-4296 (NCT02970942), a phase 2 interventional study comprising patients with histologically confirmed NASH and a fibrosis stage of F1 (28%), F2 (22%), or F3 (49%) [15]. The descriptive statistics of the trial population were provided by Novo Nordisk. The detail of the patient-level data available from this trial allowed for a model that could tolerate a high degree of heterogeneity and therefore be representative of the overall NASH population. For the PLS, individual patients’ profiles were generated using the population averages and standard deviations. For the CLM model, the population means and proportions were used for the profiles describing the homogeneous CEM population.

2.1.4 Data Sources

The CEM framework assessed the cost-effectiveness of obeticholic acid, a small-molecule farnesoid-X receptor agonist, versus standard of care (SoC). Obeticholic acid was selected as there is published data on its clinical effectiveness in the treatment of NASH [16] and its cost-effectiveness, including the ICER 2016 [13] and ICER 2020 [12] model reports. During the process of peer review, a new version of the ICER report (2023) for NASH was published [17]. Cross-validation was also attempted in line with the methodology described for the 2016 and 2020 ICER reports, with conclusions in kee** with the previous cross validation exercises. Details of this validation can be found in the technical report (Supplementary Appendix). SoC was included due to the lack of other pharmacological treatments specifically indicated for the treatment of NASH that would be suitable for inclusion as a comparator. The basis for the current SoC has been created using EU and US guidelines recommending lifestyle modifications [1, 7].

Efficacy and adverse event (AE) parameters for both obeticholic acid and SoC were informed by the 18-month interim analysis of the REGENERATE trial (NCT02548351) [16] as this was the most recently published data at the time, following the approach used in the ICER model [12]. In REGENERATE, patients with NASH and fibrosis stages F2–F3 (or F1 with at least one accompanying comorbidity) were randomly assigned to receive oral placebo, obeticholic acid 10 mg, or obeticholic acid 25 mg daily.

2.1.5 Perspective, Time Horizon, and Discounting

In the prototype analysis, the cost-effectiveness analysis takes the perspective of the US payer, thus considering only direct medical costs. However, the CEM was designed to be easily adapted to other countries’ healthcare systems. The CEM has a flexible time horizon, with a lifetime horizon used for the prototype analysis. Costs and health benefits are discounted at 3% in the prototype analysis, in line with the ICER model [12] and other US-based cost-effectiveness analyses.

2.2 Inputs for Parameterization of CEM Framework

2.2.1 Disease Progression

Liver disease was modeled as progression through fibrosis stages F0–F4, via a Markov state transition probability matrix across both modeling approaches (i.e., the PLS and CLM models). In line with the ICER model [12], inputs for fibrosis progression were based on data reported by Singh et al. [18]. However, the specific transition probabilities used in this model, presented in Table 1, were sourced from a secondary analysis by Gal et al. [19, 20] (Table 1). This secondary analysis, while consistent with Singh et al. [18] in terms of data and model structure, derives transition probabilities from each fibrosis stage. This allows for one-stage improvement or worsening at each model cycle, rather than calculating fibrosis progression rate per baseline fibrosis stage. Input parameters for progression to late-stage liver disease [13, 21, 22] (i.e., DCC, HCC, and liver transplantation) are presented in Table 2. Transition probabilities for F3, F4 to DCC, HCC, and liver transplant are constant annual transition probabilities, with a model cycle length of 1 year.

Table 1 Transition probabilities for fibrosis progression
Table 2 Transition probabilities for progression to late-stage liver disease outcomes

2.2.2 Comorbidities

Comorbidities and risk equations are in line with the ICER model [12] and an obesity CEM developed for the submission of liraglutide 3 mg to health technology assessment bodies, including the National Institute for Health and Care Excellence (NICE) [23]. Sources from which comorbidity risk prediction models were derived for CVD, T2D, and obesity are presented in Supplementary Table 2 and further details of how risk prediction models were used to inform the CEM are provided in the Supplementary Information.

2.2.3 Mortality

The CEM includes liver disease-related mortality, CVD-related mortality, cancer mortality, and general population mortality. Mortality rates were informed by disease-specific studies and reviews, and the ICER model [12], as per the CEM inputs presented in Supplementary Table 3. General population mortality was informed by 2017 US Life Tables [24].

2.2.4 Health-Related Quality of Life (Utilities)

The CEM includes both health state and event utilities, as well as disutilities for AEs, based on NASH-specific data for liver disease health states, as outlined in Supplementary Tables 4 and 5. Health state utilities are applied for the time spent in the respective health state. For liver transplant, the first and subsequent years are differentiated and considered a different health state in terms of utility. Myocardial infarction (MI) and stroke incur a one-off event disutility at the time of the event as well as a post-event health state disutility that is applied for the duration of survival after the event. The annual disutility from AEs is applied while on treatment.

2.2.5 Costs

The CEM includes health state costs, event costs, and treatment costs. Health state costs are applied in the first and subsequent years spent in that health state. For liver transplant and obesity-related cancer health states, different costs are applied for the first and subsequent years. Cardiovascular events (MI, stroke, and cardiovascular death) incur associated one-off costs. After nonfatal MI and stroke, patients enter the post-MI and post-stroke health states and annual nondrug costs are accumulated until the end of the modeled time horizon. All costs were adjusted for inflation using the consumer price index (https://www.bls.gov/cpi/data.htm) so as to be reported in 2020 USD.

Health state costs were included for liver disease, CVD, T2D, and obesity. Costs for liver disease and CVD represent annual nondrug costs, with inputs informed by the ICER model [12]. Input parameters for obesity health state costs were guided by the obesity model developed for the liraglutide 3 mg NICE submission [23]. Included event costs were for CVD events and AEs. CVD costs represent annual nondrug costs, and annual cost from AEs is applied while on treatment. CVD event costs and AE costs were informed by the ICER model [12], and frequency of AEs was obtained from an interim analysis of the REGENERATE trial [16]. Input parameters for all health state and event costs are included in Supplementary Table 6. Included treatment costs are associated with drug acquisition and administration and are presented in Supplementary Table 7.

2.3 Treatment Effect

Treatment effect is formalized as a factor affecting risk. The treatment may affect disease progression in three different ways: (1) the treatment directly affects transition probabilities between health states (e.g., risk ratio of fibrosis progression, improvement or worsening, versus natural history), and no direct effect on liver complications and comorbidity outcome risks is assumed; (2) treatment may affect biomarkers and some patient characteristics that are predictors in risk equations (such as body mass index, glycated hemoglobin, low-density lipoprotein [LDL], high-density lipoprotein, total cholesterol, and systolic blood pressure). However, patient characteristics are not currently included as direct predictors of treatment effect, due to lack of appropriate data; (3) changed disease progression could impact risks indirectly (e.g., NASH status is a predictor of CVD risks; risk ratios are assumed relative to no-NASH population).

Relative risks applied to natural history progression for fibrosis followed the approach of the ICER model (Table 3) [12]. As there are concerns around medications that increase lipid levels, particularly LDL-cholesterol, due to the associated increased risk of CVD, increases in LDL-cholesterol observed in patients receiving obeticholic acid were incorporated after 1 year of treatment, following the approach of the ICER model (Supplementary Table 8) [12].

Table 3 Relative risk applied to fibrosis progression assumed under natural history progression

2.4 Treatment Delay and Discontinuations

Treatment in both arms may be delayed by diagnostic inertia. The delay is a CEM input applied equally in each arm. While treatment is delayed, patients progress with NASH and comorbidities according to natural history progression. Treatment-related costs and health benefits start accruing when treatment starts after the user-defined delay. In the prototype analysis, the diagnostic inertia is set to zero (no delay). Intervention-specific discontinuation rates were input into the CEM and applied stochastically in the PLS, and deterministically in the CLM model. The annual discontinuation rate for obeticholic acid was assumed to be 6%, based on data from the REGENERATE trial [16], and for SoC was assumed to be zero. Unless a patient discontinued treatment, they were assumed to be continuously treated with obeticholic acid. The CEM allows for maximum treatment durations and stop** rules, although these were not actioned in the prototype analysis. After discontinuation, patients will continue progression according to natural history of disease.

2.5 CEM Outcomes

The CEM calculates total and incremental costs, life years (LYs), quality-adjusted LYs (QALYs), ICERs, and incremental cost-utility ratios for obeticholic acid and SoC.

The CEM framework permits scenario analyses to be conducted to assess the impact of changes on the results in the selection of underlying CEM parameters or values, including patient subpopulations, time horizon, discount rates, natural history of disease, and treatment efficacy assumptions. The CEM also allows for the accounting of uncertainty in input parameters, through use of deterministic sensitivity analyses and probabilistic sensitivity analyses, which were performed by varying CEM parameters according to appropriate probability distributions around the prototype analysis. Further details on the deterministic and probabilistic sensitivity analysis methodology are provided in the Supplementary Information.

2.6 Validation Approach

The CEM was subject to several rounds of technical validation to identify and resolve any calculation errors that were inadvertently introduced as part of its development. Clinical validation was undertaken by an advisory board of clinical and health economics experts that was convened in 2021 as part of the CEM validation process. In addition, a cross-validation exercise was undertaken to compare results with two previously developed models of obeticholic acid versus SoC [12, 13]. Based on reported outcomes for each published model, the equivalent results from the CEM were extracted and compared (i.e., inputs for the CEM were adjusted to reflect those reported in the published reports, and results were extracted). Any notable differences in results across models were recorded and included as part of a narrative summary of the cross-validation exercise.

3 Results

3.1 CEM Results

The prototype analysis was performed using a PLS. For obeticholic acid versus SoC, respectively, total costs were $911,651 versus $244,263, total LYs were 15.651 versus 14.935, and total QALYs were 12.325 versus 11.637 (Table 4). Clinical outcomes related to liver disease health states (mean fibrosis stage change, DCC, and HCC) were more frequent for patients receiving SoC versus obeticholic acid, while outcomes related to comorbidities were lower for patients receiving SoC versus obeticholic acid (Supplementary Table 9).

Table 4 Summary results for prototype analysis

Costs were disaggregated by outcome, health state, and comorbidity (Supplementary Table 10). The greater overall cost for obeticholic acid versus SoC was driven by higher costs of drug acquisition ($716,879 versus $0, respectively). Drug acquisitions costs for obeticholic acid represented 78% of total costs; for patients receiving SoC, 99% of total costs were due to disease management costs. Except for fibrosis stage F0–F2, costs for individual liver-disease health states were higher with SoC compared with obeticholic acid, while costs for comorbidities were nominally greater with obeticholic acid versus SoC. For both comparators, the largest associated cost by comorbidities was T2D, and by liver-disease health state was HCC.

Deterministic sensitivity analysis and probabilistic sensitivity analysis results can be found in the Supplementary Information.

3.2 Validation Results

Key findings from the clinical validation undertaken by expert advisors were incorporated into this CEM. For example, the core model structure was based on fibrosis staging, in kee** with previous ICER models [12, 13]. A microsimulation approach (PLS) was utilized to adequately account for patient outcome heterogeneity and tracking patient history/changing risk of competing events, or at least an element of this. In addition, the model incorporated the ability for payers to assess the face-validity of component parts of the model (e.g., the ability to turn off comorbidity health states to assess the cost-effectiveness in NASH outcomes alone).

Cross-validation findings showed that the CEM yields markedly different results versus the previously published ICER models (which also present substantially different results from each other), although it should be noted that owing to limited reporting and different model structures, a thorough investigation of cross-validation findings was not feasible. LYs estimated by the CEM were approximately 1.7 times those produced by the ICER 2016 model but were more closely aligned with the ICER 2020 model (approximately 1.2 times). Costs for obeticholic acid in the CEM were nearly twice the costs predicted by the ICER 2016 model, likely reflecting longer treatment duration (due to extended survival). Conversely, the ICER 2020 model estimated higher total costs compared with the CEM (although survival was shorter in the ICER 2020 model). Costs were substantially affected by the specification of a stop** rule for obeticholic acid.

The cross-validation exercise identified key drivers of differences in the CEM and ICER cost-effectiveness results. These included mortality inputs, transition probability estimates (and the methodological approach used to derive these estimates), incorporation (or lack thereof) of treatment effects (and which transition[s] are affected), and the inclusion (or lack thereof) of specific complications and/or comorbidities explicitly captured as part of the model structure and disease process. Details on the cross-validation exercise can be found in the technical report (Supplementary Appendix).

4 Discussion

This report describes the development of a de novo CEM framework to evaluate the potential cost-effectiveness of NASH therapies. As highlighted by Johansen et al. [10], published CEMs in NAFLD or NASH populations have limitations that preclude their immediate suitability to fully explore the potential benefits, costs, and consequences of different types of NASH therapies. These include a lack of NASH-specific data that necessitates inclusion of various assumptions in model input parameters, and omission of common comorbid conditions. This CEM aligns closely with other recent CEM models, such as those outlined by Rustgi et al. [25] and Javanbakht et al. [26], in its approach to transition probabilities and the sources used for NASH data. However, the CEM presented here attempts to address the unmet need highlighted in both of these papers by incorporating the clinical consequences of several comorbidities that frequently occur alongside NASH, including obesity, T2D, and CVD. Furthermore, to the best of our knowledge, this is the first NASH CEM developed using DICE methodology to run analyses. Similar to previous models, this CEM includes several assumptions due to a lack of NASH-specific data available at the time of development. As such, this CEM is considered a framework that, while offering some improvements over existing models, will be further developed in future as more relevant data become available.

A core outcome set (COS) for NASH has been described by the coreNASH panel, who propose that the COS could support regulatory and market access decision-making when evaluating the value of new treatments for NASH [27]. For NASH without cirrhosis (F2–F3), the final COS reported by the panel included disease stage, fibrosis stage, and hepatic-related mortality. For NASH with cirrhosis (F4), the final COS included health-related QoL, hepatic-related morbidity, cirrhosis, fibrosis stage, HCC, clinical need for liver transplant, decompensation events, Model for End-Stage Liver Disease (MELD) score, hospital stay, all-cause mortality, hepatic-related mortality, and survival. Outcomes reported by this CEM broadly align with the COS, although MELD score and hospital stay are not currently included.

Various types of validation were explored and conducted where possible with currently available data. Clinical validation was undertaken by an advisory board of clinical and health economics experts, with key recommendations from advisors incorporated into this CEM. A technical validation was undertaken to identify model errors introduced throughout construction, which were resolved before finalization. A cross-validation against two published models comparing obeticholic acid to SoC was performed, although results were notably different driven by key differences in approach taken to reflect survival, model transitions, and incorporate treatment effects. External and predictive validations were not feasible, owing to an absence of relevant data. As such, this work can be viewed as laying the path for future modeling validation work in NASH.

This work reflects the importance of establishing the need to treat patients holistically, and not limit focus to liver biomarkers and hepatic outcomes.

4.1 Strengths and Limitations

The key strengths of the CEM include its flexibility to evaluate cost-effectiveness in NASH therapies across a variety of settings, the ability to assess input parameter uncertainty, and the ease of incorporating new patient profiles, comparators, and data sources as they become available.

There are several inherent challenges for modeling new interventions in NASH, for example, heterogeneity due to diagnostic techniques (biopsy versus noninvasive techniques), interassessor and biopsy site variability in measured outcomes, and the prevalence of metabolic, cardiovascular, and mortality risk-modifying comorbidities. In addition, fibrosis progression can take several years, and it may be challenging to observe significant improvements if follow-up time is inadequate. Finally, significant placebo effects can be observed in NASH trials that could concern payers regarding the generalizability of SoC treatment. For the latter, real-world evidence, healthcare resource utilization data, and subpopulation analyses may be needed to illustrate the generalizability of trial results.

The main limitations of this CEM stem from a lack of available data, leading to the incorporation of several assumptions. In particular, there is currently inadequate information to inform the link between NASH histology and fibrosis with hard clinical outcomes. Modeling of surrogate outcomes for prediction of long-term outcomes (based on histological changes in liver fibrosis and inflammation) may be required; however, relying on surrogate outcomes will add uncertainty in cost-effectiveness estimates. There is a lack of comprehensive risk equations for NASH comorbidities, and the relationships with NASH are not well established; therefore, some assumptions for cardiovascular outcomes in cardiovascular risk equations were based on temporary changes in risk factors that may not reflect long-term change. In addition, the United Kingdom Prospective Diabetes Study (UKPDS) risk equations [28] were developed in T2D populations and validation within a NASH population would improve their suitability for inclusion in this CEM. For the PLS, individual patients were generated according to population averages and standard deviations. A limitation of this approach is that it ignores the likely important correlations between the baseline characteristics. However, that information was not available for the CEM development. NASH resolution is currently not included in the CEM. While this is due to data limitations, this may be a concern for payers when approving a treatment for NASH. It is also worth noting that for this framework, patient profiles were generated from a study with semaglutide. Though this may mean that the results differ from clinical evaluation models that looked at data from a study using obeticholic acid specifically, the summary analysis based on NASH patient-level data available from this study allowed for greater generalization of the framework, and thus wider applicability.

4.2 Proposed Future Developments

Several drug classes are in clinical development for the treatment of NASH, and, as such, potentially relevant data may become available in the coming years that could be used to improve this CEM or other NASH models. These include incorporation of surrogate outcomes used to predict long-term clinical outcomes once suitable modeling data are available, for example, the incorporation of liver enzymes as potential predictors of treatment effect.

In this CEM, CVD event risk is currently derived from an assumption made for the relative risk of CVD in patients with NASH, relative to CVD risk in patients without NASH. Comparing rates of CVD events using the UKPDS in the NASH and T2D population with cardiovascular events in NASH registries (e.g., the Liver Investigation: Testing Marker Utility in Steatohepatitis [LITMUS] and Non-Invasive Biomarkers of Metabolic Liver Disease [NIMBLE] consortia) may be beneficial.

Factors omitted from the current CEM include indirect costs (not used in US base case) and treatment effect on liver histology (NAFLD Activity Score component scores), liver enzymes, obesity, T2D, and hypertension. Placeholders are included for these inputs for potential implementation at a later stage. To fully align with the NASH COS described by the coreNASH panel [27], MELD score and hospital stay are also proposed for incorporation as part of any future works for this CEM. In addition, it is anticipated that parameters associated with SoC may require updating in the future as other pharmacologic treatments gain approval and become part of the SoC for NASH.

5 Conclusion

This CEM utilizes NASH-specific data to create a framework that can be used to estimate the cost-effectiveness of NASH interventions and is the first model to incorporate the clinical consequences of NASH comorbidities. There is still a lack of information available regarding several important considerations for cost-effectiveness models in NASH, such as the link between NASH histology and clinical outcomes and the interaction of comorbidities with NASH. However, due to its flexibility, the model can be further developed in the future to include the increasing amount of NASH data being generated.