Article Text

Original research
Glucagon-like peptide-1 receptor agonists and risk of major adverse liver outcomes in patients with chronic liver disease and type 2 diabetes
  1. Axel Wester1,
  2. Ying Shang1,
  3. Emilie Toresson Grip1,2,
  4. Anthony A Matthews3,
  5. Hannes Hagström1,4
  1. 1Department of Medicine Huddinge, Karolinska Institutet, Stockholm, Sweden
  2. 2Quantify Research, Stockholm, Sweden
  3. 3Unit of Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
  4. 4Division of Hepatology, Department of Upper GI, Karolinska University Hospital, Stockholm, Sweden
  1. Correspondence to Dr Axel Wester, Department of Medicine Huddinge, Karolinska Institutet, Stockholm, Sweden; axel.wester{at}ki.se

Abstract

Objective Phase II trials suggest glucagon-like peptide-1 receptor (GLP1) agonists resolve metabolic dysfunction-associated steatohepatitis but do not affect fibrosis regression. We aimed to determine the long-term causal effect of GLP1 agonists on the risk of major adverse liver outcomes (MALO) in patients with any chronic liver disease and type 2 diabetes.

Design We used observational data from Swedish healthcare registers 2010–2020 to emulate a target trial of GLP1 agonists in eligible patients with chronic liver disease and type 2 diabetes. We used an inverse-probability weighted marginal structural model to compare parametric estimates of 10-year MALO risk (decompensated cirrhosis, hepatocellular carcinoma, liver transplantation or MALO-related death) in initiators of GLP1 agonists with non-initiators. We randomly sampled 5% of the non-initiators to increase computational efficiency.

Results GLP1 agonist initiators had a 10-year risk of MALO at 13.3% (42/1026) vs 14.6% in non-initiators (1079/15 633) in intention-to-treat analysis (risk ratio (RR)=0.91, 95% CI=0.50 to 1.32). The corresponding 10-year per-protocol risk estimates were 7.4% (22/1026) and 14.4% (1079/15 633), respectively (RR=0.51, 95% CI=0.14 to 0.88). The per-protocol risk estimates at 6 years were 5.4% (21/1026) vs 9.0% (933/15 633) (RR=0.60, 95% CI=0.29 to 0.90) and at 8 years 7.2% (22/1026) vs 11.7% (1036/15 633) (RR=0.61, 95% CI=0.21 to 1.01).

Conclusion In patients with chronic liver disease and type 2 diabetes who adhered to therapy over time, GLP1 agonists may result in lower risk of MALO. This suggests that GLP1 agonists are promising agents to reduce risk of chronic liver disease progression in patients with concurrent type 2 diabetes, although this needs to be corroborated in randomised trials.

  • CIRRHOSIS
  • PHARMACOTHERAPY
  • FATTY LIVER
  • EPIDEMIOLOGY
  • DIABETES MELLITUS

Data availability statement

No data are available. No data are available due to Swedish regulations.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Video Abstract

Disclaimer: this video summarises a scientific article published by BMJ Publishing Group Limited (BMJ). The content of this video has not been peer-reviewed and does not constitute medical advice. Any opinions expressed are solely those of the contributors. Viewers should be aware that professionals in the field may have different opinions. BMJ does not endorse any opinions expressed or recommendations discussed. Viewers should not use the content of the video as the basis for any medical treatment. BMJ disclaims all liability and responsibility arising from any reliance placed on the content.

WHAT IS ALREADY KNOWN ON THIS TOPIC

  • Glucagon-like peptide-1 receptor (GLP1) agonists might resolve metabolic dysfunction-associated steatohepatitis, but their effect on hard clinical outcomes in patients with chronic liver diseases of any aetiology and concurrent type 2 diabetes is unknown.

WHAT THIS STUDY ADDS

  • Using Swedish register data, we emulated a target trial of GLP1 agonists in patients with chronic liver disease and type 2 diabetes and fitted an inverse-probability weighted marginal structural model to estimate 10-year risks of major adverse liver outcomes (MALO). The risk of MALO was 49% lower in initiators of GLP1 agonists in the per-protocol analysis, but our data were not compatible with a protective intention-to-treat effect.

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

  • GLP1 agonists might be a treatment option to reduce MALO risk in patients with type 2 diabetes and any chronic liver disease who adhere to the treatment over time, although this would need to be corroborated by randomised clinical trials.

Introduction

Chronic liver diseases are highly prevalent and can progress to decompensated cirrhosis, hepatocellular carcinoma (HCC) and liver-related death.1–5 Type 2 diabetes strongly predicts presence and severity of metabolic dysfunction-associated steatotic liver disease (MASLD) and is also a major risk factor for disease progression in other liver diseases, likely due to interaction of hepatic steatosis and steatohepatitis with other liver diseases.6–9

Currently, no approved pharmacotherapy exists for MASLD, but one promising drug class is glucagon-like peptide-1 receptor (GLP1) agonists, which are currently approved in patients with type 2 diabetes or obesity to achieve weight loss and control blood glucose.10–12 Importantly, phase II trials indicate that GLP1 agonists resolve metabolic dysfunction-associated steatohepatitis (MASH) in patients with non-cirrhotic MASLD but do not cause fibrosis regression.13 14 Large phase III trials that aim to estimate the effect of GLP1 agonists on resolving MASH or reducing hepatic fibrosis are, however, many years from completion.15

Although achieving surrogate histopathological endpoints (eg, fibrosis regression) is considered by regulatory agencies to likely translate to improved prognosis, robust evidence is needed to understand if GLP1 agonists reduce the risk of long-term clinical outcomes, such as liver decompensation or HCC.16 17 Given that the metabolic syndrome is a major driver of liver-related outcomes both in patients with MASLD and other chronic liver diseases such as alcohol-related liver disease or viral hepatitis C, there could be a similar effect of GLP1 agonists in patients with chronic liver diseases of any aetiology with concomitant metabolic traits, such as type 2 diabetes.18–20 For instance, insulin resistance is the strongest predictor of liver fibrosis in patients with alcohol-related liver disease.21 Therefore, we designed a target trial that would estimate the long-term causal effect of GLP1 agonists on major adverse liver outcomes (MALO) in patients with any chronic liver disease and type 2 diabetes, and then emulated it using observational data from Swedish healthcare registers.

Methods

Data sources

The Decoding the Epidemiology of LIVER disease in Sweden (DELIVER) cohort includes data from Swedish national healthcare registers on all patients with any chronic liver disease in Sweden 1964–2020.22 The data include all International Classification of Diseases (ICD) codes from inpatient and specialised outpatient care, dates and causes of death, and automatically recorded information on filled prescriptions from any pharmacy in Sweden.23–29 The positive predictive value is 96% for MASLD with comorbid type 2 diabetes, and >90% for most diagnoses related to cirrhosis.30 31 A detailed overview of the registers is provided in online supplemental methods.

Supplemental material

Target trial specification and emulation

A causal question is best answered by data from randomised trials, but when unavailable, researchers often resort to observational data from existing databases. To avoid common methodological pitfalls in observational studies, causal inference from such data can be viewed as an attempt to emulate a hypothetical pragmatic randomised trial—a target trial.32 An overview of the target trial emulation concept is provided in reference.33 After specifying the target trial protocol, it is emulated using the available observational data and appropriate methodology. For this observational study, we first specified the protocol of a target trial that would estimate the effect of GLP1 agonists on MALO risk in patients with chronic liver disease and type 2 diabetes. We then emulated the target trial using data from DELIVER. Table 1 summarises all protocol components from the target trial and its emulation, which we describe in detail below. All diagnoses and medications were defined by the ICD or Anatomical Therapeutic Chemical (ATC) codes in online supplemental tables 1–3. Diagnoses were identified from January 1997 and forward (when ICD-10 was introduced in Sweden), and drugs from July 2005 and forward (when the Swedish Prescribed Drug Register was initiated).

Table 1

Protocol of a target trial and an emulated trial using observational data

Eligibility criteria

All Swedish residents ≥18 years of age between January 2010, when uptake of GLP1 agonists increased in Sweden, and November 2020 with any chronic liver disease and type 2 diabetes were identified.22 To avoid structural positivity issues (ie, patients with zero probability of initiating a GLP1 agonist at baseline), patients were required to have filled at least one prescription of metformin within a year before baseline corresponding to a daily dose of ≥1 g (ie, patients potentially eligible for second-line treatment with a GLP1 agonist). Patients were excluded if they previously filled prescriptions of GLP1 agonists, had a history of a contraindication to GLP1 agonists (defined as pancreatitis, inflammatory bowel disease or severe chronic kidney disease) or prior MALO (defined below). As the non-initiator population was large, we randomly sampled 5% of this group to increase computational efficiency (figure 1).

Figure 1

Flow chart of the study population. Numbers represent study participants (initiators or non-initiators), while numbers in parentheses represent the corresponding number of unique patients. Note that the numbers of excluded study participants represent the total number of times that unique patients were non-eligible for any of the emulated target trials. If somebody was non-eligible for all the 131 emulated target trials, they would contribute with an addition of 131 to this total number.

Treatment strategies

We compared two treatment strategies: initiation of any GLP1 agonist (ATC code A10BJ) at baseline and continuation of treatment over follow-up unless a contraindication was diagnosed after baseline; and no initiation of a GLP1 agonist at baseline and continuation of no GLP1 agonist treatment during follow-up, unless indicated as deemed by the treating physician. Since we lacked data to specifically determine who were indicated for GLP1 agonists, we assumed that it was indicated in all non-initiators who started the drug during follow-up. We assessed drug continuation by summing the number of months of filled prescriptions. A gap between two successive prescriptions was allowed if it was less than twice the time intended for the most recently filled prescription. For example, an initiator who filled a prescription for 3 months treatment was considered to have stopped the treatment after 6 months, unless the prescription was refilled before that.

Treatment assignment

Patients were classified into two groups according to the strategy their data were compatible with at baseline, that is, GLP1 agonist initiators and non-initiators. We assumed groups were exchangeable at baseline conditional on baseline covariates (similar probability of initiating the drug in both arms, within levels of the covariates): age, sex, education (<10, 10–12 and >12 years), diabetes duration, liver disease aetiology, compensated cirrhosis, and a range of comorbidities and medications: obesity, cardiovascular disease, microvascular complications to diabetes, chronic obstructive pulmonary disease, alcohol use disorder, mental health disorder, the use of antidiabetic medications except metformin or GLP1 agonists and direct-acting antivirals in patients with viral hepatitis. As the relationship between age and the probability of initiating a GLP1 agonist might not be linear, we modelled age using linear, quadratic and cubic terms. If patients had coding for more than one liver disease aetiology, they were classified according to a predefined hierarchy (online supplemental methods).

Outcome

The outcome of interest was the first MALO during follow-up, defined as decompensated cirrhosis (variceal bleeding, ascites, portal hypertension or hepatorenal syndrome), HCC, liver transplantation or MALO-related death. MALO was defined by ICD codes in the National Patient Register (main or secondary diagnosis), the Cancer Register or the Cause of Death Register (main or contributing cause) (online supplemental table 3). These outcomes have been validated and found to have positive predictive values >90% (online supplemental table 4).31

Follow-up

Everyone was followed from baseline to the earliest of MALO, emigration from Sweden, 10 years of follow-up or December 2020. The follow-up was measured in calendar months.

Causal contrasts

We estimated observational analogues of the intention-to-treat and per-protocol effects.

Statistical analyses

We sequentially emulated the target trial as a series of separate target trials starting in each 131 calendar months between January 2010 and November 2020, meaning that patients could enter multiple target trials if eligible. To avoid immortal time bias, the baseline is best defined as the time when eligible patients initiate a treatment strategy.32 The GLP agonist initiators naturally have one such time point. The definition of the baseline is, however, more challenging in the non-initiators since the same individual can be eligible at multiple times. One solution that avoids immortal time bias is to emulate a target trial that uses all those eligibility times as the baseline and consider each individual at each of those times as different individuals.34 For example, a patient who fulfilled all eligibility criteria in our study in January 2010 and did not initiate a GLP1 agonist in that month would enter the target trial that started in January 2010 as a non-initiator. If still eligible in February 2010, this patient would also enter the target trial that started in this month as a non-initiator. If the same patient initiated a GLP1 agonist in March 2010 and still fulfilled all eligibility criteria, the patient would enter this target trial as an initiator and be non-eligible for all subsequent target trials as previous use of GLP1 agonists was an exclusion criterion. Target trials with only initiators or only non-initiators were excluded. Allowing repeated eligibility is statistically more efficient than choosing only 1 month as baseline and accounts for the fact that patients can be eligible in several different months during the study period.35 36

We fitted a marginal structural model using parametric pooled logistic regression with an indicator for treatment group, a flexible time-varying intercept (linear and quadratic terms), product terms between treatment group and time, and a target trial indicator to pool data for all the emulated trials and estimate intention-to-treat and per-protocol effects. For estimation of the intention-to-treat effect, we weighted the model using inverse-probability of treatment weights (IPTW). The IPTW models included all baseline covariates, and the weights were stabilised. The balance between treatment groups was assessed using standardised mean differences (SMD) and inspection of kernel density plots. An SMD<0.1 is generally regarded to indicate good balance.37 For estimation of the per-protocol effect, the same marginal structural model as above was used, but patients were additionally censored when deviating from their assigned treatment strategy and stabilised inverse-probability of censoring weights (IPCWs) were applied to adjust for baseline and time-varying covariates associated with adherence.34 38 The marginal structural model was weighted using the product of the IPTWs and the IPCWs. The IPCW models included age, sex, education, diabetes duration and liver disease aetiology at baseline, and the following time-varying covariates: compensated cirrhosis and the same range of comorbidities and medications as described above. If patients stopped GLP1 agonist treatment because of a contraindication during follow-up (eg, an episode of pancreatitis), or started GLP1 agonist treatment, their censoring weights remained constant from that date forward. The weights are described in online supplemental table 5.

The average 10-year absolute risks under each strategy were estimated using the predicted values from the marginal structural models, then resulting risk differences (RDs) and risk ratios (RRs) were calculated. Non-parametric bootstrapping with 500 replications was used to estimate 95% CIs.

We examined the intention-to-treat effect in subgroups according to liver disease aetiology (MASLD, other than MASLD), and liver disease severity at baseline (compensated cirrhosis, no cirrhosis). Several sensitivity analyses were done to assess the robustness of our results. First, we updated the eligibility criteria to include those with a lower daily dose of metformin (0.5 g instead of 1 g). Second, the time gap between two successive prescriptions was restricted to ≤30 days. Third, we censored non-initiators if they initiated a GLP1 agonist during follow-up, regardless of whether it was indicated, and IPCWs were applied as described above. Fourth, the inverse-probability weights were truncated at the 1st and 99th percentile before being applied to the marginal structural model, to avoid the impact of extreme values on the risk estimates. Fifth, we used standardisation to adjust for confounding at baseline, rather than IPTW.39 In this analysis, the IPCWs were used in the per-protocol analysis as described above. Sixth, we estimated intention-to-treat and per-protocol point estimates including all non-initiators (without sampling 5%), to assess whether the sampling affected our risk estimates. Finally, to estimate how strongly an unmeasured confounder would need to be associated with both the exposure and outcome to fully explain any differences in risk between treatment groups, we calculated the E-value.40

We additionally calculated the intention-to-treat and per-protocol effects at 2, 4, 6 and 8 years of follow-up. Because 2 years is a more plausible duration of a future randomised trial of GLP1 agonists in patients with chronic liver disease than 10 years, we computed the minimum sample size required in a clinical trial of GLP1 agonists to demonstrate an effect of equal strength as our estimated 2-year RR, using a 5% alpha and 80% power.

Analyses were done in Stata V.17.0 (StataCorp).

Patient and public involvement

Patients or the public were not involved in the design, conduct, reporting or dissemination plans of this research.

Results

We included 1026 initiators of GLP1 agonists and 15 633 non-initiators, who participated in up to 123 target trials. The patient characteristics at baseline before and after weighting are summarised in table 2. After IPTW, all baseline characteristics were well balanced (SMD<0.1). Kernel density plots also indicated good balance (online supplemental figure 1). Of the initiators, 635 (61.9%) started treatment with liraglutide, 231 (22.5%) semaglutide, 120 (11.7%) dulaglutide, 25 (2.4%) exenatide and 15 (1.5%) lixisenatide at baseline.

Table 2

Patient characteristics at baseline before and after inverse-probability of treatment weighting

In the intention-to-treat analysis, initiators and non-initiators were followed for a median (p25–p75) of 64 (36–96) and 76 months (50–100), respectively. 350 of 605 (57.9%) initiators still at risk at 2 years were continuous users at this time. The corresponding numbers at 4, 6, 8 and 10 years were 143 of 331 (43.2%), 64 of 188 (34.0%), 24 of 98 (24.5%) and 5 of 19 (26.3%), respectively. In the per-protocol analysis, where the patients were censored if they deviated from their assigned treatment strategy, the median follow-up was 43 (21–75) and 76 months (50–100), respectively. After a median of 14 months (8–26) follow-up, 517 (50.4%) initiators stopped the treatment; 21 (2.1%) initiators stopped the treatment after developing a contraindication, and 496 (48.3%) stopped the treatment without one of the prespecified contraindications. Of the 1026 initiators, 361 (35.2%) were censored in the per-protocol analysis because they stopped the treatment with a prespecified contraindication the first 2 years, another 92 (9.0%) between 2 and 4 years, 28 (2.7%) between 4 and 6 years, 11 (1.1%) between 6 and 8 years, and 4 (0.4%) between 8 and 10 years. Among the non-initiators, 2357 (15.1%) started treatment with a GLP1 agonist after a median follow-up of 31 months (14–52).

In the intention-to-treat analysis, MALO occurred in 42 initiators and 1079 non-initiators. The events in non-initiators corresponded to 486 distinct events (since participation in multiple target trials was allowed, some contributed with events to more than one target trial). The 10-year risk of MALO was 13.3% (95% CI=7.4% to 19.2%) in the initiators and 14.6% (95% CI=13.1% to 16.1%) in the non-initiators (RD=−1.3, 95% CI=−7.2 to 4.6, RR=0.91, 95% CI=0.50 to 1.32) (table 3, figure 2A).

Figure 2

Inverse-probability weighted risk curves of major adverse liver outcomes comparing initiators of glucagon-like peptide-1 receptor (GLP1) agonists with non-initiators. (A) intention-to-treat effect, (B) per-protocol effect.

Table 3

Risk of major adverse liver outcomes according to intention-to-treat and per-protocol analyses

In per-protocol analysis, we observed 22 events of MALO in the initiators and 1079 in the non-initiators. The 10-year MALO risk was 7.4% (95% CI=2.1% to 12.6%) in the initiators and 14.4% (95% CI=12.9% to 15.9%) in the non-initiators (RD=−7.1, 95% CI=−12.5 to –1.6; RR=0.51, 95% CI=0.14 to 0.88) (table 3, figure 2B). This corresponds to a number needed to treat (initiate and continue with the GLP1 agonist treatment strategy) to avoid one event of MALO over the course of 10 years of 14 (95% CI=8 to 63).

Results from the intention-to-treat subgroup analyses are summarised in table 4. In patients with MASLD, the 10-year risk of MALO was 15.8% in the initiators and 11.2% in the non-initiators (RR=1.41, 95% CI=0.53 to 2.30). The 10-year risk in patients with compensated cirrhosis was 36.5% and 34.6% in initiators and non-initiators, respectively (RR=1.05, 95% CI=0.20 to 1.91).

Table 4

Risk of major adverse liver outcomes at 10 years according to intention-to-treat analyses in subgroups

The risk estimates from a sensitivity analysis that allowed patients to have a lower dose of metformin at baseline were similar to the main analyses (intention-to-treat RR=0.94, 95% CI=0.58 to 1.30; per-protocol RR=0.63, 95% CI=0.27 to 0.99) (online supplemental table 6). The per-protocol estimates were also similar when restricting the gap between two successive prescriptions to ≤30 days (RR=0.48, 95% CI=0.11 to 0.85) (online supplemental table 7). In addition, similar per-protocol estimates were found when the non-initiators were censored if they initiated a GLP1 agonist during follow-up (RR=0.55, 95% CI=0.18 to 0.92) (online supplemental table 8). Both the intention-to-treat and per-protocol estimates were similar to the main analyses when the weights were truncated (intention-to-treat RR=0.84, 95% CI=0.48 to 1.20; per-protocol RR=0.55, 95% CI=0.16 to 0.93) (online supplemental table 9). Moreover, results were similar to the main analysis when using standardisation to adjust for confounders at baseline (intention-to-treat RR=0.87, 95% CI=0.53 to 1.22; per-protocol RR=0.58, 95% CI=0.19 to 0.97) (online supplemental table 10). Point estimates were similar to the main analysis when including all non-initiators (n=312 661) (intention-to-treat RR=1.01, per-protocol RR=0.53) (online supplemental table 11). The E-value for the RRs in the main per-protocol analysis was 3.33 for the point estimate and 1.53 for the 95% CI, suggesting how strong an unmeasured confounder needs to be to fully explain the estimated per-protocol effect and shift the 95% CI to include the null.

Risk estimates for the intention-to-treat and per-protocol analyses at 2, 4, 6 and 8 years are presented in table 3. At 2 years follow-up, a more plausible duration of a clinical trial than 10 years, the risk of MALO was 2.3% in the initiators and 3.2% in the non-initiators (overall event probability of 2.7%) when analysed by the intention-to-treat principle (RD=−0.9, 95% CI=−2.2 to 0.4; RR=0.72, 95% CI=0.31 to 1.13). A clinical trial of GLP1 agonists on the risk of MALO with 2 years of follow-up would need enrolment of at least 10 776 patients per arm to provide evidence for such effect.

Discussion

We emulated a nationwide target trial to answer the question whether GLP1 agonists can prevent development of MALO in patients with chronic liver diseases and type 2 diabetes. The main finding was that the 10-year risk of MALO was 49% lower in patients who initiated and adhered to GLP1 agonists compared with non-initiators in the per-protocol analysis, but the estimates from the intention-to-treat analysis were imprecise with a 95% CI for the RR ranging from 0.50 to 1.32.

A placebo-controlled randomised phase II trial of 320 patients with MASH reported that the proportion achieving MASH resolution without worsening fibrosis after 72 weeks was more than tripled in the arm receiving 0.4 mg of the GLP1 agonist semaglutide compared with placebo (59% vs 17%).14 We followed patients with any chronic liver disease and type 2 diabetes for up to 10 years and found a similarly strong effect of GLP1 agonists on the risk of MALO in per-protocol analysis, but an imprecise intention-to-treat effect. However, MALO risk was similar in the initiator arm when restricting the analysis to patients with MASLD, although this should be interpreted cautiously as we noted wide CIs for these estimates. The phase II trial only included patients with fibrosis stages F2–F3 in the primary outcome analysis, whereas we included patients of any fibrosis stage including compensated cirrhosis.14 Our study did not provide support for a protective effect of GLP1 agonists in the subgroup with compensated cirrhosis, which is in line with a recent phase II trial in patients with MASH-related compensated cirrhosis.41

Based on the observed probability of MALO, our data indicate that a clinical trial of GLP1 agonists in patients with any chronic liver disease and type 2 diabetes using 2-year MALO risk as the outcome would demand an immense number of patients, at least 10 776 patients per arm. This can be contrasted to the ongoing phase III trial of semaglutide looking at both histopathological endpoints (after 1.5 years) and MALO (after 4.5 years) that is planning to recruit 1200 patients with MASH (identifier NCT04822181). The emulation of a target trial is an appealing option to give timely answers to key research questions when data from large clinical trials are currently unavailable.32 The difference between our intention-to-treat and per-protocol estimates is that many initiators stopped treatment without one of our prespecified contraindications. Patients who ended their treatment might have done so for good reasons, for example, severe gastrointestinal symptoms, but we were unable to capture this in our data. Moreover, since we lacked data to specifically determine whether GLP1 agonists were indicated for the non-initiators during follow-up, we assumed that it was indicated for all non-initiators that started treatment. Therefore, none of the non-initiators were censored for not adhering to protocol, whereas many initiators stopped treatment and were then censored in the per-protocol analysis. Large observational studies including detailed data with relevance for the choice of pharmacological treatment in type 2 diabetes, such as glycated haemoglobin, are warranted.

The main strength of our study was the use of an emulated target trial design to overcome common biases in observational analyses, including immortal time bias.34 In fact, bias in observational studies often arise predominantly due to poor study design, rather than confounding due to lack of randomisation, and estimates from carefully emulated target trials closely resemble those from randomised trials.36 There were two reasons why we designed a target trial that compared GLP1 agonists to non-initiators rather than an active comparator, another common approach.42–44 First, it would have asked a different research question, that is, whether MALO risk differs from the active comparator. An ideal active comparator has an identical indication as the drug of interest and no effect on the outcome, but the effect of other antidiabetic medications on MALO risk is mostly unknown. Second, the average treatment effect in the whole study population is not identifiable when comparing to an active comparator.45 We additionally fitted an inverse-probability weighted marginal structural model to account for possible time-varying confounders associated with adherence when estimating the per-protocol effect. Whereas the intention-to-treat effect is the most used causal contrast in clinical trials, the per-protocol analysis indicates what the effect of a treatment strategy would be if adhered to, which is of great importance in the real-world when physicians and patients decide on an appropriate treatment strategy.38 When there is feedback between time-varying confounders and treatments, then standard regression models (eg, Cox) will produce biased estimates.39 The family of g methods has been developed for this purpose (including inverse-probability weighted marginal structural models and standardisation).39 In addition, we used the validated population-based Swedish national healthcare registers to minimise selection bias and estimate the long-term effect of GLP1 agonists.23–25 29–31

Some limitations should be acknowledged. Despite a median follow-up of 5–6 years, few initiators experienced MALO, yielding estimates with low precision and preventing estimation of per-protocol effects in subgroups. The low number of events in initiators during late follow-up (only one event in the last 4 years and none in the last 2 years) could possibly explain part of the per-protocol effect, however, the RRs were stable across time from year 6 and forward. Additionally, we lacked data on some important covariates. First, we had no data on fibrosis stage, beyond classifying patients as cirrhotic (F4) or non-cirrhotic (F0–F3). Trials in MASLD are usually confined to patients with F2–F3, or cirrhosis, but we likely included some patients with F0–F1 where MALO is unlikely to occur. Second, we had no laboratory data, such as glycated haemoglobin, to assess diabetes severity and need for escalating to second-line treatment with GLP1 agonists. Diabetes duration, microvascular complications and other antidiabetic medications were, however, used as proxies for diabetes severity. The E-value suggested that an unmeasured confounder would need to increase the probability of both initiating a GLP1 agonist and experiencing MALO more than threefold to fully explain the estimated per-protocol effect.40 This suggests that the observed RR in the per-protocol analysis might be explained by residual confounding (eg, fibrosis stage). For example, a study of patients with biopsy-proven MASLD found that patients with F3 had a fourfold higher hazard of MALO than patients with F0.17 However, to fully explain the estimated effect, an unmeasured confounder would also need to be three times more likely to occur in either group. The presence of compensated cirrhosis was balanced between groups and other parameters associated with fibrosis such as age and cardiovascular disease were also well balanced and thus suggests that large differences in fibrosis are unlikely. Additionally, we sampled 5% of non-initiators to increase computational efficiency. Point estimates were, however, similar when including all non-initiators.

In conclusion, the risk of MALO in patients with chronic liver diseases and type 2 diabetes was lower if they initiated a GLP1 agonist and adhered to this treatment over time. The data were, however, not compatible with a protective intention-to-treat effect. Randomised trials using MALO as an outcome might be unfeasible, motivating further large observational studies using appropriate methodology to further delineate the effect of GLP1 agonists on the risk of MALO, complementing future phase III trials of GLP1 agonists.

Data availability statement

No data are available. No data are available due to Swedish regulations.

Ethics statements

Patient consent for publication

Ethics approval

This study involves human participants and the Regional Ethics Board of Stockholm approved the study and waived informed consent (protocol number 2017/1019-31/1).

Acknowledgments

AW was supported by the Syskonen Svensson foundation for medical research (2021-00284), Mag-tarmfonden, the Bengt Ihre foundation (SLS-973809), Professor Nanna Svartz foundation (2022-00448) and the Stockholm County Council (FoUI-985859). AAM was supported by the Strategic Research Program in Epidemiology at Karolinska Institutet, Forte (2020-00029), and the Swedish Research Council (2021-02236). HH was supported by the Stockholm County Council (FoUI-960537), the Swedish Research Council (2021-01293) and the Swedish Cancer Society (22210 Pj01H).

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • AAM and HH are joint senior authors.

  • X @wester_axel, @YingShang1, @hanneshagstrom

  • Correction notice This article has been corrected since it published Online First. The senior author statement has been added.

  • Contributors Contributor and guarantor information:

    Study conception and design: All

    Acquisition of data: HH

    Statistical analysis: AW and AAM.

    Analysis and interpretation of data: All

    Drafting of manuscript: AW.

    Critical revision: All

    Guarantors of article: AW and HH.

  • Funding AW was supported by the Syskonen Svensson foundation for medical research (2021-00284), Mag-tarmfonden, the Bengt Ihre foundation (SLS-973809), Professor Nanna Svartz foundation (2022-00448), and the Stockholm County Council (FoUI-985859). AAM was supported by the Strategic Research Program in Epidemiology at Karolinska Institutet, Forte (2020-00029), and the Swedish Research Council (2021-02236). HH was supported by the Stockholm County Council (FoUI-960537), the Swedish Research Council (2021-01293), and the Swedish Cancer Society (2 2210 Pj 01 H). The funders had no role in the conduct of this study.

  • Disclaimer The funders had no role in the conduct of this study.

  • Competing interests HH: institution has received research grants from Astra Zeneca, EchoSens, Gilead, Intercept, MSD, Novo Nordisk and Pfizer, all outside the current study. The other authors declare no conflicts.

  • Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Linked Articles