Objectives: To estimate the effectiveness, cost-effectiveness and resource impact of faecal occult blood testing (FOBT) and flexible sigmoidoscopy (FSIG) screening options for colorectal cancer to inform the Department of Health’s policy on bowel cancer screening in England.
Methods: We developed a state transition model to simulate the life experience of a cohort of individuals without polyps or cancer through to the development of adenomatous polyps and malignant carcinoma and subsequent death in the general population of England. The costs, effects and resource impact of five screening options were evaluated: (a) FOBT for individuals aged 50–69 (biennial screening); (b) FOBT for individuals aged 60–69 (biennial screening); (c) once-only FSIG for individuals aged 55; (d) once-only FSIG for individuals aged 60; and (e) once-only FSIG for individuals aged 60, followed by FOBT for individuals aged 61–70 (biennial screening).
Results: The model suggests that screening using FSIG with or without FOBT may be cost-saving and may produce additional benefits compared with a policy of no screening. The marginal cost-effectiveness of FOBT options compared to a policy of no screening is estimated to be below £3000 per quality adjusted life year gained.
Conclusions: Screening using FOBT and/or FSIG is potentially a cost-effective strategy for the early detection of colorectal cancer. However, the practical feasibility of alternative screening programmes is inevitably limited by current pressures on endoscopy services.
- CRC, colorectal cancer
- FOBT, faecal occult blood test
- FSIG, flexible sigmoidoscopy
- LYG, life year gained
- ONS, Office for National Statistics
- QALY, quality adjusted life year
- RCT, randomised controlled trial
- colorectal neoplasms
- costs and cost analysis
- mass screening
Statistics from Altmetric.com
- CRC, colorectal cancer
- FOBT, faecal occult blood test
- FSIG, flexible sigmoidoscopy
- LYG, life year gained
- ONS, Office for National Statistics
- QALY, quality adjusted life year
- RCT, randomised controlled trial
Colorectal cancer (CRC) is the third most common form of cancer in the UK, where approximately 34 500 new cases of CRC are diagnosed each year,1 resulting in around 16 200 CRC-related deaths annually.2 Recent evidence from randomised controlled trials (RCT) of faecal occult blood testing (FOBT) suggest that population-based screening for CRC can significantly reduce mortality,3–7 and early results from the UK demonstration pilot suggest that population screening using FOBT is feasible within current healthcare resource constraints.8 Evidence relating to the effectiveness of flexible sigmoidoscopy (FSIG) as a screening option for CRC is less well established, although three RCTs are currently underway.9–11
It has been estimated that current annual expenditures on the surgical, adjuvant and palliative treatment of CRC are around £300 million11; this is likely to increase substantially as the National Institute for Health and Clinical Excellence (NICE) issues guidance on newer cytotoxic therapies for the adjuvant and palliative treatment of CRC. While the introduction of a national CRC screening programme would inevitably entail substantial immediate costs, such expenditures may be offset through a reduction in CRC incidence and through the less intensive treatment required for cancers detected earlier. Two mathematical models have been previously developed to estimate the cost-effectiveness of FOBT screening in the UK, based upon the Nottingham trial12–14 and the UK FOBT demonstration pilot.15 Both analyses indicated that FOBT screening may be potentially cost-effective in comparison to no screening. The aim of this study was to estimate the likely effectiveness, cost-effectiveness and resource impact resulting from the implementation of alternative CRC screening programmes using FOBT, FSIG or a combination of both tests. The study was undertaken on behalf of the English Bowel Cancer Screening Working Group, to inform the Department of Health’s CRC screening policy in England.
Modelling methodology and structure
We developed a state transition model to simulate the life experience of a cohort of individuals without polyps or cancer through to the development of adenomatous polyps and malignant carcinoma and subsequent death in the general population of England. The health economic model consists of three interlinked sub-models:
A state transition model which simulates the natural history of CRC;
A model of the screening intervention and subsequent colonoscopic surveillance which interacts directly with the natural history model; and
A mortality model which reflects age-specific “other-cause” mortality, mortality due to CRC and mortality resulting from perforation due to endoscopic procedures.
The state transition modelling methodology is particularly useful for modelling diseases or conditions whereby risk is ongoing over time, where events may occur more than once, and where the timing of events is important.16,17 Central to this methodology is the division of the given disease process into a finite number of mutually exclusive health states, and the division of the relevant time horizon for the analysis into equal increments of time (Markov cycles). At any point in time, all patients must exist within one of the defined health states. The distribution of patients across the health states over time is governed by a series of transition matrices which describe the probability of transiting from the current health state to an alternative health state during each model cycle. Costs and utilities are associated with spending time in each health state or with the transition between health states; these are aggregated over the time horizon to provide an estimate of the expected costs and health outcomes of each screening option.
The model estimates the expected costs, life years gained (LYGs) and quality adjusted life years (QALYs) gained associated with five screening options as compared to a policy of no screening:
Biennial FOBT for individuals aged 50–69;
Biennial FOBT for individuals aged 60–69;
Once-only FSIG for individuals aged 55;
Once-only FSIG for individuals aged 60; and
Once-only FSIG for individuals aged 60, followed by biennial FOBT for individuals aged 61–70.
The structure of the health economic model was informed by a critical review of existing CRC screening models,18 in particular the earlier work of Frazier and colleagues,19 and was developed alongside methodological guidelines for modelling screening interventions20 (for details of parametric and structural assumptions used within previous CRC screening models see supplementary appendix available at http://gut.bmj.com/supplemental).
CRC natural history model
The natural history of CRC was modelled as a series of transitions between mutually exclusive health states (fig 1). The model uses age-dependent transition matrices to calculate the number of individuals in each health state for each model cycle through a process of iterative matrix multiplication. Transitions between model health states are calculated using an annual cycle length until the entire model cohort has been absorbed into the dead states. Health states were defined according to the true underlying histological state of the individual. Health states describing the presence of adenomas and pre-clinical cancers were defined in terms of the “index” lesion; that is, the greatest malignant potential of the adenoma or the most advanced cancer present. Individuals with adenomas were defined as “low-risk” or “high-risk” to reflect current guidelines for endoscopic surveillance following adenoma removal.21 Individuals with pre-clinical or diagnosed CRC were modelled according to the Turnbull modification of Dukes’ staging.22,23 Owing to the lack of direct evidence concerning the rate at which de novo cancers develop, the model assumes that all cancers arise from pre-existing adenomas. Separate health states were assigned to the presence of distal and proximal polyps and cancers, as FSIG can only detect neoplasia in the distal portion of the colon. Incidence rates for adenomas and cancers are assumed to be higher in the distal colon than in the proximal colon. Owing to a lack of evidence to the contrary, the model assumes identical disease progression rates in both the distal and proximal colon.
At the beginning of the simulation, the model cohort is aged 30, at which point the prevalence of polyps and CRC is assumed to be zero. Subsequently, the cohort is filtered through the health states according to transition probabilities based on relevant literature24–31 and through the calibration of model outputs against published CRC incidence1 and mortality rates.32
Screening and surveillance intervention model
We applied a screening intervention and surveillance model to the natural history model to estimate the potential impact of the earlier detection and removal of polyps through endoscopy, the detection and treatment of CRC, and the ongoing surveillance of high-risk individuals in whom adenomas are detected. The test characteristics of FOBT, FSIG and colonoscopy were defined in terms of the probability of achieving a positive or negative test result given an individual’s true underlying histological state. The impact of the screening test, follow-up colonoscopy and treatment of detected polyps and cancers was modelled by re-distributing the model cohort across the health states at the point of screening and surveillance. Individuals in whom adenomas are found are assumed to undergo polypectomy and are subsequently assigned a higher risk of adenoma recurrence.33 Individuals identified as high-risk at screening are assumed to enter into a 3-yearly colonoscopic surveillance programme. Individuals with pre-clinical detectable CRC may present symptomatically during any model cycle; the probability of clinical presentation was estimated through model calibration and is assumed to increase according to cancer stage. Individuals in whom previously undetected CRC is identified enter into a clinical management state. Individuals in whom neither adenomas nor cancer are detected are assumed to be re-invited to attend CRC screening during the next round.
The model includes three types of mortality: other cause mortality, CRC-specific mortality and death following endoscopic perforation. The probability of dying from other causes was modelled as an age-dependent probability depending on the age of the cohort during each model cycle, based on UK life tables.34 Dukes’ stage-specific mortality rates were estimated within the model calibration process through fitting the model against published Office for National Statistics (ONS) incidence1 and mortality data.32 The probability of death due to endoscopic perforation was modelled according to the experience of existing screening studies.11,35
Within the base case analysis, constant participation rates were assumed for screening and colonoscopic surveillance, based upon evidence from UK CRC screening trials and the UK FOBT pilot.11,15,36 Participation rates during each screening round were assumed to be independent of the individual’s previous participation in screening or surveillance for CRC.
Health outcome valuation
For the cost-effectiveness analysis, health outcomes were valued in terms of LYGs. For the cost-utility analysis, LYGs were adjusted according to the level of health-related quality of life associated with each health state, thus giving an estimate of the number of QALYs gained. Spending one model cycle in a particular model health state is associated with a state-specific utility score (whereby 0 = “dead” and 1 = “perfect health”). A constant utility score was applied to all non-cancers states. Lower utility scores were assumed for clinically diagnosed CRC states to reflect the severity of the disease and the invasiveness of treatment. These estimates were derived from a standard gamble health utility study in individuals who had previously undergone removal of colorectal adenoma.37
Resources and costs
The model incorporates two groups of costs: costs associated with the screening programme and costs associated with the diagnosis, treatment and follow-up of CRC. The cost of FOBT was assumed to include the cost of two tests and an additional administration cost. The cost of FSIG was derived from Whynes et al.38 The cost of colonoscopy was taken from the NHS Reference Costs.39 Resources and costs associated with the diagnosis, treatment and follow-up of patients with CRC were sourced from the literature and using expert clinical opinion. Key assumptions underpinning these cost estimates and the treatment pathways they are intended to reflect are detailed within the full study report.18 In line with current UK recommendations,40 all costs and health outcomes were discounted at 3.5% per year.
Model calibration methods
Several model parameters, in particular transition probabilities from adenoma health states to CRC, and probabilities of transiting through the pre-clinical cancer states cannot be empirically observed.19 Where possible, likely ranges for uncertain or unknown parameters were derived from the available literature. Each model parameter was then assigned a wide uniform distribution; Monte Carlo sampling was undertaken whereby all unknown parameters were simultaneously sampled and propagated through the model over 60 000 iterations. Those parameter combinations which generated the minimum mean squared errors between the model predictions and published incidence,1 mortality,32 stage distribution data41 and prevalence estimates26–31 were retained for inclusion in the model. The mean values of these samples were used to produce the central estimates of cost-effectiveness. Screening outcomes estimated by the model were validated against UK trial data.3,11
Table 1 presents a list of all parameter values assumed within the model, together with the sources of evidence used to inform each parameter.
Sensitivity analysis was undertaken to explore the impact of alternative parameter values on the marginal cost-effectiveness of each of the screening options. One-way sensitivity analysis was undertaken to explore the impact of changing individual values for cost and participation parameters on central estimates of cost-effectiveness. Probabilistic sensitivity analysis was undertaken using the potentially valid parameter sets identified within the calibration process, together with uniform probability distributions for the sensitivity and specificity of the screening and surveillance tests. The model was run 2000 times in order to generate a distribution of expected costs and health outcomes; the results of these data are presented as marginal cost-effectiveness planes.
Central estimates of effectiveness, cost-effectiveness and cost-utility
Table 2 presents marginal health outcomes for each the five core screening options compared to a policy of no screening. The model indicates that a policy offering FOBT screening to individuals aged 50–69 would result in the greatest number of screen-detected cancers. However, the model suggests that offering FSIG at age 60 followed by biennial FOBT at age 61–70 may result in the greatest reduction in both CRC incidence and mortality.
The corresponding marginal estimates of cost-effectiveness and cost-utility for each of the five core screening options versus the no screening policy are presented in table 3. While the expected health gains for each individual offered screening are small, ranging from 4.59 to 9.89 days of life gained, the average cost per person invited to attend screening is also expected to be low. Offering FSIG once to individuals aged 60 followed by FOBT screening from age 61 to 70 is predicted to have the greatest impact on both survival and quality-adjusted survival (9.89 and 10.28 days, respectively), although this is not the least expensive strategy. Offering FOBT to individuals aged 60–69 is expected to offer the smallest survival and quality-adjusted survival gain over no screening (4.59 and 3.79 days, respectively). Offering FSIG to all individuals aged 55 or 60 appears to be the most cost-effective option compared to a strategy of no screening; under the base case scenario these options dominate the no screening policy (ie, they are less expensive and more effective).
Table 4 presents the results of the one-way sensitivity analysis; these results show the impact of alternative parametric assumptions on the marginal cost-utility estimate for each screening option as compared to no screening. Each scenario describes alternative assumptions concerning between one and four model parameters, as described in the table. The final two rows show the impact of alternative sets of natural history parameter values, as estimated through the calibration of the model, on the marginal cost-utility of each screening option.
Table 4 suggests that the model is relatively insensitive to changes in most of the groups of parameter values. The greatest impact on cost-utility results from the use of alternative sets of calibrated transition probabilities; these clearly represent an important area of uncertainty within the current evidence base.
Figure 2 presents the results of the probabilistic sensitivity analysis using a marginal cost-effectiveness plane; this shows the distribution of marginal costs and QALYs resulting from the use of alternative screening options in comparison to a policy of no screening. The uncertainty analysis demonstrates wide variability in marginal costs and effects for all five screening options, in particular the policy of offering FOBT biennially to all individuals aged 50–69. While the plane suggests a high degree of clustering of costs and effects, there is some overlap between the once-only FSIG options when compared to no screening. Despite the uncertainty surrounding the natural history of the disease and the true sensitivity of the screening tests, all of the screening options appear to be economically attractive when compared to a policy of no screening. In particular, FSIG offered to individuals aged 55 or 60 is always expected to dominate the no screening policy. Figure 2 clearly suggests that the FOBT 60–69 policy is always expected to be more expensive and less effective than the once-only FSIG options.
Additional resource use estimates
An assessment of the impact of each screening strategy upon endoscopy services was undertaken alongside the cost-effectiveness analysis. This involved identifying the relevant age cohort of screen-eligible patients for each screening strategy and propagating them through the model to determine the expected total number of colonoscopies and FSIGs, in addition to the number of patients being treated with surgery, chemotherapy and radiotherapy (both for primary disease and subsequent relapse). This analysis considered the expected impact in the 5 year period following the implementation of each screening strategy, and enabled the generation of estimates of additional resource requirements in terms of nurse-trained endoscopists, gastrointestinal consultants and endoscopy units. There are currently around 240 hospital-based endoscopy units in England, at which approximately one third of all activity is concerned with CRC. It is assumed that 80% of these units have two endoscopy rooms, with the remainder having either one room or more than two rooms. Table 5 outlines the results of the resource use analysis for the first year following implementation of a screening programme. These figures relate to the additional resource use requirements of each screening option over a policy of no screening for the entire population of England.
Health economic results
The CRC screening model suggests that screening using FSIG or FOBT, or a combination of both, is likely to have a cost-effectiveness profile which is better than many interventions which are currently funded on the NHS. The health economic model suggests that once-only screening using FSIG is more effective and less expensive than a policy of no screening. While the lifetime costs of FOBT screening may be more expensive than those for FSIG screening, the model suggests that biennial screening using this test may produce health gains at a cost which is currently considered acceptable to health care policy-makers.
The resource use analysis indicates that there are considerable differences between the screening strategies in terms of endoscopy staffing and capital requirements. If total endoscopy services are constrained, the favoured option is likely to be the programme of biennial FOB testing between the ages of 60 and 69. However, this option is estimated to be the least effective in terms of quality adjusted survival and is estimated to have a total first year cost in the middle of the estimated range. If financial resources are constrained, the FSIG options may be considered the most appropriate, with lower costs due to a smaller cohort of people being screened than with FOBT strategies. The two FSIG strategies have a similar impact upon the marginal QALYs gained compared to a policy of no screening, and minimise the requirement for consultant gastroenterologists, though relying on a greater number of nurse endoscopists. The cost and resource analysis suggests that the most costly option in terms of screening and cancer management costs in the first year of the screening programme would be for the FOBT age 50–69 strategy, owing to the high number of people who would be offered screening each year. The higher number of cancers detected under the FOBT age 50–69 option would have significant cost implications, particularly in terms of the increase in surgery volume of around 5500 in the first year. Significant investment would be needed to meet the requirements of the implementation of any of the screening strategies on a nationwide basis.
Limitations of the model
This model covers a broader scope than existing UK models,12–15 which have considered only the expected costs and health outcomes resulting from FOBT screening. In addition, the surveillance model is more representative of current UK endoscopic practice, and up-to-date cancer stage-specific utility data are used. Importantly, cost and ethical considerations would prohibit a trial-based evaluation of all potential CRC screening options. Model-based evaluations allow the available evidence to be brought to bear on policy decisions. As with any health economic model, our analysis incorporates both structural and parametric assumptions which influence the estimated costs and consequences of alternative screening options. It is important that these cost-effectiveness results are interpreted in the light of the uncertainties within the existing evidence base.
Evidence concerning the true prevalence of colorectal adenomas within the general population of England is limited. Age-specific adenoma prevalence estimates were drawn from six autopsy studies26–31; however these studies are dated and may not reflect current prevalence rates in England. This is an important gap in the existing evidence base.
The absence of direct evidence on rates of transition between disease states means that several of the model parameters had to be fitted to published data. However, there are several potentially valid solutions which fit the data. This is a critical point; few previous economic analyses of CRC screening have reported the way in which model parameters have been fitted against published incidence and mortality data. In turn, such analyses are unlikely to have allowed for the impact of the joint uncertainty in these parameter values on resulting cost-effectiveness estimates. Given the uncertainty surrounding the underlying natural history of CRC, alternative combinations of these parameter values have the ability to produce favourable or unfavourable estimates of cost-effectiveness for any CRC screening modality. Our analysis attempts to comprehensively and explicitly model the uncertainty surrounding the natural history of CRC through the probabilistic calibration of model outputs against published incidence and mortality, as well as the use of Monte Carlo sensitivity analysis to describe the impact of this uncertainty on the resulting cost-effectiveness estimates.
It is broadly accepted that most CRCs arise from pre-existing adenomatous polyps; while there is some indirect evidence which suggests the possibility that a small proportion of CRC arises de novo, this is subject to considerable uncertainty. Our model assumes that all cancers derive from pre-existing adenomas; this assumption favours all screening options. In particular, the impact of this assumption is that those screening strategies which have a high sensitivity for detecting adenomas (ie, FSIG) will be favoured by the model. Notably, this assumption has also been applied within several existing health economic models of CRC screening.15,19,48–50 Further, probabilities of cancer progression are assumed to be equivalent in both the distal and proximal colon.
Evidence concerning the true differential sensitivity of FOBT in detecting small low-risk adenomas, high-risk adenomas and CRC is generally weak. Some previous models have assumed that the sensitivity of FOBT in detecting small and large adenomas is the same,15,19,51,52 while others have assumed different sensitivities for small and large adenomas.48 The absence of good quality empirical evidence makes it difficult to justify the values that these parameters should take. We assumed a single 5% sensitivity value for all adenomas, which broadly reflects the findings of the Nottingham RCT3; this assumption was based upon the advice of the English Bowel Cancer Screening Working Group. However, the impact of this assumption on the cost-effectiveness of FOBT is minimal.
The uncertainties surrounding the health economic model presented here are a result of the paucity of direct evidence concerning the natural history of CRC and the operating characteristics of the screening tests. The central uncertainty concerns the true prevalence of undiagnosed asymptomatic CRC in England; inevitably, this is highly influential in determining the effectiveness and cost-effectiveness of any CRC screening programme. While the natural history of the disease cannot be observed using standard study designs, there are two potential designs that could provide information.
Firstly, valuable information could be obtained through undertaking a large autopsy series in England. Given a sufficiently large sample size, such a study could be used to obtain better estimates of adenoma incidence rates and natural history parameters, and to determine the underlying prevalence of pre-clinical CRC. Secondly, an analysis of existing screening trials is warranted. The problem with the screening trials, and conventional methods of analysis, is that the results confound the natural history and the characteristics of the screening test. Analytical methods which synthesise data from other sources (for example ONS incidence and mortality, stage distribution at diagnosis, and survival estimates from audit studies) allow information concerning natural history and test characteristics contained within the trial data to be drawn out. Bayesian synthesis analysis of existing trial data, based upon an underlying model of disease natural history and incorporating data from a range of available sources, should be undertaken.
Screening for CRC using either FOBT, FSIG or a combination of both strategies may provide health gains at a cost which is currently considered acceptable to NHS policy-makers. The introduction of any of these screening options will inevitably require significant investment; consideration of the specific nature of current resource constraints should be considered prior to the roll-out of any of these screening options. In particular, the practical feasibility of the alternative screening programmes in the UK will be limited by current pressures on endoscopy services.
This study was commissioned by the Department of Health in England, on behalf of the Bowel Cancer Screening Working Group. We would like to acknowledge the input of members of the Bowel Cancer Screening Working Group in providing support and direction for this work, in particular, Professor Wendy Atkin, Mr Ron Parker and Professor Bob Steele. We are also indebted to Dr Sue Moss and Professor John Scholefield for providing data from the Nottingham RCT, and to Professor Wendy Atkin for providing early data from the UK flexible sigmoidoscopy trial.
Published Online First 1 December 2006
Competing interests: None.
Ethics approval: Ethical approval was not required for this study.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.