Introduction The use of patient reported outcome measures to support routine inflammatory bowel disease (IBD) care is not widespread and suggests that existing questionnaires lack relevance to day-to-day decisions or are too cumbersome to administer. We developed a simple, generic tool for capturing disease control from the patient's perspective to address these barriers.
Methods Development based on literature review, patient focus groups/interviews and a steering group, defining a limited set of generic questions. The ‘IBD-Control’ questionnaire comprises 13 items plus a visual analogue scale (VAS) (0–100). Prospective validation involved baseline completion of IBD-Control, quality of life (QoL) questionnaire (UK-IBD-Q), EuroQol (EQ-5D), Hospital Anxiety and Depression Score; and clinician assessment (blinded to questionnaire; recording Harvey-Bradshaw Index or Simple Clinical Colitis Activity Index; Global Clinician Rating; treatment outcome).
Results 299 patients returned baseline surveys (Crohn's disease, n=160; ulcerative colitis, n=139) and 138 attended for repeat visits. Completion time (mean; SD): 1 min 15 s; 25 s; Internal consistency: Cronbach's α for all 13 items (0.85); for subgroup of eight questions (‘IBD-Control-8’; 0.86). Strong correlation between IBD-Control-8 and IBD-Control-VAS (r=0.81). Test-retest reliability (2 week repeat): intra-class correlation=0.97 for IBD-Control-8 and 0.96 for IBD-Control-VAS. Construct validity: Moderate-to-strong correlations between IBD-Control-8 and IBD-Control-VAS versus activity indices, UK-IBD-Q and EQ-5D (utility) with r values 0.52–0.86. Discriminant validity (mean instrument scores for remission, mild, moderate or severe): p<0.001 (analysis of variance (ANOVA)). Sensitivity to change: Effect sizes: 0.76–1.44.
Conclusions The IBD-Control is a rapid, reliable, valid and sensitive instrument for measuring overall disease control from the patient's perspective. Unlike existing patient reported outcome measures, its simplicity, ease-of-use and generic applicability make it a candidate for supporting routine care.
- Inflammatory Bowel Disease
- Ibd Clinical
- Quality Of Life
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/
Statistics from Altmetric.com
Significance of this study
What is already known about this subject?
The routine use of patient-reported outcome measures (PROMs) in clinical practice is increasingly advocated as a means of supporting patient-centred care, informing decisions and driving service quality.
A range of PROMs has been developed to measure quality of life in inflammatory bowel disease but none has found widespread practical use in routine settings.
Barriers to adoption of PROMs include patient and administrator burden and lack of tangible relevance to day-to-day decision-making or quality assurance of services.
What are the new findings?
The IBD-Control questionnaire is the first PROM to capture disease control from the patient's perspective using a simple set of generic items applicable to all patients with IBD.
Summary scores from the instrument show strong validity versus more complex quality of life questionnaires, generic utility measures, disease activity scores and global physician assessment.
As a patient-completed screening tool, the IBD-Control provides a rapid and reliable means of identifying those in a quiescent state and the individual items and summary scores have potential to support efficient models of disease follow-up.
How might it impact on clinical practice in the foreseeable future?
In routine practice, IBD-Control has potential to provide patients and healthcare teams with a practical means of capturing simple but reliable outcome data in a standardised and easily interpretable form.
The strong measurement properties suggest potential for use in clinical studies, particularly in long-term or large-scale trials, surveys or registries where its minimal user and administrator burden will favour serial capture of PROM data in routine settings.
The goal of therapy for inflammatory bowel diseases (IBD) is to achieve and maintain disease control and thereby optimise quality of life (QoL). Hence, assessment of disease control is a core component of care and underpins management decisions. Surprisingly, no survey instrument has been developed for routine clinical practice with the specific aim of capturing disease control from the patient's perspective.
The routine use of patient reported outcome measures (PROMs) in healthcare is gaining increasing political and professional support as a means of informing day-to-day decisions and driving service quality.1 ,2 PROMs are standardised, validated questionnaires intended for completion by patients in order to measure their perceptions of their own functional status and well-being. PROMs have begun to find a role in national audits and registers and there is rapidly growing interest in their potential to inform individual care.1
No single PROM instrument has gained widespread popularity in the UK or internationally for routine use in IBD. When formal measurement of health status is undertaken (eg, for clinical trials) the traditional emphasis has been on clinician-reported indices assigning scores for selected symptoms, vital signs and laboratory or endoscopic parameters.3–7 However, it has long been recognised that clinician-reported indices are not entirely objective 8 and fail to capture the impact of disease from the patient's perspective.
The last decade has seen a wealth of qualitative research employing focus groups, interviews, expert panels and factor analysis to identify key issues for patients with IBD.9–17 This has resulted in a number of multidomain PROMs being developed as measures of disease-specific QoL, typically combining symptom questions with more generic questions that relate to overall well-being, energy/vitality, bodily pain and impacts of disease on physical, social and emotional function.9–17 Only a limited number of these instruments has found use in clinical trials 10 ,16 but none has established a significant place in routine practice. This may reflect, in part, the patient and administrator burden inherent in using lengthy multidomain questionnaires. Shortened forms may have more potential for routine clinical applications 18 ,19 but are not used widely.
If PROMs are to find a place in routine care, instruments need to be acceptable to patients and healthcare teams, demonstrating added value to normal practice and with simple and clinically-relevant interpretation. This means combining a high level of user-friendliness (ie, short and rapidly completed tools with low burden and easy interpretation) with strong and explicitly stated measurement properties to support decisions at the level of the individual patient. The objective of this study was to develop and validate a novel questionnaire intended to rapidly capture disease control from the patient's perspective (‘IBD-Control’). Our design criteria mandated that the tool should be short, simple and generic in content in order to maximise its potential for routine use across the full spectrum of patients with IBD. As there is no gold standard measure of disease control in IBD, we used a wide array of external measures to validate the questionnaire, including disease activity indices, generic and disease-specific QoL questionnaires and pragmatic clinical end points (global physician assessment and treatment outcome).
Design criteria: Based on the intended practical clinical application for the IBD-Control questionnaire, we formulated a number of guiding principles for its development. These design criteria were used as a filter to select items generated from literature review and the qualitative patient study. Essential attributes for the choice of individual question items were specified as follows: (A) Include items that capture a patient's global self-assessment of overall disease control; (B) Items must reflect generic areas of concern, function or impact that are important to patients with both main forms of IBD; (C) Items should not focus on individual gastrointestinal symptoms since their relevance and relative importance will vary depending on disease type and extent.
Criteria for the overall design of the instrument were: (A) Minimal user burden (ie, a rapid completion time with a limited total number of question items and simple response categories); (B) The tool should generate overall summary scores with clearly defined interpretation at the individual patient level; and (C) The development and validation process should provide evidence for its acceptability, reliability, validity and responsiveness based on modern psychometric standards.20–22 The project was guided by a steering group of medical and nursing professionals with expertise in the management of IBD.
Literature review of existing PROMS used in IBD: A detailed review of the English-language literature was undertaken using the PubMed electronic database supplemented with manual searching of bibliographies and conference proceedings within relevant journals. We used combinations of the search terms ‘inflammatory bowel disease’, ‘ulcerative colitis’, ‘Crohn's disease’, ‘questionnaire’, ‘validation’, ‘quality of life’, ‘patient-reported outcome’ and ‘control’. We excluded clinician-reported measures (ie, disease activity indices) and those developed specifically for paediatric patients. Self-administered PROMS reported for adult patients with patients with IBD where categorised as either generic (instruments developed originally for general use across different diseases) or disease-specific (tools developed specifically for patients with IBD). From the literature review we generated a list of core domains (figure 1) to serve as a framework for analysing outputs of the qualitative work with patients.
Qualitative study: We undertook two focus groups (six patients), one for each condition (ulcerative colitis (UC) or Crohn's disease (CD)) plus 13 one-to-one interviews with informed consent. A semistructured approach was taken to encourage patients to talk about the general concept of ‘control’ of IBD and the things they associated with ‘good control’ or ‘poor control’. Verbatim contributions were recorded and summarised as field notes by the lead interviewer (CO). These items or themes were grouped and mapped onto a framework based on the review of existing IBD questionnaires. At the end of each session, the patients were shown a selection of questionnaires and asked to contribute ideas or preferences about question format, layout, response categories, item number and completion time. ‘Saturation’ was reached when patient interviews ceased to add new items or themes to the existing framework.
Selection of items: First, we selected a set of generic questions to represent each of the core domains (figure 1), identifying where possible individual question items that have broad, general coverage rather than specific wording to conform to our design criteria. For example, the item selected for social impact would ask about missing ‘planned activities’ (rather than a more specific question about missing ‘work’, ‘study’ or ‘a social event’). To qualify for potential inclusion in the IBD-Control, each item needed to be identified as relevant to the construct of disease control in the patient contributions and validated in piloting.
Piloting and modification: To test for acceptability, lack of ambiguity and content validity we pretested the draft instrument in a pilot group of 30 patients who were asked to complete the IBD-Control and provide verbal or written feedback or annotations regarding content and format. This resulted in changes to item wording or layout.
Prospective validation phase (psychometric testing)
The prospective validation study recruited patients at the investigating centre, a university hospital serving a population of approximately 330 000 people with an established secondary care IBD service providing outpatient and inpatient medical and surgical care.23 Patients were recruited during routine visits to the outpatient clinics prior to consultation with the doctor or specialist nurse, during other treatment-related visits (eg, azathioprine monitoring clinic or infusion visit for biologics) or at the time of admission for inpatient care. Inclusion criteria specified a confirmed diagnosis of IBD on the basis of clinical, endoscopic, radiological and/or histological criteria with disease duration of at least 6 months. Exclusion criteria specified non-English speaking subjects, cognitive impairment or serious active psychiatric disease.
After informed consent, the patients completed the IBD-Control questionnaire and were then asked to complete a questionnaire pack comprising a disease-specific QoL questionnaire (the UK-IBD-QoL),16 ,17 a generic health status instrument (EuroQol, EQ-5D-3L),24 and the Hospital Anxiety and Depression Scale.25 The research team undertook a simultaneous assessment of current disease activity, using the Harvey-Bradshaw Index (HBI) for CD 5 and the Simple Clinical Colitis Activity Index (SCCAI) for UC.6 The research team and clinicians were blinded to the results of patient-completed questionnaires.
Where the study visit was taking place at a scheduled clinical review (eg, outpatient attendance), the treating clinician or specialist nurse was asked to complete a questionnaire at the end of the consultation to indicate the current state of IBD (Global Physician Assessment) using a categorical scale (remission, mild, moderate or severe), blinded to patient surveys. All treatment decisions were recorded, capturing whether new therapies were started, existing drug doses changed, therapies discontinued or surgery recommended. The research team reviewed the hospital case records and clinical information systems to extract background clinical information regarding diagnosis, duration of disease, previous hospitalisation and surgery, disease extent, presence of stoma or perianal fistulae, major comorbid illness and current therapy for their IBD.
Assessment of psychometric (measurement) properties of IBD-Control
Internal consistency was assessed for categorical questions and any subscales derived from them using Cronbach's α.26 In addition, we used Spearman's correlation to study the association between individual categorical questions and the rating from the instrument's visual analogue scale (IBD-Control-VAS).
Reproducibility was assessed under two sets of conditions using serial observations and the Intraclass Correlation Coefficient. First, a group of 20 patients underwent a repeat assessment at 2 weeks from baseline, completing the IBD-Control and UK-IBD-QoL on both occasions (test-retest group). In this group, we defined a stable health state based on the patient's response to the IBD-Control transition question item (Q2, response=‘No change’) plus ≤10-point change on the UK-IBD-Q total score. Second, we compared IBD-Control scores obtained from a group of patients who completed assessments at baseline and then at a subsequent routinely scheduled visit (routine care group). In this group, we defined a stable state on the basis of there being no change in the physician global assessment between visits.
Construct validity was assessed by comparing the scores derived from the questionnaire with several independent external measures of patient health state. These measures included generic QoL (EQ-5D utility index and EQ-5D visual analogue score), disease specific QoL (UK-IBD-QoL), physician global assessment and clinical disease activity indices. Bivariate correlations were expressed as Spearman's correlation coefficients and between-group mean scores were compared by t test or analysis of variance (ANOVA). Multiple variable linear regression was used to explore the relative strength and independence of individual question items as predictors of the various external measures of health state and to select items to include in subscores.
Responsiveness was assessed in those patients who completed baseline and follow-up assessments and experienced a change in their health state. There is no true external gold standard to define a change in disease control and so we performed responsiveness analyses using two alternative external criteria to define a change in health state between visits. First, we defined improvement or deterioration on the basis of self-reported UK-IBD-QoL total score, with a change of >10 points taken as a cut-off. As an alternative, we used the physician global rating to identify a change in clinical state (any change from baseline category, eg, mild to moderate). A range of responsiveness statistics were calculated including correlations in change scores, effect size, standardised response mean (SRM) and modified SRM (MSRM).
Definition and evaluation of cut-off values for detecting the ‘quiescent’ state: A key potential application for IBD-Control was to serve as a screening test to rapidly and reliably identify patients with optimal self-reported disease control. Such patients are suitable for alternative forms of chronic disease follow-up (eg, self-care, telephone or virtual clinics) rather than traditional, costly and inconvenient office or clinic attendance. We defined a state of ‘quiescent’ disease using a combination of parameters, as follows:
Disease-specific measures consistent with clinical remission as defined by disease activity indices within reported remission ranges (SCCAI<4; HBI<5), UK-IBD-QoL total score ≥90 points and Physician Global Assessment rated as ‘remission’. To allow for missing or incomplete data, patients were required to have sufficient data to satisfy any two of the criteria.
Not currently receiving oral corticosteroid therapy, a course of antibiotics, therapeutic (polymeric) diet, induction with biologics or awaiting surgical treatment.
IBD-Control transition question (Q2) indicates condition was unchanged or improved in the last 2 weeks (ie, excluding patients who responded that bowel symptoms were ‘worse’).
No escalation of therapy occurred as a consequence of the clinical consultation.
Individuals satisfying all four of the above criteria would be regarded as well-controlled. Patients who did not meet these criteria where designated as ‘not quiescent’. We used receiver-operating characteristics analysis to assess the performance of the IBD-Control as a tool for identifying this subgroup of quiescent patients. Given the intended clinical application, we defined optimum cut-off scores with an emphasis on achieving high specificity (≥85%) to minimise the risk of false positives (identifying a non-quiescent patient as quiescent). The cut-off value with the highest sensitivity×specificity product was selected, as described.27
Literature review: A PubMed search identified 7244 references with a combination of search terms: [‘Patient reported outcome measure’ OR ‘PROM’ OR ‘quality of life’ OR ‘questionnaire’ OR ‘outcome measure’] AND [‘Inflammatory bowel disease’ OR ‘ulcerative colitis’ OR ‘Crohn's disease’ OR ‘colitis’ OR ‘IBD’]. Of these, 682 articles included the additional terms: [‘development’ or ‘validation’ OR ‘psychometric’]. We identified 71 relevant articles relating either to IBD-specific questionnaires (23 instruments for adults including variants or short forms; 4 for paediatrics were excluded) or to the validation of existing generic PROMS in patients with IBD (3 instruments). Of the disease-specific PROMS for adults with IBD, only the original McMaster IBD-Q (32 items) 10 and a similar instrument developed in the UK 17 were found to have undergone detailed psychometric evaluation in accordance with modern guidelines 20 ,21 and subsequent use as outcome measures in clinical trials. Hence, we chose the UK IBD-Q as our measure of QoL in the validation study. Of the generic instruments, the EuroQoL EQ-5D 24 had the most publications and so was our choice of utility measure. No instrument was identified that measured IBD disease control from the patient perspective but we reviewed questionnaires for other chronic diseases.
Focus groups and one-to-one interviews: Saturation was reached after completing two focus groups and 13 one-to-one interviews. Consistently, the idea of disease control was expressed by contributors as their overall sense that disease-related symptoms were minimised or absent (ie, a global sense of control). As expected, the control of a wide variety of specific gastrointestinal symptoms was identified as important by individual patients but these did not meet our criteria for generic item selection. However, a common theme across all participants was the idea that good control implied that their particular array of bowel symptoms was minimised and not impacting on day-to-day physical functioning (overall energy and vitality; and sleep), social functioning (daily activities, work and recreational) or emotional functioning (feelings of anxiety or depression). In addition, the control of symptoms of discomfort or pain was a distinct factor. Loss of disease control was expressed as either a worsening of overall bowel symptoms or the development of an unfamiliar new symptom.
Control of IBD was linked also with ideas about the perceived effectiveness or acceptability of current treatments. This is not a core theme that appears in the established model for QoL in IBD.9–19 Good control would imply feelings that treatment was ‘right’ or ‘was working’ whereas poor control would invoke the sense that treatment was ineffective. For some patients, the ability to control the condition themselves (ie, by making adjustments to medication according to symptoms) was seen as part of the concept of disease control. Additional treatment-related themes included perceived difficulties with using certain drugs or side effect. Good control would imply a lack of such treatment concerns.
Description and specification of the IBD-Control instrument
The IBD-Control instrument is shown in online supplementary appendix 1. The paper version comprises a single-sided A4 document that includes five sections numbered 1 through 5. The first four sections contain a series of 13 categorical questions, each of which has three response options. Of the categorical questions, 12 items have response options of ‘Yes’, ‘No’ or ‘Not sure’ and the remaining question is a transition question for overall bowel symptoms with options of ‘Better’, ‘No change’ or ‘Worse’. The fifth section contains the horizontal VAS, anchored between zero (worst possible control) and 100 (best possible control). The time horizon for disease control assessment is the ‘past 2 weeks’.
Scoring: Each response to the 13 individual items is scored as follows: zero points for least favourable reply; one point for intermediate or indeterminate reply; two points for most favourable reply. The IBD-Control-8 subscore is calculated by summing scores for Q1a, Q1b, Q3a, Q3b, Q3c, Q3d, Q3e and Q3f resulting in a range of 0–16 (0=worst control). The rationale for selecting this subgroup of eight questions is explained later in the results section. The IBD-Control-VAS scores are in the range 0–100 (0=worst control).
The characteristics of the patient sample at baseline are summarised in table 1. There were 299 patients who completed the baseline assessment, of whom 160 (53.5%) had a diagnosis of CD and 139 (46.5%) were suffering from UC. The spectrum of disease severity was very similar between the two forms of IBD (CD vs UC), with no significant difference in mean scores for generic QoL (EQ-5D utility or VAS), disease-specific QoL (UK-IBD-QoL total score) or proportion of patients in remission (physician global assessment). Of the baseline cohort, 138 patients were recruited to undertake a repeat assessment at a scheduled follow-up visit (returning patients), summarised in table 2. Patients were recruited between March 2011 and June 2012.
Acceptability and feasibility
Completion time was measured in a sample of 10 patients (aged between 25 years and 64 years), with a mean completion time for the IBD-Control of 1 min and 15 s (SD: 25 s). The range was between 42 s and 2 min 1 s. Completion rates for the 13 individual questions ranged between 93.3% (Q2) and 99% (Q3a and Q3d). Of the 299 patients who completed the IBD-Control at baseline visit, 259 (86.6%) provided responses to all 13 question items and 272 (91%) completed the VAS question. We prespecified that any question returning the same response from 80% or more of the patients would be considered for exclusion from the questionnaire. Such questions are unlikely to be sensitive to different levels of severity. However, none of the question items yielded the same response from 80% or more of the patients.
Scored responses to each of the 13 individual questions showed significant positive correlations with the IBD-Control-VAS score, with Spearman's ρ values ranging from 0.24 to 0.70 (table 3). These findings suggest that the individual items within the questionnaire are measuring aspects of the same construct (‘disease control’) and that there is internal consistency within the questionnaire. A linear regression model retaining eight of the question items (IBD-Control-8) accounted for 70% of the variance in the IBD-Control-VAS score (R-Square 0.704). Bivariate correlation coefficients for each of the eight items within the IBD-Control-8 subscore ranged from 0.39 to 0.70.
We assessed degree of agreement between the 13 categorical question items by Cronbach's α statistic.26 The higher the α value, the higher the internal consistency between the individual questions. A score above 0.70 is regarded as desirable but values in excess of 0.90 are suggestive of excessive overlap or redundancy between individual items. In the present study, Cronbach's α was 0.85 overall for all 13 question items and 0.86 for the subscore of eight questions (IBD-Control-8). This provides further evidence for internal consistency for the ‘disease control’ construct.
The IBD-Control-8 subscore and the IBD-Control-VAS score are intended to represent a summary measure of the same construct (ie, an overall rating of disease control from the patient's perspective). Hence, the two summary scores should show a strong positive correlation with each other. This was confirmed for patients with IBD overall (r=0.811; p<0.001), and for CD (r=0.78; p<0.001) and UC (r=0.83; p<0.001) separately.
This was tested under two sets of conditions. First, we evaluated reliability in a test-retest group of patients who completed evaluations 2 weeks apart. Of the 20 subjects, 13 reported no change in their disease control over the past 2 weeks (IBD-Control transition question, Q2) and had stable UK-IBD-QoL scores (±10-points from baseline). There was no significant difference in mean IBD-Control-8 or IBD-Control-VAS scores between visits and the intraclass correlation coefficient was 0.97 for the IBD-Control-8 subscore and 0.96 for IBD-Control-VAS (table 4A; p<0.01).
Second, reliability was assessed in 32 patients who completed questionnaires at consecutive clinical visits and were categorised as having no change in their physician global assessment between the two visits. The mean interval between the two visits was 131 days. Again, there was no significant difference in mean summary scores on the IBD-Control questionnaire between visits. For these patients, intraclass correlation coefficient was 0.87 for the IBD-Control-8 subscore and 0.81 for IBD-Control-VAS (table 4A; p<0.01).
There is no true gold standard for measuring disease control from the patient's perspective. However, it is possible to test a number of hypotheses concerning the logical relationships that should exist between the item scores or summary scores obtained with the IBD-Control instrument and a range of other external criteria, measures or patient characteristics.
Validity of individual question items: Good disease control would be expected to be associated with better QoL. Consistent with this, each individual question item within the IBD-Control demonstrated a significant positive correlation with the total score for the UK-IBD-QoL (table 3). Hence, ‘positive’ responses to questions within the IBD-Control are correlated with better overall disease-specific QoL. The correlation coefficients for each of the IBD-Control-8 subscore items and total UK-IBD-QoL score ranged from 0.38 to 0.70. These findings suggest the individual items within the instrument are valid.
We used a single question item to capture feelings of anxiety or depression (‘… felt anxious or depressed because of your IBD’). Responses to this question (No, Not Sure, Yes) showed moderate-to-strong correlation with the summary scores for the multi-item Hospital Anxiety and Depression Scale questionnaire 25for anxiety and depression subscores (r=0.57 and r=0.62, respectively, p<0.001).
Validity of IBD-Control summary scores: The summary scores generated by the disease control measure are predicted to correlate positively with disease-specific (UK-IBD-QoL) and generic (EQ-5D utility and VAS) QoL scores. Consistent with this, we found moderate-to-strong positive correlations between the IBD-Control summary scores (IBD-Control-8 and IBD-Control-VAS) and each of the QoL scales (table 5 and figure 2). A linear regression model containing the eight items from the IBD-Control-8 subscore accounted for 75% of the variance in UK-IBD-QoL total score (R Square 0.747).
Conversely, negative correlations would be expected to exist between IBD-Control summary scores and traditional clinical measures of disease activity or severity. In the present study, we obtained clinician-reported outcomes in the form of traditional symptom-based activity indices (HBI or SCCAI) and also as a global physician rating of current disease activity. We found highly significant moderate-to-strong correlations between the IBD-Control scores and the disease severity measures in the expected direction (ie, negative coefficients) (table 5).
These associations provide strong evidence to support the construct validity of the IBD-Control instrument. In addition, a valid measure of disease control would be expected to differentiate between mutually exclusive groups of patients categorised using an accepted external clinical criterion. Hence, mean scores for disease control would be expected to be significantly different between patients ‘in remission’ and those with ‘active disease’. Comparison of mean IBD-Control-8 and IBD-Control-VAS scores across the four physician global assessment categories (remission, mild, moderate or severe) confirmed good discriminant validity across the spectrum of disease (figure 3).
Multiple linear regression models examined whether additional patient factors (age, gender, disease type, previous surgery) were independently associated with IBD-Control scores after taking account of current disease severity. All models confirmed the strong association between IBD-Control summary scores and disease activity indices, generic and disease-specific QoL and global assessment. However, no consistent independent associations were observed for the other predictor variables.
Mapping of IBD-Control-8 to EQ-5D utilities: Ease of use makes the IBD-Control a candidate outcome measure for large scale observational clinical studies, trials, surveys or registries. Hence, the ability to convert IBD-Control summary scores to an estimate of utility could support health economic evaluations. Figure 4 illustrates the association between IBD-Control-8 subscore and the EQ-5D utility index.
Items not included in the IBD-Control-8 subscore: Five questions were excluded from the IBD-Control-8 summary subscore. The transition question (Q2) was excluded since this asks about change in status over 2 weeks rather than measuring current health status. The remaining questions (grouped together as items Q4a to 4d in the final questionnaire) had significant but weaker correlations with IBD-Control-VAS and/or UK-IBD-QoL and did not add significantly to performance of linear regression models predicting these measures. However, the development of new symptoms (Q4d) was regarded as an important ‘stand-alone’ item to retain in the IBD-Control instrument since a new symptom should serve as an automatic trigger for traditional clinical evaluation.
The three remaining questions (Q4a to 4c) are related to treatment concerns. These items were found to be predictive of the clinical outcome of the consultation in terms of likelihood of treatment escalation (note that clinicians were blinded to questionnaire responses). Of 67 patients reporting no treatment concerns on the three items, only 5 (7.5%) had their treatment escalated, whereas 59 of 192 (30.7%) patients with indeterminate or positive responses to at least one of the three questions had treatment escalated (p<0.001, χ2 test; OR: 4.1 (95% CI 1.7 to 9.8)).
A vital measurement property for any PROM is that the instrument is sensitive to gains or losses in the measurement concept of interest. We evaluated the responsiveness of the instrument by comparing serial data collected in the patients who returned for a second visit.
Change scores: A crude method for assessing responsiveness is to calculate ‘change scores’ for the instrument (ie, subtract score at visit 1 from the score at visit 2) and to examine the correlation between this value and a change score derived from another relevant clinical measure. For change scores in IBD-Control-8 subscore and IBD-Control-VAS, there were significant correlations with change scores for each of the key external outcome measures (r values ranging from 0.25 to 0.78). The strongest correlation for change in IBD-Control scores were versus the change in UK-IBD-QoL total score (IBD-Control-8, r=0.72; IBD-Control-VAS, r=0.55; p<0.001) and the weakest for HBI (IBD-Control-8, r=−0.25; IBD-Control-VAS, r=−0.32; p<0.001).
Responsiveness statistics: Several methods have been proposed to provide more robust quantitative expressions of the magnitude and meaning of changes in instrument scores. Effect size (ES) is calculated as the size of the change in scores (ie, the difference between mean scores at the baseline and repeat assessments) divided by the SD of baseline scores. Benchmarks for assessing the relative size of change have been proposed whereby an ES of 0.2 is considered small, 0.5 as medium and ≥0.8 as large. The IBD-Control-8 and IBD-Control-VAS showed moderate-to-strong ES with values ≥0.76 (table 6) indicating good responsiveness.
The SRM differs from ES in that the denominator is the SD of change scores (rather than of baseline scores). This takes account of variability in change per se rather than variability in absolute baseline scores. Responsiveness of the IBD-Control summary scores measured by SRM was strong (table 6).
Finally, in the modified SRM (MSRM) the numerator remains the mean change in absolute scores for the group but for this statistic the denominator is the SD of change scores taken specifically from individuals who are identified by other means as clinically stable. In the present study, this was taken from a subgroup of patients who showed no change in physician global rating between assessments and had a stable disease-specific QoL score (UK-IBD-Q total score changed by no more than 10 points between visits). The IBD-Control-8 and the IBD-Control-VAS scores have MSRM values ≥0.86 (table 6).
Sensitivity and specificity of IBD-control in identifying ‘quiescent’ patients
Of 217 patients at baseline who had formal clinical assessment and all relevant data, 80 individuals could be categorised as quiescent at the time of their routinely scheduled clinic visit. None required treatment escalation (including induction with biologics or referral for surgery). Receiver-operating characteristics analysis for IBD-Control-8 subscore and IBD-Control-VAS indicated strong performance could be achieved with either measure as a test to identify quiescent patients from the remaining cases (figure 5). For IBD-Control-8, a cut-off of ≥13 points identified patients with quiescent IBD with 67.5% sensitivity and 90.6% specificity, whereas for IBD-Control-VAS a cut-off of ≥85 points achieved 64.3% sensitivity and 90% specificity. These cut-off scores would mean that only 1 in 10 patients failing to meet our stringent clinical criteria for quiescent IBD would be falsely categorised as quiescent but none of these ‘false positives’ required any treatment change at the time of consultation and all were considered to be either in remission or only mildly active by the attending clinician. These data confirm the potential clinical application of IBD-Control as a rapid triage tool to categorise low-risk (quiescent) patients suitable for alternative forms of follow-up.
The IBD-Control questionnaire is a novel PROM designed to provide a rapid, user-friendly assessment of disease control from a patient's perspective. It was developed with patient involvement in defining the measurement construct (ie, what is ‘disease control’?) and in generating patient-centred items. Key requirements were that IBD-Control should be applicable to routine clinical care and that it would contain items relevant to both main forms of IBD. This resulted in the generation of a short and deliberately generic set of questions.
As expected, there was considerable consensus and overlap between items or domains identified by patient discussions of disease control and the themes reported in the extensive literature on QoL in IBD.9–19 28–30 Hence, good control implies the absence of disease impacts on energy/vitality, ability to perform daily activities, sleep, mood and freedom from pain or discomfort. Question items on these topics feature prominently in short forms of existing disease-specific QoL questionnaires and/or in the generic instruments that have been applied to IBD. In addition, our qualitative study identified a discrete domain of ‘treatment concerns’, encompassing perceptions of effectiveness of treatment, satisfaction with choice of therapy, ability to adjust own treatment and concern about side effects and we incorporated these items into the IBD-Control.
This study shows that IBD-Control is acceptable to patients based on its rapid completion time and high completion rates. Acceptability is a vital characteristic for any PROM if the intended application is to support routine care—a property that may be lacking with the current multi-item, multidomain QoL measures. Despite its simplicity, the IBD-Control exhibits strong measurement properties.
Strengths of the present study include our emphasis on patient involvement in item generation and questionnaire design to complement literature review and clinical opinion. Furthermore, we adopted a comprehensive approach to assessing the full range of psychometric (measurement properties) of the questionnaire in order to conform to current international standards.20–22 This required the simultaneous collection of a wide array of patient-reported and clinician-reported outcome measures in a large cohort of patients. A key strength of the IBD-Control tool is that it has been developed primarily for clinical application and to support patient-centred care. Hence, our validation has not rested simply on showing correlations with traditional disease measures across the patient population but has considered the performance of summary scores at individual level and for a specific clinical purpose.
We were particularly interested in generating a tool that would reliably identify a patient in a ‘quiescent’ state, defined stringently using a composite of criteria including physician rating and treatment outcome. This has application to models of care that seek to reduce the need for traditional face-to-face consultation (eg, ‘virtual’ clinics). The instrument produces scores relevant to the planning of day-to-day care, serving as a potential trigger or triage tool to guide the necessity, timing and nature of more formal clinical assessment. The inclusion of items relating to concern about treatment or new symptoms further distinguishes the IBD-Control from traditional QoL tools. Hence, it is intended to provide information to support patient-focused consultations and summary scores to support traditional clinical assessment and decision-making.
Potential limitations of our approach to PROM development was the adoption of rigid, prespecified design criteria that mandated the IBD-Control should contain only a small number of ‘general’ items with simple response categories to optimise acceptability. This meant a pragmatic approach was taken to item selection to favour broad, generically worded questions covering core domains with active exclusion of items about individual gastrointestinal symptoms. We did not undertake a process of generating a very large number of items followed by factor analysis and item reduction. We believe our approach was justified by the wealth of existing published information identifying key areas of concern for patients with IBD. We used qualitative data from patients to confirm that each item was relevant to the concept of disease control. Nevertheless, there was a risk that producing an instrument to this strict specification could have resulted in suboptimal measurement properties. However, the findings of our prospective evaluation suggest quite the opposite—the IBD-Control questionnaire has very strong properties when compared with a wide array of external measures.
There is inevitable selection bias inherent in recruiting patients willing to complete multiple questionnaires for a validation study. However, our patient sample size was large and contained individuals with a broad spectrum of demographic and clinical attributes that are similar to our local denominator IBD population.23 Although the proportion of cases with moderate-to-severe disease was only 15–20%, the linear trends and consistent confidence limits for IBD-Control scores observed across the disease spectrum suggests sufficient cases in the study. By recruiting patients at the time of ‘real-life’ clinical visits and asking for IBD-Control completion before other assessments we sought to reproduce the intended application for the tool. Clearly, further validation in other settings and in specific patient subgroups is desirable.
Current guidelines for IBD do not endorse the use of any specific PROM to guide the organisation or monitoring of care nor to support patient-level decision-making. Hence, currently available PROMs serve largely as research tools. This contrasts with some other chronic diseases where the use of specific patient-centred outcome measures has been incorporated into guidelines and care pathways.1 Recently, quality improvement programmes for IBD in a range of healthcare settings have highlighted the potential for routine capture of PROMs to help drive better patient-centred services.31 ,32 We suggest that the simplicity of the IBD-Control in combination with its strong measurement properties and relevance to both forms of IBD make it ideally suited to support such initiatives, in addition to having a potential role as an outcome measure in pragmatic clinical trials. As an addition to routine clinical care, the instrument offers the realistic possibility of capturing serial PROM data with minimal user burden and its content is eminently suited to adaptation to electronic capture via web-based systems, portals or mobile devices.
The authors wish to thank the IBD-Control Collaborative for participating in the steering group and/or contributing to the development and validation of the IBD-Control: Suhail Ahmed, Katherine Bowering, Elizabeth Brown, Anne Hurst and Tamsin Gledhill (Aintree University Hospital, Liverpool, UK), Ash Bassi (Whiston Hospital, UK), Ian London (Leighton Hospital, Crewe, UK) and Dr Sree Subramanian (Royal Liverpool University Hospital, Liverpool, UK). Thanks also to Richard Driscoll (former CEO, Crohn's and Colitis UK [NACC]), Prof John Williams and Prof I Russell (University of Swansea) and Alistair Curry and Karen Bean (AbbVie) for comments and feedback on the IBD-Control concept. We thank the EuroQoL Group Foundation for permission to use the EQ-5D-3L in our study and for providing the algorithms for computing the EQ5D index values (http://www.euroqol.org).
Review history and Supplementary material
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
Files in this Data Supplement:
- Data supplement 1 - Online supplement
Collaborators IBD-Control Collaborative: Suhail Ahmed; Katherine Bowering; Elizabeth Brown; Anne Hurst; Tamsin Gledhill.
Contributors KB: conception and design; analysis of the data; steering group; drafting of the manuscript; CO: qualitative study, contributed to patient recruitment, data extraction and to the steering group; DS and MH: contributed to patient recruitment, data extraction, and to the steering group. All authors contributed to revising the article for important intellectual content and approved the final manuscript.
Funding This study was supported by an unrestricted research grant from AbbVie (formerly Abbott Laboratories) and this provided salary for DS during the course of the study.
Competing interests KB has received research support (unrestricted educational research grant), speaker fees and support for conference attendance from AbbVie (formerly Abbott Laboratories). DS salary was funded by AbbVie.
Patient consent Obtained.
Ethics approval National Research Ethics Service (NHS Health Research Authority).
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement There are no additional unpublished data available at this time but a website is planned to host additional background information and online calculator for IBD-Control.
Potential users of IBD-Control The authors reserve copyright for the IBD-Control questionnaire and encourage potential users to visit our website for more information www.ibdcontrol.org. No charges or restrictions are placed on using the IBD-Control.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.