Introduction

Coeliac disease is a chronic disease characterised by an inflammatory response in the gastrointestinal tract and an impaired intestinal epithelial barrier.1 It is a complex genetic disorder, involving multiple genetic variants. Genetic studies on coeliac disease patients indicate that multiple chromosomal regions predispose to disease susceptibility, confirming the suggestion that common genes contribute to this disorder. Two of the coeliac disease loci, CELIAC2 on chromosome 5q23–q33 and CELIAC4 on 19p13.1,2, 3 coincide with linkage regions for inflammatory bowel disease (IBD),4, 5 giving rise to the hypothesis of common disease susceptibility. The IBD5 and IBD6 loci on 5q31 and 19p13.1, respectively, are among the most significant and consistently replicated IBD loci.6, 7 There is ample evidence that (auto)immune-related disorders share some of their genetic susceptibility factors, the best examples being variations in the CTLA-4 gene8 and PTPN22,9, 10, 11 both of which help regulate T-cell responsiveness.

IBD, clinically classified as Crohn's disease and ulcerative colitis, also shows a chronic inflammation of the gastrointestinal tract.6 Coeliac disease and IBD co-occur in families and patients, with an approximately five-fold increased prevalence of IBD in coeliac disease patients,12, 13, 14, 15 further suggesting common pathophysiological mechanisms for both diseases.

The sharing of the linkage regions, the concurrence in patients and the pathological commonalities between coeliac disease and IBD led us to hypothesise that genes on chromosomes 5q31 and 19p13.1 might be associated with susceptibility for both disorders, most probably by influencing inflammation in the gut. The CELIAC2 locus on 5q23–q33 overlaps with a 250 kb haplotype on 5q31 associated with Crohn's disease (IBD5).16 The implicated haplotype contains five genes (P4HA2, IRF1, SLC22A5/OCTN2, SLC22A4/OCTN1, PDLIM3) (see Table 1), all of which are positional candidate genes for coeliac disease pathogenesis. The CELIAC4 locus ranging from 15.38 to 21.08 Mb (99% confidence interval (CI), Ensembl version 35) lies completely within the IBD6 locus, which spans a large part of chromosome 19p. Of the 141 genes in the overlapping region, five are known to be involved in inflammation and were considered as functional candidate genes: CYP4F3, CYP4F2, HSH2D, IL12RB1 and IFI30 (see Table 1). A complementary approach to identifying the susceptibility gene for coeliac disease in the chromosome 19 region involves fine-mapping of the region. We simultaneously undertook such a strategy, which led to the identification of the myosin IXB gene (MYO9B) as a susceptibility gene for coeliac disease.17 The odds ratio (OR) for this gene is lower than the expected OR for the CELIAC4 locus, leaving room for another susceptibility gene in this locus. In this study, we report our association findings of the five positional candidate genes comprising the 250 kb haplotype on 5q31 and the five functional candidate genes from 19p13.1; all were tested using a haplotype block tag approach for association with coeliac disease in a cohort of Dutch patients.

Table 1 Overview of the 10 positional and functional candidate genes studied in relation to the overlapping regions in coeliac disease and IBD

Materials and methods

Samples

A case–control cohort consisting of 309 independent coeliac disease patients and 358 independent controls, all of Dutch Caucasian origin, was used for the genetic association study. Patients were diagnosed according to the ESPGHAN criteria and approximately 93% of the patients were HLA-DQ2 positive. We included for this study only coeliac disease patients with a biopsy-proven Marsh III lesion, as described by Van Belzen et al.3 Blood samples were collected and DNA was isolated according to the standard laboratory procedures.3 The study was approved by the Medical Ethics Committee of the University Medical Centre in Utrecht and informed consent was obtained from all individuals.

Candidate gene and single-nucleotide polymorphism (SNP) selection

A total of 16 haplotype tagging SNPs were selected from the 5q31 region, as described before.16 We observed comparable haplotypes to the ones published by Rioux et al.16 We also selected five functional candidate genes located in the overlapping CELIAC4 and IBD6 locus on chromosome 19 for this study, based on their known function, expression studies and HapMap SNP coverage. We selected SNPs based on block-tagging ability, validation status, allele frequency and SNP density. Previous performance rating (genotyping >90%) from the IBD cohort typing, on the Sequenom platform, was also taken into account. Haplotype tagging SNPs were selected on the basis of HapMap data as of September 2004 (www.hapmap.org) using Haploview to determine the haplotype tagging SNPs. These SNPs were selected at an early stage of the HapMap project when there was less information than available today. If tag SNPs could not be determined from HapMap data, Supplemental data sets (Programs for Genomic Applications (PGA) and previous haplotype analyses by our group (JD Rioux, personal communication)) were used for haplotype tagging SNP selection using Haploview as described above. Supplementary Table 1 lists the selected tagging SNPs per gene that were tested on the coeliac disease cohort.

Genotyping

Genomic DNA extracted from whole-blood samples was used. Genotyping assays were designed using the Sequenom Assay Design program and genotypes were obtained using the Sequenom Mass Array system at the Broad Institute of MIT and Harvard as described by Gabriel et al.18 Genotyping data was analysed for Hardy–Weinberg equilibrium and allele frequency. The quality control criteria used to determine if genotyping results were successful were: a minimum of 75% genotyping success for each SNP, Hardy–Weinberg equilibrium values >0.01 and observed heterozygosity >0.5%. Of 44 SNPs tested in the region, 42 SNPs passed the above criteria (one was monomorphic and the other failed owing to Hardy–Weinberg errors).

Statistical analysis

Allele and haplotype counts in cases versus controls were analysed for association. A single-marker and multimarker association study for each gene in the case–control cohort was conducted by a standard χ2 test (2 × 2 contingency table). Haplotypes were constructed using Haploview for the unrelated cases and for the control cohorts separately.19 OR represents maximum-likelihood estimate of odds ratio, and the corresponding 95% CI was approximated using Woolf's method.

The relative risk for coeliac disease associated with the CELIAC4 locus base on the linkage data is calculated to be 2.3.3 The OR associated with MYO9B heterozygosity is estimated to be 1.6 and homozygosity 2.3.17 We calculated the power for this study based on a relative risk of 1.7, which gives us more than 75% power to detect a disease variant with minor allele frequencies ranging from 0.1 to 0.3, assuming a dominant inheritance, a disease prevalence of 0.1, a D′ of 1 and an equal frequency of the tested SNP and the high-risk variant.

Results

Coeliac disease and IBD share linkage regions on chromosome 5q23–q33 (CELIAC2 and IBD5) and 19p13 (CELIAC4 and IBD6). We performed association studies on 10 positional and functional candidate genes to search for association with genes that might play a primary role in both disorders. Forty-four SNPs were selected to tag the haplotype blocks, thereby excluding redundant typing, and were genotyped in a cohort of 309 independent Dutch coeliac disease cases and 358 Dutch controls. Two tag SNPs failed to pass our quality control standards.

In the chromosome 5 region, one SNP showed association (Pnominal <0.05): rs7705826 located in SLC22A5 (Pnominal 0.033, OR 1.39, 95% CI 1.03–1.88). Owing to the high linkage disequilibrium (LD) in the region, we would have expected to find more than one SNP from this region to be associated. Therefore, this observation is probably false positive also because this SNP deviates from Hardy–Weinberg equilibrium in the control population.

In the chromosome 19 region, four SNPs showed single SNP association, two of which were located in CYP4F2 and two in CYP4F3 (Table 2, Figure 1). These genes are located head-to-head 218 kb apart. SNPs rs7252046 (Pnominal 0.0427, OR 1.28, 95% CI 1.01–1.61) and rs3093156 (Pnominal 0.013, OR 1.33, 95% CI 1.06–1.66) showed association in CYP4F2, and SNPs rs1290622 (Pnominal 0.0447, OR 1.58, 95% CI 1.01–2.48) and rs1290625 (Pnominal 0.0375, OR 1.77, 95% CI 1.03–3.05) in CYP4F3. Heterozygotes for the most associated SNP (rs3093156 in CYP4F2) have a 1.6 times higher risk of coeliac disease (Pnominal 0.021, 95% CI 1.07–2.4), whereas in homozygotes the risk increases to 1.84 (Pnominal 0.01, 95% CI 1.15–2.93). The results obtained for these genes are independent as the r2 between the SNPs in the genes is <0.06. Haplotype analyses of each of the two SNPs in the CYP4F2 and CYP4F3 genes are shown in Table 3. The other seven genes (IRF1, SLC22A4/OCTN1, PDLIM3, P4HA2, IF130, IL12RB1 and HSH2D) showed no association.

Table 2 Association results for the SNPs that were polymorphic and in Hardy–Weinberg equilibrium (HWE)
Figure 1
figure 1

Overview of CYP4F2 (a) and CYP4F3 (b), isoforms a and b, respectively, showing the exons, the locations of the tag SNPs (associated tag SNPs are listed in bold italics) and the observed LD based on the HapMap information. **The locations of the SNPs outside the genes are not true to scale. *rs2108622 is a coding SNP; # shows the position of the SNPs in the LD plot; and + shows for the SNPs that were not typed in the HapMap, the relative position of the SNPs in the LD plot.

Table 3 Haplotype analyses of associated SNPs in CYP4F3 (rs1290622 and rs1290625) and CYP4F2 (rs7252046 and rs3093156)

Discussion

In the last decade, much research has been devoted to elucidating the genetic basis of complex traits such as coeliac disease. Although increasingly successful, this work has been complicated by the fact that multiple genes can be associated with a trait, but each has insufficient impact to account for the total genetic susceptibility of the disease under study. Given the extent of linkage regions and the large number of genes they usually encompass, it can be a daunting task to select the most plausible candidate gene, particularly when little or no knowledge is available on gene function or the biological process perturbed. Both the coeliac disease loci on chromosome 5q23–q33 and 19p13.1 coincide with linkage regions for IBD. We therefore hypothesised that genes related to biological processes common to both disorders would make excellent functional candidates.

In this study, we observed nominal association to a single SNP located in SLC22A5 on chromosome 5. Although it is not clear which of the five genes in this region really confer susceptibility to Crohn's disease,16 changes in the SLC22A4 and SLC22A5 have been observed in relation to Crohn's disease,20 which potentially have a functional relevance. The effect of the association in coeliac disease is quite modest compared to the original findings in Crohn's disease. Furthermore, the other SNPs that we tested in the region show no association despite being in near-complete LD. Given these results, we feel that the most parsimonious explanation is that this association of SLC22A5 to celiac disease is a false positive, but this will require confirmation by independent groups.

Recently, a study has been published investigating 56 candidate genes from chromosome 19 for their involvement in IBD.21 All but one (CYP4F2) of the genes included in this study overlap with the genes published by Tello-Ruiz et al.21 We observed association to the cytochrome genes, CYP4F2 and CYP4F3, in a cohort of Dutch coeliac disease patients. Although the genes are juxtaposed on chromosome 19, the observed association signals are independent as there is hardly any LD between them (maximum r2 between the associated SNPs of the two genes is <0.06). The relative risk associated to each of the two genes ranges from 1.6 to 1.8, which is insufficient to fully explain the linkage result observed earlier.3 It is interesting that variants in the myosin IXB gene (MYO9B), located approximately 1.2 Mb proximal to CYP4F2, have recently been shown to be associated with increased risk (OR=1.7) to coeliac disease.17 The association found in the CYP4F2 and CYP4F3 genes is not owing to long-range LD between these genes and MYO9B (maximum r2 <0.0044). Although the observation with the CYP4F2 and CYP4F3 genes has not been corrected for multiple testing and needs to be independently replicated, it is tempting to speculate that the original strong linkage signal on 19p13.13 resulted from the presence of multiple susceptibility genes.

If we look at their function, the involvement of these two CYP4F genes is intriguing. Both CYP4F isoforms are involved in the oxidative degradation of leukotriene B4 (LTB4), the arachidonic acid-derived lipid inflammatory mediator responsible for the recruitment and activation of neutrophils. The genetic association of LTB4-regulating gene variants connects the innate immune response of neutrophil mobilisation with that of the established Th1 adaptive immunity present in coeliac disease. These genetic variants may influence neutrophil migration and thus create an environment in the mucosa that contributes to coeliac disease pathogenesis. It has been reported that activated neutrophils have an effect on epithelial tight junctions that results in enhanced permeability22 in the intestine, which could facilitate the influx of various commensals from the lumen leading to the recruitment of even more phagocytic neutrophils. This self-sustaining cycle of barrier impairment could also enable gluten to enter the lamina propria, where it could be presented to resident CD4+ T cells to evoke the Th1 response. In turn, activation of the Th1 immune pathway would further undermine barrier integrity through the release of interferon-γ.23 In a parallel gene expression study, we observed increased neutrophil numbers not only in untreated coeliac patients but also in patients in complete remission, probably reflecting a genetic impairment of the epithelial barrier.25 This observation is in line with the recently identified MYO9B gene, which is expected to confer genetic susceptibility to coeliac disease pathogenesis by also affecting the intestinal barrier.17 A more detailed assessment of the contribution of CYP4F2 and CYP4F3 variants to neutrophil recruitment and intestinal permeability will require further functional analysis. Replication in coeliac disease and expanding genetic association studies of these cytochrome genes to other inflammatory conditions should reveal whether their causative influence is true and extends beyond coeliac disease.