Whole exome sequencing analyses reveal gene–microbiota interactions in the context of IBD

Shixian Hu; Arnau Vich Vila; Ranko Gacesa; Valerie Collij; Christine Stevens; Jack M Fu; Isaac Wong; Michael E Talkowski; Manuel A Rivas; Floris Imhann; Laura Bolte; Hendrik van Dullemen; Gerard Dijkstra; Marijn C Visschedijk; Eleonora A Festen; Ramnik J Xavier; Jingyuan Fu; Mark J Daly; Cisca Wijmenga; Alexandra Zhernakova; Alexander Kurilshikov; Rinse K Weersma

doi:10.1136/gutjnl-2019-319706

Article Text

other Versions

You are currently viewing an earlier version of this article (January 07, 2021).
View the most recent version of this article

PDF

XML

Inflammatory bowel disease

Original research

Whole exome sequencing analyses reveal gene–microbiota interactions in the context of IBD

http://orcid.org/0000-0002-1190-0325Shixian Hu1,2,
http://orcid.org/0000-0003-4691-5583Arnau Vich Vila1,2,
Ranko Gacesa1,2,
Valerie Collij1,2,
Christine Stevens3,
Jack M Fu4,5,6,
Isaac Wong4,5,
Michael E Talkowski4,5,6,7,8,
Manuel A Rivas9,
Floris Imhann1,2,
Laura Bolte1,2,
Hendrik van Dullemen1,
http://orcid.org/0000-0003-4563-7462Gerard Dijkstra1,
Marijn C Visschedijk1,
Eleonora A Festen1,
Ramnik J Xavier10,11,
Jingyuan Fu2,12,
Mark J Daly3,
Cisca Wijmenga2,
Alexandra Zhernakova2,
Alexander Kurilshikov2,
http://orcid.org/0000-0001-7928-7371Rinse K Weersma1

¹Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
²Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
³Program in Medical and Population Genetics, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
⁴Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts, USA
⁵Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA
⁶Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States
⁷Division of Medical Sciences, Harvard Medical School, Boston, Massachusetts, United States
⁸Stanley Center for Psychiatric Research, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, United States
⁹Department of Biomedical Data Science, Stanford University, Stanford, California, USA
¹⁰Center for Microbiome Informatics and Therapeutic, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
¹¹Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts, United States
¹²Department of Pediatrics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands

Correspondence to Professor Rinse K Weersma; r.k.weersma{at}mdl.umcg.nl

Abstract

Objective Both the gut microbiome and host genetics are known to play significant roles in the pathogenesis of IBD. However, the interaction between these two factors and its implications in the aetiology of IBD remain underexplored. Here, we report on the influence of host genetics on the gut microbiome in IBD.

Design To evaluate the impact of host genetics on the gut microbiota of patients with IBD, we combined whole exome sequencing of the host genome and whole genome shotgun sequencing of 1464 faecal samples from 525 patients with IBD and 939 population-based controls. We followed a four-step analysis: (1) exome-wide microbial quantitative trait loci (mbQTL) analyses, (2) a targeted approach focusing on IBD-associated genomic regions and protein truncating variants (PTVs, minor allele frequency (MAF) >5%), (3) gene-based burden tests on PTVs with MAF <5% and exome copy number variations (CNVs) with site frequency <1%, (4) joint analysis of both cohorts to identify the interactions between disease and host genetics.

Results We identified 12 mbQTLs, including variants in the IBD-associated genes IL17REL, MYRF, SEC16A and WDR78. For example, the decrease of the pathway acetyl-coenzyme A biosynthesis, which is involved in short chain fatty acids production, was associated with variants in the gene MYRF (false discovery rate <0.05). Changes in functional pathways involved in the metabolic potential were also observed in participants carrying rare PTVs or CNVs in CYP2D6, GPR151 and CD160 genes. These genes are known for their function in the immune system. Moreover, interaction analyses confirmed previously known IBD disease-specific mbQTLs in TNFSF15.

Conclusion This study highlights that both common and rare genetic variants affecting the immune system are key factors in shaping the gut microbiota in the context of IBD and pinpoints towards potential mechanisms for disease treatment.

inflammatory bowel disease
genetics
intestinal microbiology

https://creativecommons.org/licenses/by/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.

https://doi.org/10.1136/gutjnl-2019-319706

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Significance of this study

What is already known about this subject?

Gene–microbiome interactions are important in the pathogenesis of IBD.
Multiple genetic and epidemiological factors have been identified to be associated to changes in gut microbiome homeostasis in both IBD and the general population.
The identified gene–microbiome interactions in IBD contain mostly common genetic variants.

What are the new findings?

Novel associations between common genomic variants located in IBD implicated genes (MYRF, IL17REL, SEC16A and WDR78) or immune-related genes (CABIN1) to the gut microbial features have been identified in both IBD and the general population cohort.
By using high-resolution sequencing data, we were also able to identify rare and deleterious variants in five genes (GPR151, CYP2D6, TPTE2, LEKR1 and CD160) that may also be involved in the regulation of the gut microbiota.
Disease-specific host microbiota interactions were assessed by taking into account potential cofounding factors such as medication use.

How might it impact on clinical practice in the foreseeable future?

Our research revealed the host–microbiota interactions in context of IBD, which helps us to understand the pathology of IBD and potentially move towards new therapeutic targets for IBD.

Introduction

IBD, comprising Crohn’s disease (CD) and UC, is a chronic inflammatory condition of the gut with an increasing incidence in westernised countries.1 Large-scale genome-wide association studies (GWAS) have identified more than 200 genetic loci associated with IBD, including genes implicated in the immune pathways involved in responses to gut microbes.2

Extensive changes in the composition of the gut microbiota have been reported in patients with IBD. Several studies have described similar alteration on the faecal microbiota of patients with IBD, mainly a decreased microbial richness, the depletion of strictly anaerobic commensal species and the expansion of pathobiont.3–5 Despite these observations, the gut microbiota composition of patients with IBD is heterogeneous and mainly influenced by disease behaviour together with the impact of clinical and environmental factors.6 7 As neither genetics nor microbiome studies have revealed the triggering factors for IBD, there is an increasing need to study host–microbial interactions in order to understand the aetiology and progression of the disease.8 9

To date, both mouse models and human studies have shown that IBD-associated genes interact with the intestinal microbiome via regulation of the mucosal physical barrier as well as immune responses. For example, the nucleotide-binding oligomerisation domain (NOD)-like receptor 2 (NOD2) is involved in the bacterial peptidoglycan recognition.10 It has been shown that NOD2 knock-out mice show ineffective recognition and clearance of bacterial pathogens. As a consequence, these mice present increased abundances of pathogenic bacteria from the Bacteroides and Escherichia genera.11–13 Another host–microbiome interaction involves ATG16L1, a gene implicated in autophagy. In patients with CD, ATG16L1-T300A mutation carriers have more pathosymbionts in their gut mucosa.14 Recently, genome-wide host–microbiota association analyses have reported correlations between variants in immune-related genes and microbial features. For example, IL10 has been associated with the abundance of Enterobacteriaceae15 and IL1R2 associated with the overall community composition (beta diversity).16

Host genetics–microbiome association studies have been described in cohorts based on the general population.15 16 These studies tend to miss the genetics signals that are more pronounced in a disease context like IBD. On the other hand, the microbial quantitative trait loci (mbQTL) studies in IBD cohorts available to date have been limited in either sample size or in genomic and microbiome resolution. Also details in phenotypes capturing the heterogeneity present within IBD has been lacking in previous studies.17 18 The discovery of host–microbiota interactions, moreover, has been hampered by the large influence of intrinsic and environmental factors on the gut microbiome and relatively low microbial heritability.19

The aim of this study was to expand current knowledge of host–gut microbiota interactions.20 We combined whole exome sequencing (WES) of the host genome with metagenomics sequencing of faecal samples in a population cohort and in an IBD cohort. In addition to whole-exome-wide analyses, we investigated disease-specific interactions and the influence of rare variants on the gut microbiota in order to identify mechanisms involved in gut homeostasis and disease development.

Methods

Study cohorts

This study included two independent Dutch cohorts: a population-representative cohort (LifeLines-DEEP) from the northern part of the Netherlands and an IBD cohort made up of patients diagnosed in the specialised IBD clinic of the University Medical Center Groningen (Groningen, the Netherlands). The LifeLines-DEEP cohort (M12.113965) was approved by the ethics committee of the University Medical Centre Groningen, with registering at the LifeLines Research Site in Groningen. All individuals were also asked to fill in the questionnaire on GI symptoms. The IBD cohort (IRB-number 2008.338) was approved by University Medical Centre Groningen IRB (online supplementary table 1).

Supplemental material

[gutjnl-2019-319706supp001.xlsx]

WES and data processing

WES was performed on blood samples. Library preparation and sequencing were done at the Broad Institute of MIT and Harvard. On average, 86.06 million high-quality reads were generated per sample and 98.85% of reads were aligned to a human reference genome (hg19). Moreover, 81% of the exonic regions were covered with a read depth >30×. Next, the Genome Analysis Toolkit21 of the Broad Institute was used for variant calling. Variants with a call rate <0.99 or Hardy-Weinberg equilibrium test with p<0.0001 were excluded using PLINK tool (V.1.9). To remove genetic outliers, we combined WES data with genomes of Europeans from publically available 1000 Genome Project (phase 3) data (http://www.internationalgenome.org/), and performed principal component analysis (PCA) analysis based on single nucleotide polymorphisms (SNPs) shared between datasets. Outliers were defined as samples which fall outside of a mean±3 SD interval in both of the first two PCs, and these samples were removed. We also removed sex-mismatching samples based on the inbreeding coefficient (lower than 0.4 for females and higher than 0.7 for males) and related samples with identity-by-descent>0.185.22 GATK germline copy number variant (gCNV)23 was used for copy number variant (CNV) detection. GATK-gCNV uses a Bayesian model to adjust for known bias factors of exome capture and sequencing, such as GC content and mappability, while also controlling for other technical and systematic differences. Raw sequencing files are compressed into read counts over the set of exons defined under Gencode Annotation (V.33). After processing, variant quality and frequency filters (<1% site frequency) are applied to produce the final CNV callset (https://gatkforums.broadinstitute.org/gatk). In summary, 73 164 common variants (minor allele frequency (MAF) >5%), 98 878 rare variants (MAF <5%) and 1046 CNVs (site frequency <1%) from 920 LifeLines-DEEP and 435 individuals with IBD were considered for further analyses.

Metagenomic sequencing and data processing

Metagenomic sequencing was performed for faecal samples, using the Illumina MiSeq platform. Reads belonging to the human genome were removed by mapping the data to the human reference genome (version NCBI37) with kneaddata (V.0.5.1, http://huttenhower.sph.harvard.edu/kneaddata).

Profiling of microbiome taxonomic and functional composition was done using MetaPhlan (V.2.6.0)24 (http://huttenhower.sph.harvard.edu/metaphlan) and HUMAnN2 (V.0.6.1)25 (http://huttenhower.sph.harvard.edu/humann2). For each cohort, taxa present in fewer than 10% of total samples and pathways present in fewer than 25% of samples were excluded from the analyses (online supplementary methods, online supplementary table 2). We then normalised the relative abundances of 242 microbial taxa and 301 pathways present in both cohorts through inverse rank transformation.

Supplemental material

[gutjnl-2019-319706supp002.pdf]

Host genetics and gut microbiota differences between cohorts

IBD genetic signature

To assess the similarity of the genetic makeup of our IBD cohort compared with other GWAS studies on IBD, we performed case-control analyses in terms of genetics (population controls vs patients with CD, controls vs patients with UC and controls vs all patients combined) and compared the results with the largest IBD GWAS meta-analysis of populations of European ancestry published to date.2 Logistic regression analysis was used (PLINK V.1.9) adjusting for age, sex and smoking status. P values were adjusted for multiple testing by using the Bonferroni method and an false discovery rate (FDR) <0.05 was considered statistically significant.

IBD-associated gut microbial taxa and pathways

Then, we compared relative abundance of microbial taxa and pathways between the groups. The analyses were performed using Maaslin2 software (https://bitbucket.org/biobakery/maaslin2/src/default/). We selected covariates for our linear models based on factors which have often been used in mbQTL studies to increase comparability to other studies. Furthermore, we added covariates which have shown to have a large impact on the gut microbiome composition.3 15–17 20 26–30 This resulted in the inclusion of the following covariates: age, sex, body mass index, smoking, read depth, medication use (proton pump inhibitors, laxatives and antibiotics) and disease location for the IBD cohort. Bonferroni procedure was used to adjust for multiple testing and an FDR<0.05 was considered statistically significant.

mbQTL analyses

Microbial taxa and functional pathways were treated as quantitative traits. For all analyses, linear regression (where variants were encoded as 0 for homozygote of major allele, 1 for heterozygotes and 2 for homozygote of minor allele, online supplementary methods) was used to adjust for the effect of the confounders mentioned above. The Spearman correlation method was applied to determine the relationship between non-zero microbial data and host genotype in a four-step approach (figure 1).

Figure 1

Schematic overview of the study. (DATA part) We performed whole exome sequencing of the host genome and whole genome shotgun sequencing of faecal samples of 525 individuals (IBD) and 939 controls (LifeLines-DEEP). Nine covariates (age, sex, body mass index (BMI), smoking status, medication use (antibiotics, proton pump inhibitors (PPIs) or laxatives), disease location (in the IBD cohort) and sequencing read depth) were corrected for relative abundances of 242 taxa and 301 pathways. (ANALYSES WORKFLOW part) A four-step analysis was performed: step 1 includes a meta-analysis (p<6.83 × 10⁻⁷, corresponding to FDR<0.05) in which 73 164 exome-wide common variants with minor allele frequency (MAF) >5% were used for association analyses for microbial traits. Step 2 includes a meta-analysis (p<1.5 × 10⁻⁵, corresponding to FDR<0.05) using a targeted approach that only tested for 3010 variants located in IBD-associated genes known from IBD genome-wide association studies and PTVs with MAF >5%. Step 3 includes a meta-analysis (p<5 × 10⁻⁵, corresponding to FDR<0.05) using a gene-based burden test for 980 genes with rare PTVs (MAF <5%); a meta-analysis (p<1.87 × 10⁻⁴, corresponding to FDR<0.05) using a gene-based test for 267 genes with rare copy number variants (site frequency <1%). Step 4 includes joint analysis combining the two cohorts for disease and genetics interaction analyses. Step 4 focused only on single-cohort-significant microbial quantitative trait loci (mbQTLs) from steps 1 and 2 while adding a disease and a genetic interaction term into the model. All analyses were confined to non-zero values of taxa and pathways. All significance thresholds were set up by Bonferroni correction taking all variants/genes used into account.

Step 1: whole-exome-wide association meta-analyses

Seventy three thousand one hundred and sixty-four common variants (MAF >5%) were correlated with the relative abundances of microbial taxa and metabolic pathways using the same method in the previous study.15 First, we tested associations in the LifeLines-DEEP cohort (discovery stage) and selected signals with p<5 × 10^–5. Second, we replicated these in the IBD cohort and only kept associations with the same allelic direction that passed a replication threshold p<0.05 (replication stage). Third, we performed meta-analyses on these datasets using a weighted-Z-score approach by ‘Metap’ package in R V.3.5.0. The criteria of significance were p values that met a whole-exome-wide threshold of 6.83 × 10^–7, corresponding to exome-wide FDR=0.05 (Bonferroni method, n=73 164 variants). We then repeated this analysis switching the discovery and replication cohorts: using the IBD cohort as discovery and LifeLines-DEEP as replication.

Step 2: meta-analyses of selected variants

We selected two sets of variants for targeted analysis: protein truncating variants (PTVs)31 and variants located in known IBD-associated genes.2 We predicted 316 stop-gain, splice-disrupting and frameshift variants with MAF >5% in this analyses. We selected all genetic variants with an MAF >5% present in genomic loci that have been associated to IBD2 (n=3010). Associations between these variants and microbiome traits were performed following the same procedure described above in step 1. The significance threshold was adjusted according to the number of genetic variants tested: p<0.001 in the discovery cohort, p<0.05 in the replication cohort and a final meta p meeting 1.5 × 10⁻⁵, corresponding to FDR=0.05 (Bonferroni method, n=3309 variants).

Step 3: gene-based burden test meta-analyses

To identify the effect of rare SNPs, we performed gene-based burden tests by using the variant’s score instead of individual genotype in correlation analyses (MetaSKAT packages32 in R V.3.5.0), keeping only PTVs with MAF <5% and calculating per-gene scores.33 The number of genes implicated in this analysis was 980, so the final meta p was 5 × 10^-5, corresponding to gene-wise FDR=0.05, with a discovery p of 0.005 and a replication p of 0.05. To identify the effect of CNVs, we used a strategy similar to the one for rare SNVs and overlapped genes with CNVs. For each gene, a score was assigned based on the number of CNV sites and then used in association tests.33 34 This analysis was conducted for 267 genes with deletions and duplications separately. We chose signals with p<0.05 in each cohort, and the final meta p<1.87×10⁻⁴, FDR of 0.05 (Bonferroni method, n=267 genes).

Step 4: assessing disease effect in the host–microbiota correlations

Next, we investigated the mbQTLs that were only significant in one of the cohorts in steps 1 and 2. To identify whether the presence and absence of IBD could have an effect on the observed mbQTLs, we performed association analyses combining both cohorts and adding diseases and the interaction between genotype and diseases as covariates (online supplementary methods).35 Significance thresholds at whole-exome-wide level were p<6.83 × 10⁻⁷ (Bonferroni method, n=73 164 variants) for the discovery cohort, p>0.05 for the replication cohort and significant interaction p (IBD ×genotype)<0.0013, corresponding to FDR=0.05 (Bonferroni method, n=38 variants, including 17 IBD-specific and 21 LifeLines-DEEP-specific observed mbQTLs; online supplementary table 3, online supplementary table 4). The criteria for significance in the targeted-level analyses were discovery cohort p<1.5 × 10⁻⁵, replication cohort p>0.05, significant interaction p (IBD ×genotype)<0.0014, corresponding to FDR=0.05 (Bonferroni method, n=36, including 12 IBD-specific and 24 LifeLines-DEEP-specific mbQTLs; online supplementary table 3, online supplementary table 4). To avoid inflated statistics in these analyses, we randomly permutated the disease status across all samples 999 times (online supplementary methods). In addition, taking into account the heterogeneity of patients with IBD, we also considered the clinical IBD subphenotypes and performed a case-control mbQTL analyses in patients with CD and patients with UC separately.

Annotation of genetic variants

To further explore the function of the observed mbQTLs, we examined tissue-specific gene expression (expression quantitative trait loci (eQTLs)) in the GTEx Consortium database36 and used the Enrichr37 and FUMAGWAS38 databases to annotate the biological function and immunological signatures of the genes with a mbQTL effect in the whole-exome-wide analyses.

Results

Cohort description

The two cohorts in this study are derived from the Netherlands. The LifeLines-DEEP cohort comprises 939 individuals (59.74% female, mean age 45.24±13.46) and the IBD cohort comprises 525 patients with IBD (61.33% female, mean age 43.18±14.46), including 291 patients with CD, 202 patients with UC and 32 IBD unclassified (IBDU) patients. Eighteen individuals from LifeLines-DEEP and 17 patients from IBD cohort were removed through genetic PCA analysis. One individual from LifeLines-DEEP and seven patients from IBD were failed in quality control (QC) (online supplementary methods). The presence of an ileoanal pouch or a stoma was an exclusion criterion in the IBD cohort (n=66; online supplementary table 1). Finally, 920 LifeLine-DEEP individuals and 435 patients with IBD (CD=242, UC=161 and IBDU=32) were used for analysis.

Differences on host genetics and gut microbiota between cases and controls

IBD was associated to genomic variants located in previously reported IBD risk loci (FDR<0.05, online supplementary tables 5,6), including genes in human leukocyte antigen (HLA) loci (eg, rs77504727, c.740C>T, p.Arg247His, OR_IBD=2.65, P_IBD=1.25×10⁻¹³, FDR_IBD=8.71×10⁻⁰⁹, OR_CD=2.88, P_CD=1.16×10⁻¹⁰, FDR_CD=8.12×10⁻⁶) and NOD2 (rs2066843, c.1296C>T, OR_CD=1.83, P_CD=3.35×10⁻⁰⁸, FDR_CD=0.0023). An increased abundance of the phylum Bacteroidetes was detected in patients with IBD compared with general population controls (FDR=1.30×10⁻²³, online supplementary table 7). In terms of microbial pathways, pathways involved in fermentation of pyruvate to propanoate were decreased in IBD (FDR_IBD=3.10×10⁻⁶, FDR_CD=2.35×10⁻³, FDR_UC=7.14×10⁻³), while the pathway of fermentation of pyruvate to acetate and lactate was decreased in patients with CD compared with population controls (FDR=1.77×10⁻¹¹).

Whole-exome-wide analysis reveals mbQTLs in immune-related genes

The exome-wide mbQTL analysis (step 1) identified associations between 10 genetic variants and 11 microbial features (FDR<0.05). Four variants were associated to bacterial metabolic pathways involved in degradation of glucarate, the tricarboxylic acid cycle (TCA) cycle, coenzyme A (CoA) biosynthesis and glycogen biosynthesis, while the other six variants were associated with relative abundance of bacteria (figure 2, online supplementary table 8). The most significant associations were found between the minor allele of an intronic SNP (rs2238001, c.46+4245T>C) in the MYRF gene, which is located in an IBD-associated loci,2 and decreased abundance of two microbial pathways involved in carbohydrate metabolism: acetyl-CoA biosynthesis (PWY-5173, meta p=7.50 × 10⁻⁸, FDR=0.0058) and glyoxylate bypass (TCA-GLYOX-BYPASS, meta p=6.16 ×10⁻⁷, FDR=0.048; figure 3A; online supplementary figure 1). In the step 2 analysis, the same SNP was also observed to be associated with another metabolic pathway (GLYCOLYSIS-TCA-GLYOX-BYPASS, meta p=2.73 × 10⁻⁶, FDR=0.02). These pathways are mainly predicted from Escherichia coli. Concordantly, E. coli shows the strongest association among all 242 microbial taxa to MYRF (meta p=6.00 × 10⁻³), although it does not meet the statistically significant threshold. Examination of the GTEx database revealed that the rs2238001 has a eQTL effect specific to colon tissue that results in increased expression of MYRF (p=2.50 × 10⁻⁷; figure 3C).

Supplemental material

[gutjnl-2019-319706supp003.pdf]

Figure 2

Whole-exome-wide meta-analysis results from LifeLines-DEEP and IBD cohorts. Seventy three thousand one hundred and sixty-four common variants (minor allele frequency >5%), 242 taxa and 301 pathways (corrected for all covariates) were used in the association analyses. The discovery significance threshold was p<5 × 10⁻⁵ and the replication significance threshold was p<0.05. Manhattan plot displays −log10 p values for all association tests. Green and blue represent taxonomies and pathways, respectively. Red line indicates the whole-exome-wide association significance threshold: meta p<6.83 × 10⁻⁷, corresponding to exome-wide FDR<0.05 (n=73 164 variants, Bonferroni correction).

Figure 3

Microbial quantitative trait loci and eQTL analyses of MYRF. (A) Spearman correlation between genotype (TT, TC, CC) of rs2238001 in MYRF and the relative abundance of acetyl-coenzyme A (CoA) biosynthesis (IBD cohort, p=1.43 × 10⁻³, r=−0.19; LifeLines-DEEP (LLD) cohort, p=1.47 × 10⁻⁵, r=−0.20; meta p=7.50 × 10⁻⁸, FDR<0.05), the glyoxylate bypass and tricarboxylic acid cycle (TCA) MetaCyc pathways (IBD cohort, p=0.0149, r=−0.16; LLD cohort, p=1.04 × 10⁻⁵, r=−0.22; meta p=6.07 × 10⁻⁷, FDR<0.05). (B) The rs2238001 locus zoomed in on the IBD-associated region, including the IBD-associated genes MYRF, FADS2 and FADS3. P values are derived from meta-analyses between variants and the relative abundance of acetyl-CoA biosynthesis. (C) eQTL analysis between rs2238001 and MYRF gene expression in colon tissue from the GTEx database (n=246 tissues, p=2.46 × 10⁻⁷). r, Spearman correlation coefficient.

The minor allele of a synonymous variant in the immune-related gene CABIN1 (rs17854875, c.5745C>T, p.Ala1915Ala) was associated with an increase of D-glucarate degradation (GLUCARDEG-PWY, meta p=4.15 × 10⁻⁷, FDR=0.032). Another SNP located near the gene IL17REL (rs5845912, AC >A) was correlated with a lower abundance of the species Alistipes indistinctus (meta p=4.36 × 10⁻⁷, FDR=0.033). Variants in this gene have been reported to be associated with UC. IL17REL encodes interleukin 17 (IL-17) receptor E-like, a homolog of IL-17 receptor E that is considered to be a part of the IL-17 pathway that initiates a T helper 2–mediated immune response.39

Gene function enrichment analysis of all 10 mbQTLs (table 1) identified enrichment in gene functions related to mature B cell differentiation (GO:0002313, p=0.005, FDR=0.038) and CD4 and CD8 T-cell differentiation pathways (GSE31082, p=2.81 × 10⁻⁶, FDR=0.0103; online supplementary table 9).

View this table:

Table 1

Microbial quantitative trait loci associated with microbial taxonomies and pathways identified in a whole-exome-wide approach

Targeted analysis identifies mbQTLs in IBD-associated genes

Two additional IBD-associated genes with mbQTLs were identified in this targeted approach (step 2; table 2; online supplementary table 10). The top significant variant, rs10781497 (c.834G>A, p.Asp278Asp) located in the SEC16A gene, was associated with lower levels of bacterial biosynthesis of thiamin phosphate (THISYN-PWY) and thiazole (PWY-6892) (online supplementary figure 2A), and an SNP in WDR78 (rs74609208, c.2497-18C>A) was associated with higher level of biosynthesis of rhamnose (DTDPRHAMSYN-PWY; online supplementary figure 2B).

Supplemental material

[gutjnl-2019-319706supp004.pdf]

View this table:

Table 2

Microbial quantitative trait loci associated with microbial taxonomies and pathways identified in a targeted approach

Gene-based burden test highlights rare mutation mbQTLs

To study the effect of rare variants with predicted protein changing properties and CNVs, we performed gene-based burden tests (step 3). Here, we identified eight associations between four genes and eight microbial pathways (table 3). Two transcriptional stop-gain mutations in the GPR151 gene were significantly associated with lower levels of bacterial carbohydrate metabolism pathways (ANAEROFRUCAT-PWY with meta p=4.78 × 10⁻⁶, FDR=0.0047, GLYCOLYSIS with meta p=5.45 × 10⁻⁶, FDR=0.0053, PWY-5484 with meta p=4.63 × 10⁻⁶, FDR=0.0045, and PWY-6901 with meta p=3.05 × 10⁻⁵, FDR=0.003; figure 4; online supplementary figure 3). In addition, two frameshift variants in the IBD-associated gene CYP2D6 were associated with a decreased level of bacterial biosynthesis of vitamin K (PWY-5838 with meta p=1.45 × 10⁻⁵, FDR=0.014). We also observed that the gene CD160 with exon duplications was significantly associated with decreased abundance of Lachnospiraceae (meta p=1.65 × 10⁻⁴, FDR=0.044, online supplementary table 14).

Supplemental material

[gutjnl-2019-319706supp005.pdf]

Figure 4

Associations between gene GPR151 and microbial pathways. (A) Meta p values based on burden test between 30 genes with rare protein truncating variants (PTVs) on chromosome 5 and relative abundance of MetaCyc pathway homolactic fermentation (top). Blue dot represents meta p value of gene GPR151. Lower panel shows the variants found along with the coding region in GPR151. Different colours indicate different variant categories. Red indicates two rare stop-gain mutations, rs114285050 and rs140458264. (B) Box plots for associations between the relative abundance of the homolactic fermentation (meta p=4.78 × 10⁻⁶, FDR<0.05), glucose xylose degradation (meta p=3.05 × 10⁻⁵, FDR<0.05) microbial pathways and GPR151, respectively. b, effect size. GPR151, without rare PTVs. GPR151*, with rare PTVs.

View this table:

Table 3

Rare microbial quantitative trait loci identified by gene-based burden meta-analyses

Interaction analyses identifies IBD-specific mbQTLs

Since both the gut microbiota and host genetics are different in patients with IBD compared with the general population, we reanalysed current dataset including an interaction factor between disease and genetics. This analysis identified IBD-specific interactions comprising 18 genetic variants and 19 microbiome features (10 pathways and 9 taxa; FDR<0.05, online supplementary table 12), which were also calibrated by permutation tests to avoid inflated statistics bias (online supplementary methods, online supplementary figure 4). For example, a missense variant

Supplemental material

[gutjnl-2019-319706supp006.pdf]

(rs2076523, c.586T>C, p.Lys196Glu) in the IBD-associated gene BTNL2, which is involved in regulation of T cell proliferation,40 was associated with an increase in Bacteriodes cellulosilyticus in patients with IBD (interaction p=1.31 × 10⁻⁵, interaction FDR=4.98 × 10⁻⁴). We also replicated three previously identified mbQTLs. The well-known association between the LCT gene and Bifidobacterium abundance15 41 42 was confirmed in the population-based cohort (rs748841, GG genotype associated with higher abundance of Bifidobacterium adolescentis, recessive model, p=1.70 × 10⁻⁴, FDR=0.046, online supplementary figure 5, online supplementary table 13), while previously reported genetic variants with mbQTL effect located in the IBD-associated genes TNFSF15 (rs4246905, c.302-63T>C) and HLA-B (rs2074496, c.900C>T, p.Pro300Pro)18 were associated with a glycogen degradation microbial pathway (GLYCOCAT-PWY, interaction p=7.98 × 10⁻⁵, interaction FDR=0.0029) and a strain of Ruminococcaceae bacterium (interaction p=3.32 × 10⁻⁵, interaction FDR=0.0012), respectively.

Supplemental material

[gutjnl-2019-319706supp007.pdf]

Finally, we assessed mbQTL effect in patients with CD and UC separately. Two mbQTLs passed the significant threshold in patients with CD (FDR<0.05). For example, rs61732050 (c.1701G>A, p.Ala567Ala, MAF_CD=0.052, MAF_UC=0.068), located in IBD-associated gene NDST1 and associated with decreased abundance of the family Lachnospiraceae, was only significant in patients with CD (Spearman correlation coefficient=−0.32, p=3.03×10⁻⁰⁷, FDR=0.023). The 23 out of 27 IBD-specific mbQTLs identified earlier were nominally significant (p<0.05) in both CD and UC groups (online supplementary table 4), with all 27 showing the same directions of effect.

Discussion

To study the interaction between host genomics and gut microbial features in the context of IBD, we performed a large mbQTL analysis using high-resolution host genomic and gut microbiome data. This identified putative associations between common genomic variants located in IBD (MYRF, IL17REL, SEC16A and WDR78) or immune-related genes (CABIN1) to the abundance of specific microbial taxa and gut microbiome metabolic pathways. The use of WES data also allowed us to identify rare and deleterious variants in five genes (GPR151, CYP2D6, TPTE2 LEKR1 and CD160) that could potentially be involved in the regulation of the gut microbiota. Finally, genetics–disease interaction models revealed disease-specific mbQTL signals.

The patients with IBD in this study showed similarities of their genetic and microbial signatures compared with other studies.2–4 43 For example, NOD2 variants were associated with CD, while the SNPs in HLA loci were associated with both CD and UC. The gut microbiota of patients with IBD was characterised by a decreased abundance of Firmicutes, including Faecalibacterium prausnitzii (FDR=9.69×10⁻⁰⁹), and an expansion of Proteobacteria, including E. coli (FDR=0.029), compared with the population controls. These differences were also evident in the predicted microbial pathways, with a decreased abundance of genes involved in short chain fatty acid (SCFA) metabolism.

In whole-exome-wide level analysis, we found that decreased levels of the microbial acetyl-CoA and glyoxylate metabolic pathways correlated with the minor allele (C) of a variant located in the gene MYRF. Acetyl-coA is a precursor in the synthesis of SCFAs, including butyrate and acetate,44 which are important in maintaining gut health.45 Interestingly, the MYRF gene is located in a genomic region that has previously been associated with IBD and other immune-mediated diseases.46 47 This genomic region also contains the FADS1 and FADS2 genes that are involved in the metabolism of polyunsaturated fatty acids,48 and the n-3 polyunsaturated fatty acid has been suggested to have protective effects on IBD.49 Therefore, the current analyses suggest a potential link between inflammation and microbial pathway dysregulation through host genomic variation. Another mbQTL we identified is located in the immune-related gene CABIN1. This gene is involved in negatively regulating T-cell receptor signalling50 and was associated to an increase of D-glucarate degradation pathway. Interestingly, enterobacteria such as E. coli, a potentially pathogenic bacteria known to be enriched in dysbiotic conditions, can use this sugar as a carbon source for growth.51 This implies a potential role between host genetics and a beneficial environment for E. coli to grow. We also found an association between IL17REL, which likely oligomerizes and binds a specific IL17 cytokine, and the bacterium Alistipes. Changes in the abundance of Alistipes have been reported in several conditions, including paediatric CD,52 colorectal cancer53 and obesity.54 Previous studies have reported a negative correlation between the abundance of Alistipes and the lipopolysaccharide (LPS)-induced tumour necrosis factor (TNF) alpha response.55 Therefore, mbQTLs identified at whole-exome-level suggest a potential complex interaction between host genetics, microbial composition and the immune system.

Next, we focused on a subset of selected variants located in genes within IBD-susceptibility regions and predicted protein-disrupting variants that could potentially lead to disease or abnormal phenotype by altering the gut microbiome. Here, we found two mbQTLs located in the IBD-associated genes SEC16A and WDR78. SEC16A is involved in the transitional endoplasmic reticulum and is located within a haplotype block that contains the INPP5E and CARD9 genes.56 The SEC16A-affected pathway biosynthesis of thiamin (vitamin B1, an essential vitamin) is necessary for the proper functioning of the immune system and thiamin is supplied to the host through diet and the gut microbiota.57 WDR78 was associated with L-rhamnose biosynthesis, and L-rhamnose is a precursor of a common enterobacterial antigen. In addition, WDR78, together with genes GPR65 and TNFAIP3, is reported to cooperate in regulation of the macrophage component.58 Therefore, this study reveals a potential link that suggests WDR78 may potentially regulate microbial function through antigen recognition by immune cells.

In contrast to the regular genotyping arrays used in GWAS, WES enables the detection of rare variants with mbQTLs effects. We identified independent rare variants with predicted functional consequences within the G-protein coupled receptor 151, GPR151, that are associated with multiple functional microbial pathways (homolactic fermentation, glucose and xylose degradation). GPR151 is a critical element of antigen recognition and activation of the immune response,59 60 and PTVs in GPR151 have been reported to have a protective effect against obesity and type 2 diabetes in the UK Biobank.61 In addition, lower levels of bacterial carbohydrate degradation lead to lower carbohydrate absorption in the gut by the host, which pinpoints potential mechanisms by which GPR151 variants can protect against metabolic diseases. Limited by the artefacts on capturing exomes using WES, we restricted our analyses on CNV site frequency lower than 1%. The strongest association between genes with CNV and microbiota was CD160, and Lachnospiraceae. CD160 is reported to be highly expressed in small intestine, inducing production of proinflammatory cytokines and antipathogen protein.62 63 Moreover, depletion of gene CD160 has been shown to be associated with increased pathogenic bacteria in mice.64

Finally, we joined the two cohorts to perform genetics–disease interaction analysis, rather than comparing single-cohort-significant mbQTLs separately, to identify disease-specific mbQTLs and to achieve more power. This approach was able to show that genetics potentially exerts a different influence on the microbiome in IBD compared with a healthy situation. The known association between the LCT gene and Bifidobacterium abundance was only present in the population cohort. This could potentially be explained by the fact that Bifidobacterium abundance is decreased in the gut microbiota of CD3 which was observed in this study, and therefore this mbQTL was not present in the IBD cohort. Furthermore, we observed mbQTL effects in known IBD genes18 such as TNFSF15 only in the IBD cohort. When analysing mbQTL effects in patients with CD and UC separately, we could only identify two mbQTLs in patients with CD that reached the significance threshold. This could be due to the limited statistical power resulted by subdividing the IBD group in its two main subtypes.

Heritability studies have shown that part of the microbiome development and composition is under genomic control.41 Studies looking into genome–microbiome interaction have been performed using GWAS technologies in healthy or population-based cohorts.15 16 26 In LifeLines-DEEP cohort, we replicated the association between variants in the LCT gene and abundance of Bifidobacterium, and the association between TIRAP gene (rs560813, T>C, p=0.024) and abundance of genus Holdemania previously reported in Bonder et al,15 which contained partially overlapping samples with the current study. On the level of the general population, the effect of genetic makeup on the variance of microbiome composition is lower compared with the cumulative effect of environmental exposure.20 However, the genetic effects might show more substantial contribution in more specific conditions, such as IBD, which shows more pronounced effects on both genetic and microbial components. Several earlier studies in IBD cohorts have also reported IBD-specific mbQTL variants. We identified variants in the IBD-associated genes TNFSF15 and HLA-B, both genes that have been reported earlier in a study combining mucosal 16s sequencing data and GWAS data.18 The lack of replication of other studies including Lloyd-Price et al27 could partially be explained by the cohort recruitment, for example, Groningen patients with IBD are over 18 years old with long-term disease problems while half of the patients in Lloyd-Price et al are early onset paediatric cases, which have different IBD genetic makeup and microbial features.65 66 Besides, sample size, datasets, included confounders and analysis strategies might also explain differences in results across studies. In the current study, we performed a large-scale mbQTL analysis of gut microbiome composition and function that combined two high-resolution techniques, WES and shotgun metagenomics, while controlling for major confounders known to influence the gut microbiome. While we are only beginning to dissect the genomic architecture that drives microbiome evolution and composition in health and disease, this study adds considerable insights and provides leads for further functional analyses or targets for therapies in the context of IBD.

This research highlights that both common and rare host genetic variants affecting the immune system are key factors in shaping the gut microbiota taxonomy and function, knowledge which further enhances our understanding of the intricate host–microbiome interaction involved in IBD pathogenesis.

Acknowledgments

The authors thank the LifeLines-DEEP and IBD cohort participants. They thank Kate Mc Intyre for substantive English editing and B.H. Jansen for technical support.

References

↵
1. Ng SC,
2. Shi HY,
3. Hamidi N, et al
. Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies. Lancet 2018;390:2769–78.doi:10.1016/S0140-6736(17)32448-0pmid:http://www.ncbi.nlm.nih.gov/pubmed/29050646
OpenUrl CrossRef PubMed
↵
1. de Lange KM,
2. Moutsianas L,
3. Lee JC, et al
. Genome-Wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat Genet 2017;49:256–61.doi:10.1038/ng.3760pmid:http://www.ncbi.nlm.nih.gov/pubmed/28067908
OpenUrl CrossRef PubMed
↵
1. Vich Vila A,
2. Imhann F,
3. Collij V, et al
. Gut microbiota composition and functional changes in inflammatory bowel disease and irritable bowel syndrome. Sci Transl Med 2018;10. doi:doi:10.1126/scitranslmed.aap8914. [Epub ahead of print: 19 Dec 2018].pmid:http://www.ncbi.nlm.nih.gov/pubmed/30567928
OpenUrl PubMed
↵
1. Franzosa EA,
2. Sirota-Madi A,
3. Avila-Pacheco J, et al
. Gut microbiome structure and metabolic activity in inflammatory bowel disease. Nat Microbiol 2019;4:293–305.doi:10.1038/s41564-018-0306-4pmid:http://www.ncbi.nlm.nih.gov/pubmed/30531976
OpenUrl CrossRef PubMed
↵
1. Schirmer M,
2. Garner A,
3. Vlamakis H, et al
. Microbial genes and pathways in inflammatory bowel disease. Nat Rev Microbiol 2019;17:497–511.doi:10.1038/s41579-019-0213-6pmid:http://www.ncbi.nlm.nih.gov/pubmed/31249397
OpenUrl CrossRef PubMed
↵
1. Knights D,
2. Lassen KG,
3. Xavier RJ
. Advances in inflammatory bowel disease pathogenesis: linking host genetics and the microbiome. Gut 2013;62:1505–10.doi:10.1136/gutjnl-2012-303954pmid:http://www.ncbi.nlm.nih.gov/pubmed/24037875
OpenUrl Abstract/FREE Full Text
↵
1. Turpin W,
2. Goethel A,
3. Bedrani L, et al
. Determinants of IBD heritability: genes, bugs, and more. Inflamm Bowel Dis 2018;24:1133–48.doi:10.1093/ibd/izy085pmid:http://www.ncbi.nlm.nih.gov/pubmed/29701818
OpenUrl CrossRef PubMed
↵
1. Hall AB,
2. Tolonen AC,
3. Xavier RJ
. Human genetic variation and the gut microbiome in disease. Nat Rev Genet 2017;18:690–9.doi:10.1038/nrg.2017.63pmid:http://www.ncbi.nlm.nih.gov/pubmed/28824167
OpenUrl CrossRef PubMed
↵
1. Cohen LJ,
2. Cho JH,
3. Gevers D, et al
. Genetic factors and the intestinal microbiome guide development of Microbe-Based therapies for inflammatory bowel diseases. Gastroenterology 2019;156:2174–89.doi:10.1053/j.gastro.2019.03.017pmid:http://www.ncbi.nlm.nih.gov/pubmed/30880022
OpenUrl CrossRef PubMed
↵
1. Kobayashi KS,
2. Chamaillard M,
3. Ogura Y, et al
. Nod2-Dependent regulation of innate and adaptive immunity in the intestinal tract. Science 2005;307:731–4.doi:10.1126/science.1104911pmid:http://www.ncbi.nlm.nih.gov/pubmed/15692051
OpenUrl Abstract/FREE Full Text
↵
1. Mondot S,
2. Barreau F,
3. Al Nabhani Z, et al
. Altered gut microbiota composition in immune-impaired Nod2(-/-) mice. Gut 2012;61:634–5.doi:10.1136/gutjnl-2011-300478pmid:http://www.ncbi.nlm.nih.gov/pubmed/21868489
OpenUrl FREE Full Text
↵
1. Rehman A,
2. Sina C,
3. Gavrilova O, et al
. Nod2 is essential for temporal development of intestinal microbial communities. Gut 2011;60:1354–62.doi:10.1136/gut.2010.216259pmid:http://www.ncbi.nlm.nih.gov/pubmed/21421666
OpenUrl Abstract/FREE Full Text
↵
1. Butera A,
2. Di Paola M,
3. Pavarini L, et al
. Nod2 deficiency in mice is associated with microbiota variation favouring the expansion of mucosal CD4+ LAP+ regulatory cells. Sci Rep 2018;8:14241. doi:10.1038/s41598-018-32583-zpmid:http://www.ncbi.nlm.nih.gov/pubmed/30250234
OpenUrl CrossRef PubMed
↵
1. Sadaghian Sadabad M,
2. Regeling A,
3. de Goffau MC, et al
. The ATG16L1-T300A allele impairs clearance of pathosymbionts in the inflamed ileal mucosa of Crohn's disease patients. Gut 2015;64:1546–52.doi:10.1136/gutjnl-2014-307289pmid:http://www.ncbi.nlm.nih.gov/pubmed/25253126
OpenUrl Abstract/FREE Full Text
↵
1. Bonder MJ,
2. Kurilshikov A,
3. Tigchelaar EF, et al
. The effect of host genetics on the gut microbiome. Nat Genet 2016;48:1407–12.doi:10.1038/ng.3663pmid:http://www.ncbi.nlm.nih.gov/pubmed/27694959
OpenUrl CrossRef PubMed
↵
1. Wang J,
2. Thingholm LB,
3. Skiecevičienė J, et al
. Genome-Wide association analysis identifies variation in vitamin D receptor and other host factors influencing the gut microbiota. Nat Genet 2016;48:1396–406.doi:10.1038/ng.3695pmid:http://www.ncbi.nlm.nih.gov/pubmed/27723756
OpenUrl CrossRef PubMed
↵
1. Aschard H,
2. Laville V,
3. Tchetgen ET, et al
. Genetic effects on the commensal microbiota in inflammatory bowel disease patients. PLoS Genet 2019;15:e1008018. doi:10.1371/journal.pgen.1008018pmid:http://www.ncbi.nlm.nih.gov/pubmed/30849075
OpenUrl CrossRef PubMed
↵
1. Knights D,
2. Silverberg MS,
3. Weersma RK, et al
. Complex host genetics influence the microbiome in inflammatory bowel disease. Genome Med 2014;6:107. doi:10.1186/s13073-014-0107-1pmid:http://www.ncbi.nlm.nih.gov/pubmed/25587358
OpenUrl CrossRef PubMed
↵
1. Kurilshikov A,
2. Wijmenga C,
3. Fu J, et al
. Host genetics and gut microbiome: challenges and perspectives. Trends Immunol 2017;38:633–47.doi:10.1016/j.it.2017.06.003pmid:http://www.ncbi.nlm.nih.gov/pubmed/28669638
OpenUrl CrossRef PubMed
↵
1. Rothschild D,
2. Weissbrod O,
3. Barkan E, et al
. Environment dominates over host genetics in shaping human gut microbiota. Nature 2018;555:210–5.doi:10.1038/nature25973pmid:http://www.ncbi.nlm.nih.gov/pubmed/29489753
OpenUrl CrossRef PubMed
↵
1. McKenna A,
2. Hanna M,
3. Banks E, et al
. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010;20:1297–303.doi:10.1101/gr.107524.110pmid:http://www.ncbi.nlm.nih.gov/pubmed/20644199
OpenUrl Abstract/FREE Full Text
↵
1. Anderson CA,
2. Pettersson FH,
3. Clarke GM, et al
. Data quality control in genetic case-control association studies. Nat Protoc 2010;5:1564–73.doi:10.1038/nprot.2010.116pmid:http://www.ncbi.nlm.nih.gov/pubmed/21085122
OpenUrl CrossRef PubMed Web of Science
↵
1. Babadi M,
2. Lee S,
3. Smirnov A, et al
. Precise common and rare germline CNV calling with GATK 2018.
↵
1. Segata N,
2. Waldron L,
3. Ballarini A, et al
. Metagenomic microbial community profiling using unique clade-specific marker genes. Nat Methods 2012;9:811–4.doi:10.1038/nmeth.2066pmid:http://www.ncbi.nlm.nih.gov/pubmed/22688413
OpenUrl CrossRef PubMed Web of Science
↵
1. Franzosa EA,
2. McIver LJ,
3. Rahnavard G, et al
. Species-level functional profiling of metagenomes and metatranscriptomes. Nat Methods 2018;15:962–8.doi:10.1038/s41592-018-0176-ypmid:http://www.ncbi.nlm.nih.gov/pubmed/30377376
OpenUrl CrossRef PubMed
↵
1. Turpin W,
2. Espin-Garcia O,
3. Xu W, et al
. Association of host genome with intestinal microbial composition in a large healthy cohort. Nat Genet 2016;48:1413–7.doi:10.1038/ng.3693pmid:http://www.ncbi.nlm.nih.gov/pubmed/27694960
OpenUrl CrossRef PubMed
↵
1. Lloyd-Price J,
2. Arze C,
3. Ananthakrishnan AN, et al
. Multi-Omics of the gut microbial ecosystem in inflammatory bowel diseases. Nature 2019;569:655–62.doi:10.1038/s41586-019-1237-9pmid:http://www.ncbi.nlm.nih.gov/pubmed/31142855
OpenUrl CrossRef PubMed
↵
1. Falony G,
2. Joossens M,
3. Vieira-Silva S, et al
. Population-Level analysis of gut microbiome variation. Science 2016;352:560–4.doi:10.1126/science.aad3503pmid:http://www.ncbi.nlm.nih.gov/pubmed/27126039
OpenUrl Abstract/FREE Full Text
↵
1. Zhernakova A,
2. Kurilshikov A,
3. Bonder MJ, et al
. Population-Based metagenomics analysis reveals markers for gut microbiome composition and diversity. Science 2016;352:565–9.doi:10.1126/science.aad3369pmid:http://www.ncbi.nlm.nih.gov/pubmed/27126040
OpenUrl Abstract/FREE Full Text
↵
1. Imhann F,
2. Vich Vila A,
3. Bonder MJ, et al
. Interplay of host genetics and gut microbiota underlying the onset and clinical presentation of inflammatory bowel disease. Gut 2018;67:108–19.doi:10.1136/gutjnl-2016-312135pmid:http://www.ncbi.nlm.nih.gov/pubmed/27802154
OpenUrl Abstract/FREE Full Text
↵
1. Rivas MA,
2. Pirinen M,
3. Conrad DF, et al
. Human genomics. Effect of predicted protein-truncating genetic variants on the human transcriptome. Science 2015;348:666–9.doi:10.1126/science.1261877pmid:http://www.ncbi.nlm.nih.gov/pubmed/25954003
OpenUrl Abstract/FREE Full Text
↵
1. Lee S,
2. Teslovich TM,
3. Boehnke M, et al
. General framework for meta-analysis of rare variants in sequencing association studies. Am J Hum Genet 2013;93:42–53.doi:10.1016/j.ajhg.2013.05.010pmid:http://www.ncbi.nlm.nih.gov/pubmed/23768515
OpenUrl CrossRef PubMed
↵
1. Purcell SM,
2. Moran JL,
3. Fromer M, et al
. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 2014;506:185–90.doi:10.1038/nature12975pmid:http://www.ncbi.nlm.nih.gov/pubmed/24463508
OpenUrl CrossRef PubMed Web of Science
↵
1. Frenkel S,
2. Bernstein CN,
3. Sargent M, et al
. Genome-Wide analysis identifies rare copy number variations associated with inflammatory bowel disease. PLoS One 2019;14:e0217846. doi:10.1371/journal.pone.0217846pmid:http://www.ncbi.nlm.nih.gov/pubmed/31185018
OpenUrl CrossRef PubMed
↵
1. Peters JE,
2. Lyons PA,
3. Lee JC, et al
. Insight into genotype-phenotype associations through eQTL mapping in multiple cell types in health and immune-mediated disease. PLoS Genet 2016;12:e1005908. doi:10.1371/journal.pgen.1005908pmid:http://www.ncbi.nlm.nih.gov/pubmed/27015630
OpenUrl CrossRef PubMed
↵
1. GTEx Consortium
. Human genomics. The Genotype-Tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 2015;348:648–60.doi:10.1126/science.1262110pmid:http://www.ncbi.nlm.nih.gov/pubmed/25954001
OpenUrl Abstract/FREE Full Text
↵
1. Kuleshov MV,
2. Jones MR,
3. Rouillard AD, et al
. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 2016;44:W90–7.doi:10.1093/nar/gkw377pmid:http://www.ncbi.nlm.nih.gov/pubmed/27141961
OpenUrl CrossRef PubMed
↵
1. Watanabe K,
2. Taskesen E,
3. van Bochoven A, et al
. Functional mapping and annotation of genetic associations with FUMA. Nat Commun 2017;8:1826. doi:10.1038/s41467-017-01261-5pmid:http://www.ncbi.nlm.nih.gov/pubmed/29184056
OpenUrl CrossRef PubMed
↵
1. Franke A,
2. Balschun T,
3. Sina C, et al
. Genome-Wide association study for ulcerative colitis identifies risk loci at 7q22 and 22q13 (IL17REL). Nat Genet 2010;42:292–4.doi:10.1038/ng.553pmid:http://www.ncbi.nlm.nih.gov/pubmed/20228798
OpenUrl CrossRef PubMed Web of Science
↵
1. Prescott NJ,
2. Lehne B,
3. Stone K, et al
. Pooled sequencing of 531 genes in inflammatory bowel disease identifies an associated rare variant in BTNL2 and implicates other immune related genes. PLoS Genet 2015;11:e1004955–19.doi:10.1371/journal.pgen.1004955pmid:http://www.ncbi.nlm.nih.gov/pubmed/25671699
OpenUrl CrossRef PubMed
↵
1. Goodrich JK,
2. Davenport ER,
3. Beaumont M, et al
. Genetic determinants of the gut microbiome in UK twins. Cell Host Microbe 2016;19:731–43.doi:10.1016/j.chom.2016.04.017pmid:http://www.ncbi.nlm.nih.gov/pubmed/27173935
OpenUrl CrossRef PubMed
↵
1. Kolde R,
2. Franzosa EA,
3. Rahnavard G, et al
. Host genetic variation and its microbiome interactions within the human microbiome project. Genome Med 2018;10:6. doi:10.1186/s13073-018-0515-8pmid:http://www.ncbi.nlm.nih.gov/pubmed/29378630
OpenUrl CrossRef PubMed
↵
1. Liu JZ,
2. van Sommeren S,
3. Huang H, et al
. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet 2015;47:979–86.doi:10.1038/ng.3359pmid:http://www.ncbi.nlm.nih.gov/pubmed/26192919
OpenUrl CrossRef PubMed
↵
1. Koh A,
2. De Vadder F,
3. Kovatcheva-Datchary P, et al
. From dietary fiber to host physiology: short-chain fatty acids as key bacterial metabolites. Cell 2016;165:1332–45.doi:10.1016/j.cell.2016.05.041pmid:http://www.ncbi.nlm.nih.gov/pubmed/27259147
OpenUrl CrossRef PubMed
↵
1. Ríos-Covián D,
2. Ruas-Madiedo P,
3. Margolles A, et al
. Intestinal short chain fatty acids and their link with diet and human health. Front Microbiol 2016;7:185. doi:10.3389/fmicb.2016.00185pmid:http://www.ncbi.nlm.nih.gov/pubmed/26925050
OpenUrl CrossRef PubMed
↵
1. Carethers JM,
2. Jung BH
. Genetics and genetic biomarkers in sporadic colorectal cancer. Gastroenterology 2015;149:1177–90.doi:10.1053/j.gastro.2015.06.047pmid:http://www.ncbi.nlm.nih.gov/pubmed/26216840
OpenUrl CrossRef PubMed
↵
1. Costea I,
2. Mack DR,
3. Lemaitre RN, et al
. Interactions between the dietary polyunsaturated fatty acid ratio and genetic factors determine susceptibility to pediatric Crohn's disease. Gastroenterology 2014;146:929–31.doi:10.1053/j.gastro.2013.12.034pmid:http://www.ncbi.nlm.nih.gov/pubmed/24406470
OpenUrl CrossRef PubMed
↵
1. Illig T,
2. Gieger C,
3. Zhai G, et al
. A genome-wide perspective of genetic variation in human metabolism. Nat Genet 2010;42:137–41.doi:10.1038/ng.507pmid:http://www.ncbi.nlm.nih.gov/pubmed/20037589
OpenUrl CrossRef PubMed Web of Science
↵
1. Marion-Letellier R,
2. Savoye G,
3. Beck PL, et al
. Polyunsaturated fatty acids in inflammatory bowel diseases: a reappraisal of effects and therapeutic approaches. Inflamm Bowel Dis 2013;19:650–61.doi:10.1097/MIB.0b013e3182810122pmid:http://www.ncbi.nlm.nih.gov/pubmed/23328774
OpenUrl CrossRef PubMed
↵
1. Sun L,
2. Youn HD,
3. Loh C, et al
. Cabin 1, a negative regulator for calcineurin signaling in T lymphocytes. Immunity 1998;8:703–11.doi:10.1016/S1074-7613(00)80575-0pmid:http://www.ncbi.nlm.nih.gov/pubmed/9655484
OpenUrl CrossRef PubMed Web of Science
↵
1. Gulick AM,
2. Hubbard BK,
3. Gerlt JA, et al
. Evolution of enzymatic activities in the enolase superfamily: identification of the general acid catalyst in the active site of D-glucarate dehydratase from Escherichia coli. Biochemistry 2001;40:10054–62.doi:10.1021/bi010733bpmid:http://www.ncbi.nlm.nih.gov/pubmed/11513584
OpenUrl CrossRef PubMed Web of Science
↵
1. Lewis JD,
2. Chen EZ,
3. Baldassano RN, et al
. Inflammation, antibiotics, and diet as environmental stressors of the gut microbiome in pediatric Crohn's disease. Cell Host Microbe 2015;18:489–500.doi:10.1016/j.chom.2015.09.008pmid:http://www.ncbi.nlm.nih.gov/pubmed/26468751
OpenUrl CrossRef PubMed
↵
1. Wang T,
2. Cai G,
3. Qiu Y, et al
. Structural segregation of gut microbiota between colorectal cancer patients and healthy volunteers. Isme J 2012;6:320–9.doi:10.1038/ismej.2011.109pmid:http://www.ncbi.nlm.nih.gov/pubmed/21850056
OpenUrl CrossRef PubMed Web of Science
↵
1. Ridaura VK,
2. Faith JJ,
3. Rey FE, et al
. Gut microbiota from twins discordant for obesity modulate metabolism in mice. Science 2013;341:1241214. doi:10.1126/science.1241214pmid:http://www.ncbi.nlm.nih.gov/pubmed/24009397
OpenUrl Abstract/FREE Full Text
↵
1. Schirmer M,
2. Smeekens SP,
3. Vlamakis H, et al
. Linking the human gut microbiome to inflammatory cytokine production capacity. Cell 2016;167:1125–36.doi:10.1016/j.cell.2016.10.020pmid:http://www.ncbi.nlm.nih.gov/pubmed/27814509
OpenUrl CrossRef PubMed
↵
1. Zhernakova A,
2. Festen EM,
3. Franke L, et al
. Genetic analysis of innate immunity in Crohn's disease and ulcerative colitis identifies two susceptibility loci harboring CARD9 and IL18RAP. Am J Hum Genet 2008;82:1202–10.doi:10.1016/j.ajhg.2008.03.016pmid:http://www.ncbi.nlm.nih.gov/pubmed/18439550
OpenUrl CrossRef PubMed Web of Science
↵
1. Schirmer M, et al
. Inflammatory bowel disease gut microbiome. Nat. Microbiol 2017.
↵
1. Peters LA,
2. Perrigoue J,
3. Mortha A, et al
. A functional genomics predictive network model identifies regulators of inflammatory bowel disease. Nat Genet 2017;49:1437–49.doi:10.1038/ng.3947pmid:http://www.ncbi.nlm.nih.gov/pubmed/28892060
OpenUrl CrossRef PubMed
↵
1. Cho H,
2. Kehrl JH
. Regulation of immune function by G protein-coupled receptors, trimeric G proteins, and RGS proteins. Prog Mol Biol Transl Sci 2009;86:249–98.doi:10.1016/S1877-1173(09)86009-2pmid:http://www.ncbi.nlm.nih.gov/pubmed/20374719
OpenUrl CrossRef PubMed
↵
1. Gräler MH,
2. Goetzl EJ
. Lysophospholipids and their G protein-coupled receptors in inflammation and immunity. Biochim Biophys Acta 2002;1582:168–74.doi:10.1016/S1388-1981(02)00152-Xpmid:http://www.ncbi.nlm.nih.gov/pubmed/12069825
OpenUrl CrossRef PubMed Web of Science
↵
1. Emdin CA,
2. Khera AV,
3. Chaffin M, et al
. Analysis of predicted loss-of-function variants in UK Biobank identifies variants protective for disease. Nat Commun 2018;9:1613. doi:10.1038/s41467-018-03911-8pmid:http://www.ncbi.nlm.nih.gov/pubmed/29691411
OpenUrl CrossRef PubMed
↵
1. Fons P,
2. Chabot S,
3. Cartwright JE, et al
. Soluble HLA-G1 inhibits angiogenesis through an apoptotic pathway and by direct binding to CD160 receptor expressed by endothelial cells. Blood 2006;108:2608–15.doi:10.1182/blood-2005-12-019919pmid:http://www.ncbi.nlm.nih.gov/pubmed/16809620
OpenUrl Abstract/FREE Full Text
↵
1. Cai G,
2. Anumanthan A,
3. Brown JA, et al
. CD160 inhibits activation of human CD4+ T cells through interaction with herpesvirus entry mediator. Nat Immunol 2008;9:176–85.doi:10.1038/ni1554pmid:http://www.ncbi.nlm.nih.gov/pubmed/18193050
OpenUrl CrossRef PubMed Web of Science
↵
1. Shui J-W,
2. Larange A,
3. Kim G, et al
. HVEM signalling at mucosal barriers provides host defence against pathogenic bacteria. Nature 2012;488:222–5.doi:10.1038/nature11242pmid:http://www.ncbi.nlm.nih.gov/pubmed/22801499
OpenUrl CrossRef PubMed
↵
1. Kelsen JR,
2. Dawany N,
3. Moran CJ, et al
. Exome sequencing analysis reveals variants in primary immunodeficiency genes in patients with very early onset inflammatory bowel disease. Gastroenterology 2015;149:1415–24.doi:10.1053/j.gastro.2015.07.006pmid:http://www.ncbi.nlm.nih.gov/pubmed/26193622
OpenUrl CrossRef PubMed
↵
1. An R,
2. Wilms E,
3. Masclee AAM, et al
. Age-Dependent changes in Gi physiology and microbiota: time to reconsider? Gut 2018;67:2213–22.doi:10.1136/gutjnl-2017-315542pmid:http://www.ncbi.nlm.nih.gov/pubmed/30194220
OpenUrl Abstract/FREE Full Text

Footnotes

SH, AVV, RG and VC are joint first authors.
AZ, AK and RKW are joint senior authors.
Contributors Study supervision: RKW and AK. Analysis and drafting: SH, AVV, RG and VC. Data support: CS, MR, RX, MJD, JMF, IW and MET. Critical revision: RKW, AK, AZ, JF, CW, FI, EAF, HMvD, GD, MCV and LB. Shared last authors: AZ, AK and RKW.
Funding MR is supported by a National Institute of Health Center for Multi- and Trans-Ethnic Mapping of Mendelian and Complex Diseases grant (5U01 HG009080) and by the National Human Genome Research Institute of the National Institutes of Health (NIH) under award R01HG010140. CW is supported by a European Research Council (ERC) Advanced grant (FP/2007-2013/ERC grant 2012-322698), a Netherlands Organization for Scientific Research (NWO) Spinoza prize grant (NWO SPI 92-266) and the Gravitation Netherlands Organ-on-Chip Initiative (024.003.001). JF is supported by grants from NWO (NWO-VIDI 864.13.013) and CardioVasculair Onderzoek Nederland (CVON 2018-27). AZ is supported by an NWO Vidi grant (NWO-VIDI 016.178.056), an ERC Starting Grant (715772), CVON 2018-27 and a Rosalind Franklin Fellowship from the University of Groningen. Copy number variant analyses were supported by NIH MH115957 to MET.
Disclaimer The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Competing interests None declared.
Patient and public involvement Patients and/or the public were involved in the design, or conduct, or reporting, or dissemination plans of this research. Refer to the Methods section for further details.
Patient consent for publication Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data are available upon reasonable request. The data for the LifeLines DEEP cohort available upon request from the European Genome-Phenome Archive (EGA; https://www.ebi.ac.uk/ega/) at accession number EGAS00001001704. The data for the Groningen IBD cohort can be requested with the accession number EGAS00001002702.

[1] ↵
Ng SC,
Shi HY,
Hamidi N, et al
. Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies. Lancet 2018;390:2769–78.doi:10.1016/S0140-6736(17)32448-0pmid:http://www.ncbi.nlm.nih.gov/pubmed/29050646
OpenUrl CrossRef PubMed

[2] Ng SC,

[3] Shi HY,

[4] Hamidi N, et al

[5] ↵
de Lange KM,
Moutsianas L,
Lee JC, et al
. Genome-Wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat Genet 2017;49:256–61.doi:10.1038/ng.3760pmid:http://www.ncbi.nlm.nih.gov/pubmed/28067908
OpenUrl CrossRef PubMed

[6] de Lange KM,

[7] Moutsianas L,

[8] Lee JC, et al

[9] ↵
Vich Vila A,
Imhann F,
Collij V, et al
. Gut microbiota composition and functional changes in inflammatory bowel disease and irritable bowel syndrome. Sci Transl Med 2018;10. doi:doi:10.1126/scitranslmed.aap8914. [Epub ahead of print: 19 Dec 2018].pmid:http://www.ncbi.nlm.nih.gov/pubmed/30567928
OpenUrl PubMed

[10] Vich Vila A,

[11] Imhann F,

[12] Collij V, et al

[13] ↵
Franzosa EA,
Sirota-Madi A,
Avila-Pacheco J, et al
. Gut microbiome structure and metabolic activity in inflammatory bowel disease. Nat Microbiol 2019;4:293–305.doi:10.1038/s41564-018-0306-4pmid:http://www.ncbi.nlm.nih.gov/pubmed/30531976
OpenUrl CrossRef PubMed

[14] Franzosa EA,

[15] Sirota-Madi A,

[16] Avila-Pacheco J, et al

[17] ↵
Schirmer M,
Garner A,
Vlamakis H, et al
. Microbial genes and pathways in inflammatory bowel disease. Nat Rev Microbiol 2019;17:497–511.doi:10.1038/s41579-019-0213-6pmid:http://www.ncbi.nlm.nih.gov/pubmed/31249397
OpenUrl CrossRef PubMed

[18] Schirmer M,

[19] Garner A,

[20] Vlamakis H, et al

[21] ↵
Knights D,
Lassen KG,
Xavier RJ
. Advances in inflammatory bowel disease pathogenesis: linking host genetics and the microbiome. Gut 2013;62:1505–10.doi:10.1136/gutjnl-2012-303954pmid:http://www.ncbi.nlm.nih.gov/pubmed/24037875
OpenUrl Abstract/FREE Full Text

[22] Knights D,

[23] Lassen KG,

[24] Xavier RJ

[25] ↵
Turpin W,
Goethel A,
Bedrani L, et al
. Determinants of IBD heritability: genes, bugs, and more. Inflamm Bowel Dis 2018;24:1133–48.doi:10.1093/ibd/izy085pmid:http://www.ncbi.nlm.nih.gov/pubmed/29701818
OpenUrl CrossRef PubMed

[26] Turpin W,

[27] Goethel A,

[28] Bedrani L, et al

[29] ↵
Hall AB,
Tolonen AC,
Xavier RJ
. Human genetic variation and the gut microbiome in disease. Nat Rev Genet 2017;18:690–9.doi:10.1038/nrg.2017.63pmid:http://www.ncbi.nlm.nih.gov/pubmed/28824167
OpenUrl CrossRef PubMed

[30] Hall AB,

[31] Tolonen AC,

[32] Xavier RJ

[33] ↵
Cohen LJ,
Cho JH,
Gevers D, et al
. Genetic factors and the intestinal microbiome guide development of Microbe-Based therapies for inflammatory bowel diseases. Gastroenterology 2019;156:2174–89.doi:10.1053/j.gastro.2019.03.017pmid:http://www.ncbi.nlm.nih.gov/pubmed/30880022
OpenUrl CrossRef PubMed

[34] Cohen LJ,

[35] Cho JH,

[36] Gevers D, et al

[37] ↵
Kobayashi KS,
Chamaillard M,
Ogura Y, et al
. Nod2-Dependent regulation of innate and adaptive immunity in the intestinal tract. Science 2005;307:731–4.doi:10.1126/science.1104911pmid:http://www.ncbi.nlm.nih.gov/pubmed/15692051
OpenUrl Abstract/FREE Full Text

[38] Kobayashi KS,

[39] Chamaillard M,

[40] Ogura Y, et al

[41] ↵
Mondot S,
Barreau F,
Al Nabhani Z, et al
. Altered gut microbiota composition in immune-impaired Nod2(-/-) mice. Gut 2012;61:634–5.doi:10.1136/gutjnl-2011-300478pmid:http://www.ncbi.nlm.nih.gov/pubmed/21868489
OpenUrl FREE Full Text

[42] Mondot S,

[43] Barreau F,

[44] Al Nabhani Z, et al

[45] ↵
Rehman A,
Sina C,
Gavrilova O, et al
. Nod2 is essential for temporal development of intestinal microbial communities. Gut 2011;60:1354–62.doi:10.1136/gut.2010.216259pmid:http://www.ncbi.nlm.nih.gov/pubmed/21421666
OpenUrl Abstract/FREE Full Text

[46] Rehman A,

[47] Sina C,

[48] Gavrilova O, et al

[49] ↵
Butera A,
Di Paola M,
Pavarini L, et al
. Nod2 deficiency in mice is associated with microbiota variation favouring the expansion of mucosal CD4+ LAP+ regulatory cells. Sci Rep 2018;8:14241. doi:10.1038/s41598-018-32583-zpmid:http://www.ncbi.nlm.nih.gov/pubmed/30250234
OpenUrl CrossRef PubMed

[50] Butera A,

[51] Di Paola M,

[52] Pavarini L, et al

[53] ↵
Sadaghian Sadabad M,
Regeling A,
de Goffau MC, et al
. The ATG16L1-T300A allele impairs clearance of pathosymbionts in the inflamed ileal mucosa of Crohn's disease patients. Gut 2015;64:1546–52.doi:10.1136/gutjnl-2014-307289pmid:http://www.ncbi.nlm.nih.gov/pubmed/25253126
OpenUrl Abstract/FREE Full Text

[54] Sadaghian Sadabad M,

[55] Regeling A,

[56] de Goffau MC, et al

[57] ↵
Bonder MJ,
Kurilshikov A,
Tigchelaar EF, et al
. The effect of host genetics on the gut microbiome. Nat Genet 2016;48:1407–12.doi:10.1038/ng.3663pmid:http://www.ncbi.nlm.nih.gov/pubmed/27694959
OpenUrl CrossRef PubMed

[58] Bonder MJ,

[59] Kurilshikov A,

[60] Tigchelaar EF, et al

[61] ↵
Wang J,
Thingholm LB,
Skiecevičienė J, et al
. Genome-Wide association analysis identifies variation in vitamin D receptor and other host factors influencing the gut microbiota. Nat Genet 2016;48:1396–406.doi:10.1038/ng.3695pmid:http://www.ncbi.nlm.nih.gov/pubmed/27723756
OpenUrl CrossRef PubMed

[62] Wang J,

[63] Thingholm LB,

[64] Skiecevičienė J, et al

[65] ↵
Aschard H,
Laville V,
Tchetgen ET, et al
. Genetic effects on the commensal microbiota in inflammatory bowel disease patients. PLoS Genet 2019;15:e1008018. doi:10.1371/journal.pgen.1008018pmid:http://www.ncbi.nlm.nih.gov/pubmed/30849075
OpenUrl CrossRef PubMed

[66] Aschard H,

[67] Laville V,

[68] Tchetgen ET, et al

[69] ↵
Knights D,
Silverberg MS,
Weersma RK, et al
. Complex host genetics influence the microbiome in inflammatory bowel disease. Genome Med 2014;6:107. doi:10.1186/s13073-014-0107-1pmid:http://www.ncbi.nlm.nih.gov/pubmed/25587358
OpenUrl CrossRef PubMed

[70] Knights D,

[71] Silverberg MS,

[72] Weersma RK, et al

[73] ↵
Kurilshikov A,
Wijmenga C,
Fu J, et al
. Host genetics and gut microbiome: challenges and perspectives. Trends Immunol 2017;38:633–47.doi:10.1016/j.it.2017.06.003pmid:http://www.ncbi.nlm.nih.gov/pubmed/28669638
OpenUrl CrossRef PubMed

[74] Kurilshikov A,

[75] Wijmenga C,

[76] Fu J, et al

[77] ↵
Rothschild D,
Weissbrod O,
Barkan E, et al
. Environment dominates over host genetics in shaping human gut microbiota. Nature 2018;555:210–5.doi:10.1038/nature25973pmid:http://www.ncbi.nlm.nih.gov/pubmed/29489753
OpenUrl CrossRef PubMed

[78] Rothschild D,

[79] Weissbrod O,

[80] Barkan E, et al

[81] ↵
McKenna A,
Hanna M,
Banks E, et al
. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010;20:1297–303.doi:10.1101/gr.107524.110pmid:http://www.ncbi.nlm.nih.gov/pubmed/20644199
OpenUrl Abstract/FREE Full Text

[82] McKenna A,

[83] Hanna M,

[84] Banks E, et al

[85] ↵
Anderson CA,
Pettersson FH,
Clarke GM, et al
. Data quality control in genetic case-control association studies. Nat Protoc 2010;5:1564–73.doi:10.1038/nprot.2010.116pmid:http://www.ncbi.nlm.nih.gov/pubmed/21085122
OpenUrl CrossRef PubMed Web of Science

[86] Anderson CA,

[87] Pettersson FH,

[88] Clarke GM, et al

[89] ↵
Babadi M,
Lee S,
Smirnov A, et al
. Precise common and rare germline CNV calling with GATK 2018.

[90] Babadi M,

[91] Lee S,

[92] Smirnov A, et al

[93] ↵
Segata N,
Waldron L,
Ballarini A, et al
. Metagenomic microbial community profiling using unique clade-specific marker genes. Nat Methods 2012;9:811–4.doi:10.1038/nmeth.2066pmid:http://www.ncbi.nlm.nih.gov/pubmed/22688413
OpenUrl CrossRef PubMed Web of Science

[94] Segata N,

[95] Waldron L,

[96] Ballarini A, et al

[97] ↵
Franzosa EA,
McIver LJ,
Rahnavard G, et al
. Species-level functional profiling of metagenomes and metatranscriptomes. Nat Methods 2018;15:962–8.doi:10.1038/s41592-018-0176-ypmid:http://www.ncbi.nlm.nih.gov/pubmed/30377376
OpenUrl CrossRef PubMed

[98] Franzosa EA,

[99] McIver LJ,

[100] Rahnavard G, et al

[101] ↵
Turpin W,
Espin-Garcia O,
Xu W, et al
. Association of host genome with intestinal microbial composition in a large healthy cohort. Nat Genet 2016;48:1413–7.doi:10.1038/ng.3693pmid:http://www.ncbi.nlm.nih.gov/pubmed/27694960
OpenUrl CrossRef PubMed

[102] Turpin W,

[103] Espin-Garcia O,

[104] Xu W, et al

[105] ↵
Lloyd-Price J,
Arze C,
Ananthakrishnan AN, et al
. Multi-Omics of the gut microbial ecosystem in inflammatory bowel diseases. Nature 2019;569:655–62.doi:10.1038/s41586-019-1237-9pmid:http://www.ncbi.nlm.nih.gov/pubmed/31142855
OpenUrl CrossRef PubMed

[106] Lloyd-Price J,

[107] Arze C,

[108] Ananthakrishnan AN, et al

[109] ↵
Falony G,
Joossens M,
Vieira-Silva S, et al
. Population-Level analysis of gut microbiome variation. Science 2016;352:560–4.doi:10.1126/science.aad3503pmid:http://www.ncbi.nlm.nih.gov/pubmed/27126039
OpenUrl Abstract/FREE Full Text

[110] Falony G,

[111] Joossens M,

[112] Vieira-Silva S, et al

[113] ↵
Zhernakova A,
Kurilshikov A,
Bonder MJ, et al
. Population-Based metagenomics analysis reveals markers for gut microbiome composition and diversity. Science 2016;352:565–9.doi:10.1126/science.aad3369pmid:http://www.ncbi.nlm.nih.gov/pubmed/27126040
OpenUrl Abstract/FREE Full Text

[114] Zhernakova A,

[115] Kurilshikov A,

[116] Bonder MJ, et al

[117] ↵
Imhann F,
Vich Vila A,
Bonder MJ, et al
. Interplay of host genetics and gut microbiota underlying the onset and clinical presentation of inflammatory bowel disease. Gut 2018;67:108–19.doi:10.1136/gutjnl-2016-312135pmid:http://www.ncbi.nlm.nih.gov/pubmed/27802154
OpenUrl Abstract/FREE Full Text

[118] Imhann F,

[119] Vich Vila A,

[120] Bonder MJ, et al

[121] ↵
Rivas MA,
Pirinen M,
Conrad DF, et al
. Human genomics. Effect of predicted protein-truncating genetic variants on the human transcriptome. Science 2015;348:666–9.doi:10.1126/science.1261877pmid:http://www.ncbi.nlm.nih.gov/pubmed/25954003
OpenUrl Abstract/FREE Full Text

[122] Rivas MA,

[123] Pirinen M,

[124] Conrad DF, et al

[125] ↵
Lee S,
Teslovich TM,
Boehnke M, et al
. General framework for meta-analysis of rare variants in sequencing association studies. Am J Hum Genet 2013;93:42–53.doi:10.1016/j.ajhg.2013.05.010pmid:http://www.ncbi.nlm.nih.gov/pubmed/23768515
OpenUrl CrossRef PubMed

[126] Lee S,

[127] Teslovich TM,

[128] Boehnke M, et al

[129] ↵
Purcell SM,
Moran JL,
Fromer M, et al
. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 2014;506:185–90.doi:10.1038/nature12975pmid:http://www.ncbi.nlm.nih.gov/pubmed/24463508
OpenUrl CrossRef PubMed Web of Science

[130] Purcell SM,

[131] Moran JL,

[132] Fromer M, et al

[133] ↵
Frenkel S,
Bernstein CN,
Sargent M, et al
. Genome-Wide analysis identifies rare copy number variations associated with inflammatory bowel disease. PLoS One 2019;14:e0217846. doi:10.1371/journal.pone.0217846pmid:http://www.ncbi.nlm.nih.gov/pubmed/31185018
OpenUrl CrossRef PubMed

[134] Frenkel S,

[135] Bernstein CN,

[136] Sargent M, et al

[137] ↵
Peters JE,
Lyons PA,
Lee JC, et al
. Insight into genotype-phenotype associations through eQTL mapping in multiple cell types in health and immune-mediated disease. PLoS Genet 2016;12:e1005908. doi:10.1371/journal.pgen.1005908pmid:http://www.ncbi.nlm.nih.gov/pubmed/27015630
OpenUrl CrossRef PubMed

[138] Peters JE,

[139] Lyons PA,

[140] Lee JC, et al

[141] ↵
GTEx Consortium
. Human genomics. The Genotype-Tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 2015;348:648–60.doi:10.1126/science.1262110pmid:http://www.ncbi.nlm.nih.gov/pubmed/25954001
OpenUrl Abstract/FREE Full Text

[142] GTEx Consortium

[143] ↵
Kuleshov MV,
Jones MR,
Rouillard AD, et al
. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 2016;44:W90–7.doi:10.1093/nar/gkw377pmid:http://www.ncbi.nlm.nih.gov/pubmed/27141961
OpenUrl CrossRef PubMed

[144] Kuleshov MV,

[145] Jones MR,

[146] Rouillard AD, et al

[147] ↵
Watanabe K,
Taskesen E,
van Bochoven A, et al
. Functional mapping and annotation of genetic associations with FUMA. Nat Commun 2017;8:1826. doi:10.1038/s41467-017-01261-5pmid:http://www.ncbi.nlm.nih.gov/pubmed/29184056
OpenUrl CrossRef PubMed

[148] Watanabe K,

[149] Taskesen E,

[150] van Bochoven A, et al

[151] ↵
Franke A,
Balschun T,
Sina C, et al
. Genome-Wide association study for ulcerative colitis identifies risk loci at 7q22 and 22q13 (IL17REL). Nat Genet 2010;42:292–4.doi:10.1038/ng.553pmid:http://www.ncbi.nlm.nih.gov/pubmed/20228798
OpenUrl CrossRef PubMed Web of Science

[152] Franke A,

[153] Balschun T,

[154] Sina C, et al

[155] ↵
Prescott NJ,
Lehne B,
Stone K, et al
. Pooled sequencing of 531 genes in inflammatory bowel disease identifies an associated rare variant in BTNL2 and implicates other immune related genes. PLoS Genet 2015;11:e1004955–19.doi:10.1371/journal.pgen.1004955pmid:http://www.ncbi.nlm.nih.gov/pubmed/25671699
OpenUrl CrossRef PubMed

[156] Prescott NJ,

[157] Lehne B,

[158] Stone K, et al

[159] ↵
Goodrich JK,
Davenport ER,
Beaumont M, et al
. Genetic determinants of the gut microbiome in UK twins. Cell Host Microbe 2016;19:731–43.doi:10.1016/j.chom.2016.04.017pmid:http://www.ncbi.nlm.nih.gov/pubmed/27173935
OpenUrl CrossRef PubMed

[160] Goodrich JK,

[161] Davenport ER,

[162] Beaumont M, et al

[163] ↵
Kolde R,
Franzosa EA,
Rahnavard G, et al
. Host genetic variation and its microbiome interactions within the human microbiome project. Genome Med 2018;10:6. doi:10.1186/s13073-018-0515-8pmid:http://www.ncbi.nlm.nih.gov/pubmed/29378630
OpenUrl CrossRef PubMed

[164] Kolde R,

[165] Franzosa EA,

[166] Rahnavard G, et al

[167] ↵
Liu JZ,
van Sommeren S,
Huang H, et al
. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet 2015;47:979–86.doi:10.1038/ng.3359pmid:http://www.ncbi.nlm.nih.gov/pubmed/26192919
OpenUrl CrossRef PubMed

[168] Liu JZ,

[169] van Sommeren S,

[170] Huang H, et al

[171] ↵
Koh A,
De Vadder F,
Kovatcheva-Datchary P, et al
. From dietary fiber to host physiology: short-chain fatty acids as key bacterial metabolites. Cell 2016;165:1332–45.doi:10.1016/j.cell.2016.05.041pmid:http://www.ncbi.nlm.nih.gov/pubmed/27259147
OpenUrl CrossRef PubMed

[172] Koh A,

[173] De Vadder F,

[174] Kovatcheva-Datchary P, et al

[175] ↵
Ríos-Covián D,
Ruas-Madiedo P,
Margolles A, et al
. Intestinal short chain fatty acids and their link with diet and human health. Front Microbiol 2016;7:185. doi:10.3389/fmicb.2016.00185pmid:http://www.ncbi.nlm.nih.gov/pubmed/26925050
OpenUrl CrossRef PubMed

[176] Ríos-Covián D,

[177] Ruas-Madiedo P,

[178] Margolles A, et al

[179] ↵
Carethers JM,
Jung BH
. Genetics and genetic biomarkers in sporadic colorectal cancer. Gastroenterology 2015;149:1177–90.doi:10.1053/j.gastro.2015.06.047pmid:http://www.ncbi.nlm.nih.gov/pubmed/26216840
OpenUrl CrossRef PubMed

[180] Carethers JM,

[181] Jung BH

[182] ↵
Costea I,
Mack DR,
Lemaitre RN, et al
. Interactions between the dietary polyunsaturated fatty acid ratio and genetic factors determine susceptibility to pediatric Crohn's disease. Gastroenterology 2014;146:929–31.doi:10.1053/j.gastro.2013.12.034pmid:http://www.ncbi.nlm.nih.gov/pubmed/24406470
OpenUrl CrossRef PubMed

[183] Costea I,

[184] Mack DR,

[185] Lemaitre RN, et al

[186] ↵
Illig T,
Gieger C,
Zhai G, et al
. A genome-wide perspective of genetic variation in human metabolism. Nat Genet 2010;42:137–41.doi:10.1038/ng.507pmid:http://www.ncbi.nlm.nih.gov/pubmed/20037589
OpenUrl CrossRef PubMed Web of Science

[187] Illig T,

[188] Gieger C,

[189] Zhai G, et al

[190] ↵
Marion-Letellier R,
Savoye G,
Beck PL, et al
. Polyunsaturated fatty acids in inflammatory bowel diseases: a reappraisal of effects and therapeutic approaches. Inflamm Bowel Dis 2013;19:650–61.doi:10.1097/MIB.0b013e3182810122pmid:http://www.ncbi.nlm.nih.gov/pubmed/23328774
OpenUrl CrossRef PubMed

[191] Marion-Letellier R,

[192] Savoye G,

[193] Beck PL, et al

[194] ↵
Sun L,
Youn HD,
Loh C, et al
. Cabin 1, a negative regulator for calcineurin signaling in T lymphocytes. Immunity 1998;8:703–11.doi:10.1016/S1074-7613(00)80575-0pmid:http://www.ncbi.nlm.nih.gov/pubmed/9655484
OpenUrl CrossRef PubMed Web of Science

[195] Sun L,

[196] Youn HD,

[197] Loh C, et al

[198] ↵
Gulick AM,
Hubbard BK,
Gerlt JA, et al
. Evolution of enzymatic activities in the enolase superfamily: identification of the general acid catalyst in the active site of D-glucarate dehydratase from Escherichia coli. Biochemistry 2001;40:10054–62.doi:10.1021/bi010733bpmid:http://www.ncbi.nlm.nih.gov/pubmed/11513584
OpenUrl CrossRef PubMed Web of Science

[199] Gulick AM,

[200] Hubbard BK,

[201] Gerlt JA, et al

[202] ↵
Lewis JD,
Chen EZ,
Baldassano RN, et al
. Inflammation, antibiotics, and diet as environmental stressors of the gut microbiome in pediatric Crohn's disease. Cell Host Microbe 2015;18:489–500.doi:10.1016/j.chom.2015.09.008pmid:http://www.ncbi.nlm.nih.gov/pubmed/26468751
OpenUrl CrossRef PubMed

[203] Lewis JD,

[204] Chen EZ,

[205] Baldassano RN, et al

[206] ↵
Wang T,
Cai G,
Qiu Y, et al
. Structural segregation of gut microbiota between colorectal cancer patients and healthy volunteers. Isme J 2012;6:320–9.doi:10.1038/ismej.2011.109pmid:http://www.ncbi.nlm.nih.gov/pubmed/21850056
OpenUrl CrossRef PubMed Web of Science

[207] Wang T,

[208] Cai G,

[209] Qiu Y, et al

[210] ↵
Ridaura VK,
Faith JJ,
Rey FE, et al
. Gut microbiota from twins discordant for obesity modulate metabolism in mice. Science 2013;341:1241214. doi:10.1126/science.1241214pmid:http://www.ncbi.nlm.nih.gov/pubmed/24009397
OpenUrl Abstract/FREE Full Text

[211] Ridaura VK,

[212] Faith JJ,

[213] Rey FE, et al

[214] ↵
Schirmer M,
Smeekens SP,
Vlamakis H, et al
. Linking the human gut microbiome to inflammatory cytokine production capacity. Cell 2016;167:1125–36.doi:10.1016/j.cell.2016.10.020pmid:http://www.ncbi.nlm.nih.gov/pubmed/27814509
OpenUrl CrossRef PubMed

[215] Schirmer M,

[216] Smeekens SP,

[217] Vlamakis H, et al

[218] ↵
Zhernakova A,
Festen EM,
Franke L, et al
. Genetic analysis of innate immunity in Crohn's disease and ulcerative colitis identifies two susceptibility loci harboring CARD9 and IL18RAP. Am J Hum Genet 2008;82:1202–10.doi:10.1016/j.ajhg.2008.03.016pmid:http://www.ncbi.nlm.nih.gov/pubmed/18439550
OpenUrl CrossRef PubMed Web of Science

[219] Zhernakova A,

[220] Festen EM,

[221] Franke L, et al

[222] ↵
Schirmer M, et al
. Inflammatory bowel disease gut microbiome. Nat. Microbiol 2017.

[223] Schirmer M, et al

[224] ↵
Peters LA,
Perrigoue J,
Mortha A, et al
. A functional genomics predictive network model identifies regulators of inflammatory bowel disease. Nat Genet 2017;49:1437–49.doi:10.1038/ng.3947pmid:http://www.ncbi.nlm.nih.gov/pubmed/28892060
OpenUrl CrossRef PubMed

[225] Peters LA,

[226] Perrigoue J,

[227] Mortha A, et al

[228] ↵
Cho H,
Kehrl JH
. Regulation of immune function by G protein-coupled receptors, trimeric G proteins, and RGS proteins. Prog Mol Biol Transl Sci 2009;86:249–98.doi:10.1016/S1877-1173(09)86009-2pmid:http://www.ncbi.nlm.nih.gov/pubmed/20374719
OpenUrl CrossRef PubMed

[229] Cho H,

[230] Kehrl JH

[231] ↵
Gräler MH,
Goetzl EJ
. Lysophospholipids and their G protein-coupled receptors in inflammation and immunity. Biochim Biophys Acta 2002;1582:168–74.doi:10.1016/S1388-1981(02)00152-Xpmid:http://www.ncbi.nlm.nih.gov/pubmed/12069825
OpenUrl CrossRef PubMed Web of Science

[232] Gräler MH,

[233] Goetzl EJ

[234] ↵
Emdin CA,
Khera AV,
Chaffin M, et al
. Analysis of predicted loss-of-function variants in UK Biobank identifies variants protective for disease. Nat Commun 2018;9:1613. doi:10.1038/s41467-018-03911-8pmid:http://www.ncbi.nlm.nih.gov/pubmed/29691411
OpenUrl CrossRef PubMed

[235] Emdin CA,

[236] Khera AV,

[237] Chaffin M, et al

[238] ↵
Fons P,
Chabot S,
Cartwright JE, et al
. Soluble HLA-G1 inhibits angiogenesis through an apoptotic pathway and by direct binding to CD160 receptor expressed by endothelial cells. Blood 2006;108:2608–15.doi:10.1182/blood-2005-12-019919pmid:http://www.ncbi.nlm.nih.gov/pubmed/16809620
OpenUrl Abstract/FREE Full Text

[239] Fons P,

[240] Chabot S,

[241] Cartwright JE, et al

[242] ↵
Cai G,
Anumanthan A,
Brown JA, et al
. CD160 inhibits activation of human CD4+ T cells through interaction with herpesvirus entry mediator. Nat Immunol 2008;9:176–85.doi:10.1038/ni1554pmid:http://www.ncbi.nlm.nih.gov/pubmed/18193050
OpenUrl CrossRef PubMed Web of Science

[243] Cai G,

[244] Anumanthan A,

[245] Brown JA, et al

[246] ↵
Shui J-W,
Larange A,
Kim G, et al
. HVEM signalling at mucosal barriers provides host defence against pathogenic bacteria. Nature 2012;488:222–5.doi:10.1038/nature11242pmid:http://www.ncbi.nlm.nih.gov/pubmed/22801499
OpenUrl CrossRef PubMed

[247] Shui J-W,

[248] Larange A,

[249] Kim G, et al

[250] ↵
Kelsen JR,
Dawany N,
Moran CJ, et al
. Exome sequencing analysis reveals variants in primary immunodeficiency genes in patients with very early onset inflammatory bowel disease. Gastroenterology 2015;149:1415–24.doi:10.1053/j.gastro.2015.07.006pmid:http://www.ncbi.nlm.nih.gov/pubmed/26193622
OpenUrl CrossRef PubMed

[251] Kelsen JR,

[252] Dawany N,

[253] Moran CJ, et al

[254] ↵
An R,
Wilms E,
Masclee AAM, et al
. Age-Dependent changes in Gi physiology and microbiota: time to reconsider? Gut 2018;67:2213–22.doi:10.1136/gutjnl-2017-315542pmid:http://www.ncbi.nlm.nih.gov/pubmed/30194220
OpenUrl Abstract/FREE Full Text

[255] An R,

[256] Wilms E,

[257] Masclee AAM, et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

other Versions

Abstract

Statistics from Altmetric.com

Request Permissions

Significance of this study

What is already known about this subject?

What are the new findings?

How might it impact on clinical practice in the foreseeable future?

Introduction

Methods

Study cohorts

Supplemental material

WES and data processing

Metagenomic sequencing and data processing

Supplemental material

Host genetics and gut microbiota differences between cohorts

IBD genetic signature

IBD-associated gut microbial taxa and pathways

mbQTL analyses

Step 1: whole-exome-wide association meta-analyses

Step 2: meta-analyses of selected variants

Step 3: gene-based burden test meta-analyses

Step 4: assessing disease effect in the host–microbiota correlations

Annotation of genetic variants

Results

Cohort description

Differences on host genetics and gut microbiota between cases and controls

Whole-exome-wide analysis reveals mbQTLs in immune-related genes

Supplemental material

Targeted analysis identifies mbQTLs in IBD-associated genes

Supplemental material

Gene-based burden test highlights rare mutation mbQTLs

Supplemental material

Interaction analyses identifies IBD-specific mbQTLs

Supplemental material

Supplemental material

Discussion

Acknowledgments

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password