Article Text

Download PDFPDF
IDDF2024-ABS-0427 Pacbio sequencing reveals the features and genomic function of gut viruses in colorectal cancer
  1. Yanqiang Ding1,
  2. Gang Guo2,
  3. Liuyang Zhao1,
  4. Gaixia Liu2,
  5. Chuanfa Liu1,
  6. Shu Fan2,
  7. Feiyu Shi2,
  8. Yufeng Lin1,
  9. Junjun She2,
  10. Jun Yu1
  1. 1Institute of Digestive Disease and Department of Medicine and Therapeutics, State Key Laboratory of Digestive Disease, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Hong Kong
  2. 2Center for Gut Microbiome Research, Med-X Institute, Department of High Talent, The First Affiliated Hospital of Xi’an Jiaotong University, China

Abstract

Background The association between an imbalanced gut virus community and colorectal cancer (CRC) is well-established, while the specific characteristics and genomic functions of these viruses remain poorly understood. This study aimed to investigate the lifestyle, gene function, single nucleotide polymorphisms (SNPs), structural variations (SVs), and phage-host interactions of gut viruses in CRC through the analysis of assembled viral genomes.

Methods Fecal samples were collected from CRC patients and healthy controls, and virus-like particles (VLPs) were isolated. PacBio sequencing platforms were used to sequence VLP DNA with a minimum length of 15 kb and at least 10 μg per sample. The gut virus genomes were assembled and annotated using Canu and EggNog. MetaPop and Sniffles2 were employed for SNP/SV analysis.

Results After assembling and filtering, more than 23,000 high-quality gut virus genomes were obtained. The prevalence of novel viruses was higher in CRC compared to the healthy control group. Notably, the proportion of phages, particularly temperate phages, was significantly higher in CRC than in the healthy control group (P < 0.001). KEGG pathway analysis of virus genes revealed enrichment in DNA repair and recombination, homologous recombination, and mismatch repair in CRC. Furthermore, 15 genes with differential SNPs accurately distinguished CRC from healthy controls with an AUC of 88.03% using a random forest model. Additionally, significant alterations were observed in the interaction between gut phages and host bacteria in CRC, particularly involving CRC-associated bacteria. Notably, the number of Bacteroides fragilis-hosted phage genomes and their relative abundance were significantly lower in CRC than in healthy controls (P < 0.05). Lastly, 15 viruses with differential abundances effectively discriminated CRC from healthy controls, with an AUC of 94.12% in the training cohort. These findings were validated in five global cohorts, with AUCs ranging from 71.94% to 81.72%.

Conclusions Multidimensional analysis based on PacBio sequencing revealed substantial changes in gut virus communities, lifestyle, gene function, SNP/SV, and phage-host interactions in CRC patients compared to healthy controls. Gut virus abundance and SNPs show promise as potential biomarkers for diagnosing CRC.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.