Quantifying the accessibility of the metagenome by random expression cloning techniques

Environ Microbiol. 2004 Sep;6(9):879-86. doi: 10.1111/j.1462-2920.2004.00640.x.

Abstract

The exploitation of the metagenome for novel biocatalysts by functional screening is determined by the ability to express the respective genes in a surrogate host. The probability of recovering a certain gene thereby depends on its abundance in the environmental DNA used for library construction, the chosen insert size, the length of the target gene, and the presence of expression signals that are functional in the host organism. In this paper, we present a set of formulas that describe the chance of isolating a gene by random expression cloning, taking into account the three different modes of heterologous gene expression: independent expression, expression as a transcriptional fusion and expression as a translational fusion. Genes of the last category are shown to be virtually inaccessible by shotgun cloning because of the low frequency of functional constructs. To evaluate which part of the metagenome might in this way evade exploitation, 32 complete genome sequences of prokaryotic organisms were analysed for the presence of expression signals functional in E. coli hosts, using bioinformatics tools. Our study reveals significant differences in the predicted expression modes between distinct taxonomic groups of organisms and suggests that about 40% of the enzymatic activities may be readily recovered by random cloning in E. coli.

MeSH terms

  • Bacteria / enzymology
  • Bacteria / genetics*
  • Cloning, Molecular / methods*
  • Computational Biology
  • Escherichia coli / metabolism
  • Gene Expression*
  • Gene Library
  • Genes / genetics*
  • Genomics / methods*
  • Species Specificity