Promoter prediction analysis on the whole human genome

Nat Biotechnol. 2004 Nov;22(11):1467-73. doi: 10.1038/nbt1032.

Abstract

Promoter prediction programs (PPPs) are important for in silico gene discovery without support from expressed sequence tag (EST)/cDNA/mRNA sequences, in the analysis of gene regulation and in genome annotation. Contrary to previous expectations, a comprehensive analysis of PPPs reveals that no program simultaneously achieves sensitivity and a positive predictive value >65%. PPP performances deduced from a limited number of chromosomes or smaller data sets do not hold when evaluated at the level of the whole genome, with serious inaccuracy of predictions for non-CpG-island-related promoters. Some PPPs even perform worse than, or close to, pure random guessing.

Publication types

  • Comparative Study
  • Evaluation Study
  • Validation Study

MeSH terms

  • Algorithms*
  • Base Sequence
  • Chromosome Mapping / methods*
  • CpG Islands / genetics
  • Genome, Human*
  • Humans
  • Molecular Sequence Data
  • Promoter Regions, Genetic / genetics*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Sequence Analysis, DNA / methods*
  • Software Validation
  • Software*