Standard Article

Eukaryotic gene finding

Part 4. Bioinformatics

4.2. Gene Finding and Gene Structure

Introductory Review

  1. Roderic Guigó

Published Online: 15 NOV 2005

DOI: 10.1002/047001153X.g402107

Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics

Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics

How to Cite

Guigó, R. 2005. Eukaryotic gene finding. Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics. 4:4.2:14.

Author Information

  1. Institut Municipal d'Investigació Mèdica, Universitat Pompeu Fabra, and Centre de Regulació Genòmica, Barcelona, Spain

Publication History

  1. Published Online: 15 NOV 2005

Abstract

After the genome of an organism is sequenced and assembled, the first necessary step toward the understanding of its functional content is to locate all protein-coding genes. Identification of genes is difficult in the eukaryotic genomes, because of the split nature of eukaryotic genes and because of the large intergenic spacers between adjacent genes. In this article, we will review how computational gene-prediction programs address this difficulty, describing the basic components underlying most computational methods, and the strategies employed to integrate them.

Keywords:

  • gene prediction;
  • coding statistics;
  • comparative genomics;
  • sequence signals;
  • splice sites;
  • exon;
  • gene;
  • dynamic programming;
  • Markov models