Unit

UNIT 4.12 Protein Function Prediction: Problems and Pitfalls

  1. William R. Pearson

Published Online: 3 SEP 2015

DOI: 10.1002/0471250953.bi0412s51

Current Protocols in Bioinformatics

Current Protocols in Bioinformatics

How to Cite

Pearson, W.R. 2015. Protein function prediction: problems and pitfalls. Curr. Protoc. Bioinform. 51:4.12.1-4.12.8. doi: 10.1002/0471250953.bi0412s51

Author Information

  1. University of Virginia School of Medicine, Charlottesville, Virginia

Publication History

  1. Published Online: 3 SEP 2015

Abstract

The characterization of new genomes based on their protein sets has been revolutionized by new sequencing technologies, but biologists seeking to exploit new sequence information are often frustrated by the challenges associated with accurately assigning biological functions to newly identified proteins. Here, we highlight some of the challenges in functional inference from sequence similarity. Investigators can improve the accuracy of function prediction by (1) being conservative about the evolutionary distance to a protein of known function; (2) considering the ambiguous meaning of “functional similarity,” and (3) being aware of the limitations of annotations in functional databases. Protein function prediction does not offer “one-size-fits-all” solutions. Prediction strategies work better when the idiosyncrasies of function and functional annotation are better understood. © 2015 by John Wiley & Sons, Inc.

Keywords:

  • homology;
  • orthology;
  • paralogy;
  • function prediction;
  • gene ontology;
  • EC numbers