Identifying remote protein homologs by network propagation

Authors


W. S. Noble, Department of Genome Sciences Department of Computer Science and Engineering University of Washington Seattle, WA, USA
Fax: +1 206 685 7301 Tel: +1 206 543 8930
E-mail: noble@gs.washington.edu

Abstract

Perhaps the most widely used applications of bioinformatics are tools such as psi-blast for searching sequence databases. We describe a recently developed protein database search algorithm called rankprop. rankprop relies upon a precomputed network of pairwise protein similarities. The algorithm performs a diffusion operation from a specified query protein across the protein similarity network. The resulting activation scores, assigned to each database protein, encode information about the global structure of the protein similarity network. This type of algorithm has a rich history in associationist psychology, artificial intelligence and web search. We describe the rankprop algorithm and its relatives, and we provide evidence that the algorithm successfully improves upon the rankings produced by psi-blast.

Ancillary