Get access

Adaptive Smith–Waterman residue match seeding for protein structural alignment

Authors

  • Christopher M. Topham,

    Corresponding author
    1. Université de Toulouse, INSA, UPS, INP, LISBP, Toulouse, France
    2. CNRS, UMR5504, Toulouse, France
    3. INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, Toulouse, France
    • Correspondence to: Christopher M. Topham; Université de Toulouse; INSA, INSA, UPS, INP, 135 Avenue de Rangueil, F-31077 Toulouse, France. E-mail: christopher.topham@insa-toulouse.fr or Isabelle André; Université de Toulouse; INSA,UPS,INP; LISBP, 135 Avenue de Rangueil, F-31077 Toulouse, France. E-mail: isabelle.andre@insa-toulouse.fr

    Search for more papers by this author
  • Mickaël Rouquier,

    1. Université de Toulouse, INSA, UPS, INP, LISBP, Toulouse, France
    2. CNRS, UMR5504, Toulouse, France
    3. INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, Toulouse, France
    Search for more papers by this author
  • Nathalie Tarrat,

    1. Université de Toulouse, INSA, UPS, INP, LISBP, Toulouse, France
    2. CNRS, UMR5504, Toulouse, France
    3. INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, Toulouse, France
    Search for more papers by this author
  • Isabelle André

    Corresponding author
    1. Université de Toulouse, INSA, UPS, INP, LISBP, Toulouse, France
    2. CNRS, UMR5504, Toulouse, France
    3. INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, Toulouse, France
    • Correspondence to: Christopher M. Topham; Université de Toulouse; INSA, INSA, UPS, INP, 135 Avenue de Rangueil, F-31077 Toulouse, France. E-mail: christopher.topham@insa-toulouse.fr or Isabelle André; Université de Toulouse; INSA,UPS,INP; LISBP, 135 Avenue de Rangueil, F-31077 Toulouse, France. E-mail: isabelle.andre@insa-toulouse.fr

    Search for more papers by this author

ABSTRACT

The POLYFIT rigid-body algorithm for automated global pairwise and multiple protein structural alignment is presented. Smith–Waterman local alignment is used to establish a set of seed equivalences that are extended using Needleman–Wunsch dynamic programming techniques. Structural and functional interaction constraints provided by evolution are encoded as one-dimensional residue physical environment strings for alignment of highly structurally overlapped protein pairs. Local structure alignment of more distantly related pairs is carried out using rigid-body conformational matching of 15-residue fragments, with allowance made for less stringent conformational matching of metal-ion and small molecule ligand-contact, disulphide bridge, and cis-peptide correspondences. Protein structural plasticity is accommodated through the stepped adjustment of a single empirical distance parameter value in the calculation of the Smith–Waterman dynamic programming matrix. Structural overlap is used both as a measure of similarity and to assess alignment quality. Pairwise alignment accuracy has been benchmarked against that of 10 widely used aligners on the Sippl and Wiederstein set of difficult pairwise structure alignment problems, and more extensively against that of Matt, SALIGN, and MUSTANG in pairwise and multiple structural alignments of protein domains with low shared sequence identity in the SCOP-ASTRAL 40% compendium. The results demonstrate the advantages of POLYFIT over other aligners in the efficient and robust identification of matching seed residue positions in distantly related protein targets and in the generation of longer structurally overlapped alignment lengths. Superposition-based application areas include comparative modeling and protein and ligand design. POLYFIT is available on the Web server at http://polyfit.insa-toulouse.fr. Proteins 2013; 81:1823–1839. © 2013 Wiley Periodicals, Inc.

Get access to the full text of this article

Ancillary