Protein loops on structurally similar scaffolds: Database and conformational analysis
Article first published online: 17 MAR 1999
DOI: 10.1002/(SICI)1097-0282(199905)49:6<481::AID-BIP6>3.0.CO;2-V
Copyright © 1999 John Wiley & Sons, Inc.
Additional Information
How to Cite
Li, W., Liu, Z. and Lai, L. (1999), Protein loops on structurally similar scaffolds: Database and conformational analysis. Biopolymers, 49: 481–495. doi: 10.1002/(SICI)1097-0282(199905)49:6<481::AID-BIP6>3.0.CO;2-V
Publication History
- Issue published online: 17 MAR 1999
- Article first published online: 17 MAR 1999
- Manuscript Accepted: 27 OCT 1998
- Manuscript Received: 8 JUN 1998
Funded by
- Chinese National High Technology Development Program (CNHT). Grant Number: 863-103-13-03-05
- NSFC. Grant Number: 29703001
- Abstract
- References
- Cited By
Keywords:
- protein loop;
- data base;
- loop structure;
- loop conformation;
- loop modeling
Abstract
A general problem in comparative modeling and protein design is the conformational evaluation of loops with a certain sequence in specific environmental protein frameworks. Loops of different sequences and structures on similar scaffolds are common in the Protein Data Bank (PDB). In order to explore both structural and sequential diversity of them, a data base of loops connecting similar secondary structure fragments is constructed by searching the data base of families of structurally similar proteins and PDB. A total of 84 loop families having 2–13 residues are found among the well-determined structures of resolution better than 2.5 Å. Eight α-α, 20 α-β, 19 β-α, and 37 β-β families are identified. Every family contains more than 5 loop motifs. In each family, no loops share same sequence and all the frameworks are well superimposed. Forty-three new loop classes are distinguished in the data base. The structural variability of loops in homologous proteins are examined and shown in 44 families. Motif families are characterized with geometric parameters and sequence patterns. The conformations of loops in each family are clustered into subfamilies using average linkage cluster analysis method. Information such as geometric properties, sequence profile, sequential and structural variability in loop, structural alignment parameters, sequence similarities, and clustering results are provided. Correlations between the conformation of loops and loop sequence, motif sequence, and global sequence of PDB chain are examined in order to find how loop structures depend on their sequences and how they are affected by the local and global environment. Strong correlations (R > 0.75) are only found in 24 families. The best R value is 0.98. The data base is available through the Internet. © 1999 John Wiley & Sons, Inc. Biopoly 49: 481–495, 1999

1097-0282/asset/BIP_left.gif?v=1&s=f807857598ff440ee8acab64bad01b412bcbb591)
1097-0282/asset/cover.gif?v=1&s=fac188e766fbfc223bfd2cc29b9a541bdc4d135e)