Structure refinement of protein model decoys requires accurate side-chain placement


  • Mark A. Olson,

    Corresponding author
    1. Department of Cell Biology and Biochemistry, USAMRIID, Frederick, Maryland 21702
    • Department of Cell Biology and Biochemistry, USAMRIID, Frederick, MD 21702
    Search for more papers by this author
  • Michael S. Lee

    1. Department of Cell Biology and Biochemistry, USAMRIID, Frederick, Maryland 21702
    2. Center for Genome Sciences, USAMRIID, Frederick, Maryland 21702
    3. Computational Sciences and Engineering Branch, U.S. Army Research Laboratory, Aberdeen Proving Ground, Aberdeen, Maryland 21005
    Search for more papers by this author

  • Published 2012. This article is a US Government work and, as such, is in the public domain in the United States of America.


In this study, the application of temperature-based replica-exchange (T-ReX) simulations for structure refinement of decoys taken from the I-TASSER dataset was examined. A set of eight nonredundant proteins was investigated using self-guided Langevin dynamics (SGLD) with a generalized Born implicit solvent model to sample conformational space. For two of the protein test cases, a comparison of the SGLD/T-ReX method with that of a hybrid explicit/implicit solvent molecular dynamics T-ReX simulation model is provided. Additionally, the effect of side-chain placement among the starting decoy structures, using alternative rotamer conformations taken from the SCWRL4 modeling program, was investigated. The simulation results showed that, despite having near-native backbone conformations among the starting decoys, the determinant of their refinement is side-chain packing to a level that satisfies a minimum threshold of native contacts to allow efficient excursions toward the downhill refinement regime on the energy landscape. By repacking using SCWRL4 and by applying the RWplus statistical potential for structure identification, the SGLD/T-ReX simulations achieved refinement to an average of 38% increase in the number of native contacts relative to the original I-TASSER decoy sets and a 25% reduction in values of Cα root-mean-square deviation. The hybrid model succeeded in obtaining a sharper funnel to low-energy states for a modeled target than the implicit solvent SGLD model; yet, structure identification remained roughly the same. Without meeting a threshold of near-native packing of side chains, the T-ReX simulations degrade the accuracy of the decoys, and subsequently, refinement becomes tantamount to the protein folding problem. Proteins 2013. 2012 Published by Wiley Periodicals, Inc.