A nonredundant structure dataset for benchmarking protein-RNA computational docking

Authors

  • Sheng-You Huang,

    1. Department of Physics and Astronomy, University of Missouri, Columbia, Missouri 65211
    2. Department of Biochemistry, University of Missouri, Columbia, Missouri 65211
    3. Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri 65211
    4. Informatics Institute, University of Missouri, Columbia, Missouri 65211
    Search for more papers by this author
  • Xiaoqin Zou

    Corresponding author
    1. Department of Physics and Astronomy, University of Missouri, Columbia, Missouri 65211
    2. Department of Biochemistry, University of Missouri, Columbia, Missouri 65211
    3. Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri 65211
    4. Informatics Institute, University of Missouri, Columbia, Missouri 65211
    • Department of Physics and Astronomy, University of Missouri, Columbia, Missouri 65211
    Search for more papers by this author
    • Fax: 573-884-4232


Abstract

Protein–RNA interactions play an important role in many biological processes. The ability to predict the molecular structures of protein–RNA complexes from docking would be valuable for understanding the underlying chemical mechanisms. We have developed a novel nonredundant benchmark dataset for protein–RNA docking and scoring. The diverse dataset of 72 targets consists of 52 unbound–unbound test complexes, and 20 unbound–bound test complexes. Here, unbound–unbound complexes refer to cases in which both binding partners of the cocrystallized complex are either in apo form or in a conformation taken from a different protein–RNA complex, whereas unbound–bound complexes are cases in which only one of the two binding partners has another experimentally determined conformation. The dataset is classified into three categories according to the interface root mean square deviation and the percentage of native contacts in the unbound structures: 49 easy, 16 medium, and 7 difficult targets. The bound and unbound cases of the benchmark dataset are expected to benefit the development and improvement of docking and scoring algorithms for the docking community. All the easy-to-view structures are freely available to the public at http://zoulab.dalton.missouri.edu/RNAbenchmark/. © 2012 Wiley Periodicals, Inc.

Ancillary