Unit

UNIT 12.13 Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families

  1. Lars Barquist1,2,
  2. Sarah W. Burge2,
  3. Paul P. Gardner3,4

Published Online: 20 JUN 2016

DOI: 10.1002/cpbi.4

Current Protocols in Bioinformatics

Current Protocols in Bioinformatics

How to Cite

Barquist, L., Burge, S.W. and Gardner, P.P. 2016. Studying RNA homology and conservation with infernal: from single sequences to RNA families. Curr. Protoc. Bioinform. 54:12.13.1-12.13.25. doi: 10.1002/cpbi.4

Author Information

  1. 1

    Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany

  2. 2

    Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom

  3. 3

    School of Biological Sciences, University of Canterbury, Christchurch, New Zealand

  4. 4

    Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand

Publication History

  1. Published Online: 20 JUN 2016

Abstract

Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remain difficult. This unit introduces methods developed by the Rfam database for identifying “families” of homologous ncRNAs starting from single “seed” sequences, using manually curated sequence alignments to build powerful statistical models of sequence and structure conservation known as covariance models (CMs), implemented in the Infernal software package. We provide a step-by-step iterative protocol for identifying ncRNA homologs and then constructing an alignment and corresponding CM. We also work through an example for the bacterial small RNA MicA, discovering a previously unreported family of divergent MicA homologs in genus Xenorhabdus in the process. © 2016 by John Wiley & Sons, Inc.

Keywords:

  • covariance model;
  • homology;
  • RNA;
  • Rfam;
  • alignment;
  • ncRNA;
  • conservation