Unit

UNIT 6.15 Inference of Episodic Changes in Natural Selection Acting on Protein Coding Sequences via CODEML

  1. Joseph P. Bielawski1,2,
  2. Jennifer L. Baker3,
  3. Joseph Mingrone2

Published Online: 20 JUN 2016

DOI: 10.1002/cpbi.2

Current Protocols in Bioinformatics

Current Protocols in Bioinformatics

How to Cite

Bielawski, J.P., Baker, J.L. and Mingrone, J. 2016. Inference of episodic changes in natural selection acting on protein coding sequences via CODEML. Curr. Protoc. Bioinform. 54:6.15.1-6.15.32. doi: 10.1002/cpbi.2

Author Information

  1. 1

    Department of Biology, Dalhousie University, Halifax, Nova Scotia, Canada

  2. 2

    Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia, Canada

  3. 3

    Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland

Publication History

  1. Published Online: 20 JUN 2016

Abstract

This unit provides protocols for using the CODEML program from the PAML package to make inferences about episodic natural selection in protein-coding sequences. The protocols cover inference tasks such as maximum likelihood estimation of selection intensity, testing the hypothesis of episodic positive selection, and identifying sites with a history of episodic evolution. We provide protocols for using the rich set of models implemented in CODEML to assess robustness, and for using bootstrapping to assess if the requirements for reliable statistical inference have been met. An example dataset is used to illustrate how the protocols are used with real protein-coding sequences. The workflow of this design, through automation, is readily extendable to a larger-scale evolutionary survey. © 2016 by John Wiley & Sons, Inc.

Keywords:

  • codon model;
  • natural selection;
  • episodic evolution;
  • maximum likelihood;
  • dN/dS ratio;
  • experimental design