• Open Access

SNP discovery via 454 transcriptome sequencing

Authors

  • W. Brad Barbazuk,

    1. Donald Danforth Plant Science Center, St Louis, MO 63132, USA
    Search for more papers by this author
    • These authors contributed equally to this work.

  • Scott J. Emrich,

    1. Interdepartmental Bioinformatics and Computational Biology Graduate Program
    2. Department of Electrical and Computer Engineering
    Search for more papers by this author
    • These authors contributed equally to this work.

  • Hsin D. Chen,

    1. Department of Agronomy
    Search for more papers by this author
  • Li Li,

    1. Interdepartmental Plant Physiology Major
    2. Department of Genetics, Development, and Cell Biology
    Search for more papers by this author
  • Patrick S. Schnable

    Corresponding author
    1. Interdepartmental Bioinformatics and Computational Biology Graduate Program
    2. Department of Agronomy
    3. Interdepartmental Plant Physiology Major
    4. Department of Genetics, Development, and Cell Biology
    5. Center for Plant Genomics, Iowa State University, Ames, IA 50011, USA
      (fax +1 515 294 5256; e-mail schnable@iastate.edu).
    Search for more papers by this author

(fax +1 515 294 5256; e-mail schnable@iastate.edu).

Summary

A massively parallel pyro-sequencing technology commercialized by 454 Life Sciences Corporation was used to sequence the transcriptomes of shoot apical meristems isolated from two inbred lines of maize using laser capture microdissection (LCM). A computational pipeline that uses the POLYBAYES polymorphism detection system was adapted for 454 ESTs and used to detect SNPs (single nucleotide polymorphisms) between the two inbred lines. Putative SNPs were computationally identified using 260 000 and 280 000 454 ESTs from the B73 and Mo17 inbred lines, respectively. Over 36 000 putative SNPs were detected within 9980 unique B73 genomic anchor sequences (MAGIs). Stringent post-processing reduced this number to > 7000 putative SNPs. Over 85% (94/110) of a sample of these putative SNPs were successfully validated by Sanger sequencing. Based on this validation rate, this pilot experiment conservatively identified > 4900 valid SNPs within > 2400 maize genes. These results demonstrate that 454-based transcriptome sequencing is an excellent method for the high-throughput acquisition of gene-associated SNPs.

Ancillary