Interpreting Disease Relevance of Amino Acid Substitutions
Published Online: 17 MAR 2014
Copyright © 2001 John Wiley & Sons, Ltd. All rights reserved.
How to Cite
Vihinen, M. 2014. Interpreting Disease Relevance of Amino Acid Substitutions. eLS.
High-throughput sequencing methods can generate large amounts of information about genetic variations; however, interpretation of these data has become a severe bottleneck for the efficient use of genomic data, for example, in diagnostics. Identifying the variations responsible for phenotypes is a laborious and often difficult task. Amino acid substitutions are among the most common disease-causing variants. A human genome codes for, on average, approximately 11 000 such variants. Computational tools are needed to filter and rank raw variation datasets for further studies. Amino acid substitutions can have numerous effects, and the mechanisms behind them are diverse. Therefore, different kinds of methods have been developed. Tolerance predictors aim to identify variants that are likely to be harmful. Mechanism- and effect-specific tools are dedicated to specific outcomes of variants.
An amino acid substitution is a change in a protein sequence in which a single residue is replaced by another.
A benchmark dataset contains cases with known effects. It serves as the gold standard, for example, for assessing method performance and for training machine learning-based methods.
The Human Variome Project (HVP) is an international organisation coordinating research and standards for variation research.
Next generation sequencing methods are fast nucleotide sequencing methods that take advantage of multiplexing and are capable of sequencing complete genomes very rapidly.
Performance measures are used to indicate the performance of prediction methods. For a full picture of performance, several measures should be reported.
Tolerance predictors are methods that predict whether an amino acid substitution is tolerated in a sequence.
A variation is a change in a nucleotide or amino acid sequence in comparison with the reference sequence.
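The point above about reporting several performance measures can be illustrated with a minimal sketch. It assumes a tolerance predictor has been evaluated on a benchmark dataset and that its predictions have been summarised as a binary confusion matrix. The function name `performance_measures`, the choice of measures and the example counts are illustrative assumptions, not taken from the article.

```python
import math


def performance_measures(tp, tn, fp, fn):
    """Compute several complementary measures from a binary confusion matrix.

    tp/fn: harmful benchmark variants predicted harmful/tolerated.
    tn/fp: tolerated benchmark variants predicted tolerated/harmful.
    The set of measures is an illustrative assumption, not the article's list.
    """
    def ratio(numerator, denominator):
        # Return NaN instead of raising when a measure is undefined.
        return numerator / denominator if denominator else float("nan")

    mcc_denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "mcc": ratio(tp * tn - fp * fn, mcc_denominator),
    }


# Hypothetical counts for a predictor tested on 100 benchmark variants.
measures = performance_measures(tp=40, tn=45, fp=5, fn=10)
for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

A single number such as accuracy can look favourable even when a predictor misses many harmful variants, which is why the sketch returns the measures together rather than one summary value.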
- amino acid substitution;
- tolerance predictor;
- locus specific databases;
- protein localisation;
- machine learning;
- next generation sequencing;
- protein stability predictor