UC-Curve: A highly compact 2D graphical representation of protein sequences
Article first published online: 21 NOV 2013
Copyright © 2013 Wiley Periodicals, Inc.
International Journal of Quantum Chemistry
Volume 114, Issue 6, pages 409–415, 15 March 2014
How to Cite
How to cite this article: Int. J. Quantum Chem. 2014, 114, 409–415, DOI: 10.1002/qua.24581, , , .
- Issue published online: 5 FEB 2014
- Article first published online: 21 NOV 2013
- Manuscript Accepted: 22 OCT 2013
- Manuscript Revised: 10 OCT 2013
- Manuscript Received: 6 SEP 2013
- National Natural Science Foundation of China. Grant Numbers: 11201409, 31100953, 61170110
- Natural Science Foundation of Hebei Province. Grant Number: A2013203009
- ZheJiang QianJiang Talent Plan. Grant Number: 2013R10061
- unit circle;
- cyclic order;
- geometric center vector;
- phylogenetic relationship
A highly compact two-dimensional graphical representation of protein sequences, namely, UC-Curve, is presented by assigning amino acids to the circumference of a unit circle with a cyclic order. UC-Curves can visually reveal general composition features of protein sequences, and roughly exhibit major differences among similar protein sequences. Geometric center vectors of UC-Curves and Euclidean distances are extracted, respectively, to analyze pairwise similarities/dissimilarities between two different families of proteins. Comparative results demonstrate the robustness of the technique and show that UC-Curves could help to inference reasonable phylogenetic relationships with relatively less computational cost. © 2013 Wiley Periodicals, Inc.