Determination of rank by augmentation (DRAUG)



Determining the rank of a chemical matrix is the first step in many multivariate, chemometric studies. Rank is defined as the minimum number of linearly independent factors after deletion of factors that contribute to random, nonlinear, uncorrelated errors. Adding a matrix of rank 1 to a data matrix not only increases the rank by one unit but also perturbs the primary factor axes, having little effect on the secondary axes associated with the random errors in the measurements. The primary rank of a data matrix can be determined by comparing the residual variances obtained from principal component analysis (PCA) of the original data matrix to those obtained from an augmented matrix. The ratio of the residual variances between adjacent factor levels represents a Fisher ratio that can be used to distinguish the primary factors (chemical as well as instrumental factors) from the secondary factors (experimental errors). The results gleaned from model studies as well as those from experimental studies are used to illustrate the efficacy of the proposed methodology. The method is independent of the nature of the error distribution. Limitations and precautions are discussed. An algorithm, written in MATLAB format, is included. Copyright © 2011 John Wiley & Sons, Ltd.