Research Article
Choosing the number of factors in partial least squares regression: estimating and minimizing the mean squared error of prediction
Article first published online: 4 JUL 2000
DOI: 10.1002/1099-128X(200007/08)14:4<351::AID-CEM598>3.0.CO;2-Q
Copyright © 2000 John Wiley & Sons, Ltd.
Additional Information
How to Cite
Denham, M. C. (2000), Choosing the number of factors in partial least squares regression: estimating and minimizing the mean squared error of prediction. Journal of Chemometrics, 14: 351–361. doi: 10.1002/1099-128X(200007/08)14:4<351::AID-CEM598>3.0.CO;2-Q
Publication History
- Issue published online: 4 JUL 2000
- Manuscript Accepted: 10 JAN 2000
- Manuscript Received: 20 APR 1999
Keywords:
- PLS regression;
- model selection;
- prediction;
- bootstrap;
- linearization
Abstract
We investigate a number of approaches to estimating the mean squared error of prediction (MSEP) in partial least squares (PLS) regression without resorting to external validation. Using two simulation examples based on real data, we evaluate the methods in terms of their accuracy and their usefulness in determining the optimal number of factors to include in the PLS model. We find that for problems with relatively few variables, methods based on ignoring the effect of non-linearity in PLS regression or using a linear approximation give good estimates of MSEP, with little to choose between them. However, where linear approximation is feasible, we prefer it, since it gives estimates of MSEP which have lower bias and variance than cross-validation. In situations where there are large numbers of variables, these methods break down. In these circumstances, cross-validation and bootstrapping methods are better able to capture the changes in MSEP with the number of factors fitted and thus are more useful for identifying the optimal PLS regression model.
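As a rough illustration of one of the approaches named in the abstract, the sketch below estimates MSEP by K-fold cross-validation for each number of PLS factors and picks the number that minimizes the estimate. It is not the paper's procedure and does not implement the linearization or bootstrap estimators. It assumes a single response (PLS1 fitted by the standard NIPALS recursion), and the function names (`pls1_fit`, `pls1_predict_path`, `cv_msep`, `make_data`), the fold count, and the synthetic data are all hypothetical choices made for this example.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def pls1_fit(X, y, max_factors):
    """Fit PLS1 by NIPALS; returns column means, y mean and per-factor (w, p, q)."""
    n, k = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(k)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(k)] for row in X]  # centred X
    f = [v - y_mean for v in y]                                # centred y
    factors = []
    for _ in range(max_factors):
        w = [sum(E[i][j] * f[i] for i in range(n)) for j in range(k)]
        norm = dot(w, w) ** 0.5
        if norm < 1e-12:          # nothing left to explain
            break
        w = [v / norm for v in w]
        t = [dot(row, w) for row in E]                         # scores
        tt = dot(t, t)
        if tt < 1e-12:
            break
        p = [sum(E[i][j] * t[i] for i in range(n)) / tt for j in range(k)]
        q = dot(f, t) / tt
        E = [[E[i][j] - t[i] * p[j] for j in range(k)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        factors.append((w, p, q))
    return x_mean, y_mean, factors

def pls1_predict_path(model, x, max_factors):
    """Predictions for one new sample using 0, 1, ..., max_factors factors."""
    x_mean, y_mean, factors = model
    e = [a - m for a, m in zip(x, x_mean)]
    pred = y_mean
    path = [pred]
    for w, p, q in factors:
        t = dot(e, w)
        pred += q * t
        e = [ei - t * pj for ei, pj in zip(e, p)]
        path.append(pred)
    while len(path) < max_factors + 1:   # pad if fitting stopped early
        path.append(pred)
    return path

def cv_msep(X, y, max_factors, n_folds=5, seed=0):
    """K-fold cross-validated MSEP estimate for 0..max_factors factors."""
    n = len(X)
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    sq_err = [0.0] * (max_factors + 1)
    for fold in range(n_folds):
        test = idx[fold::n_folds]
        held_out = set(test)
        train = [i for i in idx if i not in held_out]
        model = pls1_fit([X[i] for i in train], [y[i] for i in train], max_factors)
        for i in test:
            path = pls1_predict_path(model, X[i], max_factors)
            for a, pred in enumerate(path):
                sq_err[a] += (y[i] - pred) ** 2
    return [s / n for s in sq_err]

def make_data(n, noise_sd, seed):
    """Synthetic data (an assumption for this example): 3 predictors, linear response."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(n)]
    y = [1 + 2 * r[0] - r[1] + 0.5 * r[2] + rng.gauss(0, noise_sd) for r in X]
    return X, y

X, y = make_data(40, 0.3, seed=1)
msep = cv_msep(X, y, max_factors=3)
best = min(range(len(msep)), key=msep.__getitem__)
print("estimated MSEP by number of factors:", msep)
print("number of factors minimizing the estimate:", best)
```

Index 0 of the returned list corresponds to predicting with the training mean alone, which gives a baseline against which the fitted factors can be judged. With many more variables than samples, the regime in which the abstract reports that cross-validation and bootstrapping work better, the same loop applies unchanged, although a pure-Python implementation like this one would be slow.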