Research Article
Modelling of partition constants: linear solvation energy relationships or PLS regression?
Article first published online: 2 FEB 2009
DOI: 10.1002/cem.1224
Copyright © 2009 John Wiley & Sons, Ltd.
Additional Information
How to Cite
Liu, T. and Öberg, T. (2009), Modelling of partition constants: linear solvation energy relationships or PLS regression?. Journal of Chemometrics, 23: 254–262. doi: 10.1002/cem.1224
Publication History
- Issue published online: 1 MAY 2009
- Article first published online: 2 FEB 2009
- Manuscript Accepted: 22 DEC 2008
- Manuscript Revised: 18 DEC 2008
- Manuscript Received: 23 SEP 2008
- Abstract
- References
- Cited By
Keywords:
- QSPR;
- QSAR;
- LFER;
- LSER;
- PLSR
The partitioning between octanol and water (Kow) and the water solubility (SW) are used to investigate similarities and differences between linear solvation energy relationships (LSER) and partial least squares regression (PLSR) models. The similarities in model structure are described, and shown to transform into a comparable prediction performance. Furthermore, the results demonstrate the opportunity for an analogous chemical interpretation and much of the alleged difference between the mechanistic or semi-empirical LSER and the statistical PLSR models then disappear.
Abstract
Estimation methods for partition constants are needed in many fields of engineering and science. The partitioning between phases is determined by the free energy of the transfer and all estimation methods must therefore describe the same entity. Linear solvation energy relationships (LSERs) try to split the contributions to van der Waals and polar interactions into directly interpretable solute descriptors, while projection-based regression methods can accomplish a similar dimensionality reduction from a set of theoretical descriptors. Here, we use the partitioning between octanol and water (Kow) and water solubility (Sw) to investigate similarities and differences between LSER and partial least squares regression (PLSR) models. The similarities in model structure are described, and shown to transform into a comparable prediction performance. We also demonstrate the opportunity to accomplish an analogous chemical interpretation of a PLSR model—either directly or through a linear transformation of the PLS factors—as with an LSER model. Much of the alleged difference between the mechanistic or semi-empirical LSER and the statistical PLSR models will then disappear. The choice of a modelling approach should therefore primarily be driven by the availability of data and predictive performance. Copyright © 2009 John Wiley & Sons, Ltd.

1099-128X/asset/CEM_left.gif?v=1&s=bf7a32b94d86cfd950babd255fbe81e66d033e4b)
1099-128X/asset/CEM_right.gif?v=1&s=4630211ecefb8b6241dad7b782e7b742d7a9891a)
1099-128X/asset/cover.gif?v=1&s=2e3045c3733baa4258989f44bd61b29dd74ee736)