Journal of Chemometrics

Cover image for Journal of Chemometrics

May 2009

Volume 23, Issue 5

Pages 217–264

  1. Research Articles

    1. Top of page
    2. Research Articles
    3. Short Communications
    1. Regression by L1 regularization of smart contrasts and sums (ROSCAS) beats PLS and elastic net in latent variable model (pages 217–228)

      Cajo J. F. ter Braak

      Article first published online: 9 JAN 2009 | DOI: 10.1002/cem.1213

      Regression methods for high dimensional data, such as LASSO, elastic net, PLS and ridge regression, regularize the regression coefficients by penalizing their size. We show that such methods perform poorly on data generated by a latent variable model with different numbers of predictors per latent variable. The new and better performing method proposed here (ROSCAS) exploits the idea that a priori correlated predictors should have similar coefficients by penalizing contrasts and sums derived from the X-correlations.

    2. Modeling the semi-empirical electrotopological index in QSPR studies for aldehydes and ketones (pages 229–235)

      Érica Silva Souza, Carlos Alberto Kuhnen, Berenice da Silva Junkes, Rosendo Augusto Yunes and Vilma Edite Fonseca Heinzen

      Article first published online: 20 JAN 2009 | DOI: 10.1002/cem.1215

      The semi-empirical electrotopological index, ISET, remodeled for ketones and aldehydes is calculated through the atomic charge values obtained from a Mulliken population analysis using the semi-empirical AM1 method. The best definition of an equivalent local dipole moment is clearly dependent on the specific features of the charge distribution in the polar region of the molecules. The models were of good quality as shown by the statistical parameters, showing that including charge distribution and structural details open a new way in QSRR studies.

    3. Rank annihilation factor analysis for multicomponent kinetic-spectrophotometric determination using difference spectra (pages 236–247)

      Morteza Bahram and Mehdi Mabhooti

      Article first published online: 26 JAN 2009 | DOI: 10.1002/cem.1216

      This Manuscript presents a new strategy for handling rank-deficient two-way data using rank annihilation factor analysis (RAFA) in conjunction with difference spectra. The obtained difference matrix of sample and that of analyte of interest will be full-rank and rank-one, respectively. Therefore the system can be analyzed by RAFA.

    4. Re-parameterization of five-parameter logistic function (pages 248–253)

      Jason J. Z. Liao and Rong Liu

      Article first published online: 12 JAN 2009 | DOI: 10.1002/cem.1218

      The five-parameter logistic (5PL) function has seen increased use as a model for bioassay dose-response curves. The 5PL function takes curve asymmetry into consideration to overcome some drawbacks of the four-parameter logistic (4PL) function. However, the currently used 5PL functional form does not have the same practical useful parameter interpretation as the 4PL function. In this paper, we re-parameterize the 5PL function to preserve the practical useful parameter interpretation of the 4PL, and the new 5PL has better statistical properties in terms of bias and mean squared error (MSE).

    5. Modelling of partition constants: linear solvation energy relationships or PLS regression? (pages 254–262)

      Tao Liu and Tomas Öberg

      Article first published online: 2 FEB 2009 | DOI: 10.1002/cem.1224

      The partitioning between octanol and water (Kow) and the water solubility (SW) are used to investigate similarities and differences between linear solvation energy relationships (LSER) and partial least squares regression (PLSR) models. The similarities in model structure are described, and shown to transform into a comparable prediction performance. Furthermore, the results demonstrate the opportunity for an analogous chemical interpretation and much of the alleged difference between the mechanistic or semi-empirical LSER and the statistical PLSR models then disappear.

  2. Short Communications

    1. Top of page
    2. Research Articles
    3. Short Communications
    1. OPLS filtered data can be obtained directly from non-orthogonalized PLS1 (pages 263–264)

      E. K. Kemsley and H. S. Tapp

      Article first published online: 15 JAN 2009 | DOI: 10.1002/cem.1217

      This work discloses a simple re-arrangement of the Martens approach to partial least squares (‘non-orthogonalized’ PLS). The factorization is separated into entirely y-related and y-unrelated parts; the y-unrelated variation can hence be removed from the X data through a single post hoc calculation following conventional PLS. The outcome is identical to performing OPLS-filtering (‘orthogonal projections to latent structures’), but without any recourse to the OPLS algorithm.

SEARCH

SEARCH BY CITATION