This article is a U.S. Government work and is in the public domain in the U.S.A.
Using lowess to remove systematic trends over time in predictor variables prior to logistic regression with quantile categories†
Article first published online: 14 APR 2003
This article is a U.S. Government work and is in the public domain in the U.S.A. Published in 2003 by John Wiley & Sons. Ltd.
Statistics in Medicine
Special Issue: 8th Biennial CDC and ADSTR Syposium on Statistical Methods Issues Associated with Complicated Designs and Data Structures
Volume 22, Issue 9, pages 1477–1493, 15 May 2003
How to Cite
Borkowf, C. B., Albert, P. S. and Abnet, C. C. (2003), Using lowess to remove systematic trends over time in predictor variables prior to logistic regression with quantile categories. Statist. Med., 22: 1477–1493. doi: 10.1002/sim.1507
- Issue published online: 14 APR 2003
- Article first published online: 14 APR 2003
- laboratory measurement drift;
- measurement error;
- trend removal
In case-control studies one may employ logistic regression to model the relationship between binary responses and continuous predictor variables that have been categorized by the empirical quartiles of the controls. Sometimes, however, systematic trends over time (or drifts) contaminate the laboratory measurements of predictor variables. In this paper we consider the use of locally weighted robust regression (lowess) to estimate and remove these systematic trends when the trends for the cases and controls have a common shape. One can then use the lowess adjusted data in the desired logistic regression model. We illustrate these methods with a case-control study that was designed to assess the risk of oesophageal cancer as a function of the quartile categories of sphinganine levels in the blood serum. Upon examination of the data, it was discovered that the sphinganine laboratory measurements were contaminated by a systematic trend, the magnitude of which depended only on the day of analysis. This trend needed to be removed before performing further analyses of the data. In addition, we present simulations to examine the use of lowess methods to estimate and remove various shapes of trends from contaminated predictor data before constructing logistic regression models with quartile categories. We found that using the trend-contaminated data tends to give attenuated parameter estimates and hence lower significance and power levels than using the uncontaminated data. Conversely, using appropriate lowess methods to adjust the data tends to give nearly unbiased parameter estimates, near nominal significance levels, and improved power. Published in 2003 by John Wiley & Sons, Ltd.