Hybrid pooled–unpooled design for cost‐efficient measurement of biomarkers†
This article is a U.S. Government work and is in the public domain in the U.S.A.
Abstract
Evaluating biomarkers in epidemiological studies can be expensive and time consuming. Many investigators use techniques such as random sampling or pooling biospecimens in order to cut costs and save time on experiments. Commonly, analyses based on pooled data are strongly restricted by distributional assumptions that are challenging to validate because of the pooled biospecimens. Random sampling provides data that can be easily analyzed. However, random sampling methods are not optimal cost‐efficient designs for estimating means. We propose and examine a cost‐efficient hybrid design that involves taking a sample of both pooled and unpooled data in an optimal proportion in order to efficiently estimate the unknown parameters of the biomarker distribution. In addition, we find that this design can be used to estimate and account for different types of measurement and pooling error, without the need to collect validation data or repeated measurements. We show an example where application of the hybrid design leads to minimization of a given loss function based on variances of the estimators of the unknown parameters. Monte Carlo simulation and biomarker data from a study on coronary heart disease are used to demonstrate the proposed methodology. Published in 2010 by John Wiley & Sons, Ltd.
Citing Literature
Number of times cited according to CrossRef: 15
- Dane R. Van Domelen, Emily M. Mitchell, Neil J. Perkins, Enrique F. Schisterman, Amita K. Manatunga, Yijian Huang, Robert H. Lyles, Logistic regression with a continuous exposure measured in pools and subject to errors, Statistics in Medicine, 10.1002/sim.7891, 37, 27, (4007-4021), (2018).
- Wei Zhang, Aiyi Liu, Paul S. Albert, Robert D. Ashmead, Enrique F. Schisterman, James L. Mills, A pooling strategy to effectively use genotype data in quantitative traits genome‐wide association studies, Statistics in Medicine, 10.1002/sim.7898, 37, 27, (4083-4095), (2018).
- Robert H. Lyles, Emily M. Mitchell, Clarice R. Weinberg, David M. Umbach, Enrique F. Schisterman, An efficient design strategy for logistic regression using outcome‐ and covariate‐dependent pooling of biospecimens prior to assay, Biometrics, 10.1111/biom.12489, 72, 3, (965-975), (2016).
- Michelle R. Danaher, Paul S. Albert, Aninyda Roy, Enrique F. Schisterman, Estimation of interaction effects using pooled biospecimens in a case‐control study, Statistics in Medicine, 10.1002/sim.6798, 35, 9, (1502-1513), (2015).
- Emily M. Mitchell, Robert H. Lyles, Enrique F. Schisterman, Positing, fitting, and selecting regression models for pooled biomarker data, Statistics in Medicine, 10.1002/sim.6496, 34, 17, (2544-2558), (2015).
- Robert Lyles, Dane Van Domelen, Emily Mitchell, Enrique Schisterman, A Discriminant Function Approach to Adjust for Processing and Measurement Error When a Biomarker is Assayed in Pooled Samples, International Journal of Environmental Research and Public Health, 10.3390/ijerph121114723, 12, 11, (14723-14740), (2015).
- Emily M. Mitchell, Robert H. Lyles, Amita K. Manatunga, Michelle Danaher, Neil J. Perkins, Enrique F. Schisterman, Regression for skewed biomarker outcomes subject to pooling, Biometrics, 10.1111/biom.12134, 70, 1, (202-211), (2014).
- M. R. Danaher, E. F. Schisterman, A. Roy, P. S. Albert, Estimation of gene–environment interaction by pooling biospecimens, Statistics in Medicine, 10.1002/sim.5357, 31, 26, (3241-3252), (2012).
- Enrique F. Schisterman, Paul S. Albert, The biomarker revolution, Statistics in Medicine, 10.1002/sim.5499, 31, 22, (2513-2515), (2012).
- Brian W. Whitcomb, Neil J. Perkins, Zhiwei Zhang, Aijun Ye, Robert H. Lyles, Assessment of skewed exposure in case‐control studies with pooling, Statistics in Medicine, 10.1002/sim.5351, 31, 22, (2461-2472), (2012).
- Robert H. Lyles, Li Tang, Ji Lin, Zhiwei Zhang, Bhramar Mukherjee, Likelihood‐based methods for regression analysis with binary exposure status assessed by pooling, Statistics in Medicine, 10.1002/sim.4426, 31, 22, (2485-2497), (2012).
- Yaakov Malinovsky, Paul S. Albert, Enrique F. Schisterman, Pooling Designs for Outcomes under a Gaussian Random Effects Model, Biometrics, 10.1111/j.1541-0420.2011.01673.x, 68, 1, (45-52), (2011).
- Albert Vexler, Wan‐Min Tsai, Yaakov Malinovsky, Estimation and testing based on data subject to measurement errors: from parametric to non‐parametric likelihood methods, Statistics in Medicine, 10.1002/sim.4304, 31, 22, (2498-2512), (2011).
- Z. Zhang, A. Liu, R.H. Lyles, B. Mukherjee, Logistic regression analysis of biomarker data subject to pooling and dichotomization, Statistics in Medicine, 10.1002/sim.4367, 31, 22, (2473-2484), (2011).
- Zhiwei Zhang, Paul S. Albert, Binary Regression Analysis with Pooled Exposure Measurements: A Regression Calibration Approach, Biometrics, 10.1111/j.1541-0420.2010.01464.x, 67, 2, (636-645), (2010).




