Article first published online: 9 JUL 2013
Copyright © 2013 John Wiley & Sons, Ltd.
Statistics in Medicine
Volume 32, Issue 26, pages 4550–4566, 20 November 2013
How to Cite
Capanu, M., Gönen, M. and Begg, C. B. (2013), An assessment of estimation methods for generalized linear mixed models with binary outcomes. Statist. Med., 32: 4550–4566. doi: 10.1002/sim.5866
- Issue published online: 9 OCT 2013
- Article first published online: 9 JUL 2013
- Manuscript Accepted: 8 MAY 2013
- Manuscript Received: 11 JAN 2012
- National Cancer Institute. Grant Number: CA131010
- generalized linear mixed models;
- binary clustered data
Two main classes of methodology have been developed for addressing the analytical intractability of generalized linear mixed models: likelihood-based methods and Bayesian methods. Likelihood-based methods such as the penalized quasi-likelihood approach have been shown to produce biased estimates especially for binary clustered data with small clusters sizes. More recent methods using adaptive Gaussian quadrature perform well but can be overwhelmed by problems with large numbers of random effects, and efficient algorithms to better handle these situations have not yet been integrated in standard statistical packages. Bayesian methods, although they have good frequentist properties when the model is correct, are known to be computationally intensive and also require specialized code, limiting their use in practice. In this article, we introduce a modification of the hybrid approach of Capanu and Begg, 2011, Biometrics 67, 371–380, as a bridge between the likelihood-based and Bayesian approaches by employing Bayesian estimation for the variance components followed by Laplacian estimation for the regression coefficients. We investigate its performance as well as that of several likelihood-based methods in the setting of generalized linear mixed models with binary outcomes. We apply the methods to three datasets and conduct simulations to illustrate their properties. Simulation results indicate that for moderate to large numbers of observations per random effect, adaptive Gaussian quadrature and the Laplacian approximation are very accurate, with adaptive Gaussian quadrature preferable as the number of observations per random effect increases. The hybrid approach is overall similar to the Laplace method, and it can be superior for data with very sparse random effects. Copyright © 2013 John Wiley & Sons, Ltd.