Volume 36, Issue 16
Research Article

Tests for informative cluster size using a novel balanced bootstrap scheme

Jaakko Nevalainen

Corresponding Author

E-mail address: jaakko.nevalainen@uta.fi

School of Health Sciences, University of Tampere, Tampere, Finland

Correspondence to: Jaakko Nevalainen, School of Health Sciences, University of Tampere, Tampere, Finland.

E‐mail: jaakko.nevalainen@uta.fi

Search for more papers by this author
Hannu Oja

Department of Mathematics and Statistics, University of Turku, Turku, Finland

Search for more papers by this author
Somnath Datta

Department of Biostatistics, University of Florida, Gainesville, FL, U.S.A.

Search for more papers by this author
First published: 21 March 2017
Citations: 5

Abstract

Clustered data are often encountered in biomedical studies, and to date, a number of approaches have been proposed to analyze such data. However, the phenomenon of informative cluster size (ICS) is a challenging problem, and its presence has an impact on the choice of a correct analysis methodology. For example, Dutta and Datta (2015, Biometrics) presented a number of marginal distributions that could be tested. Depending on the nature and degree of informativeness of the cluster size, these marginal distributions may differ, as do the choices of the appropriate test. In particular, they applied their new test to a periodontal data set where the plausibility of the informativeness was mentioned, but no formal test for the same was conducted. We propose bootstrap tests for testing the presence of ICS. A balanced bootstrap method is developed to successfully estimate the null distribution by merging the re‐sampled observations with closely matching counterparts. Relying on the assumption of exchangeability within clusters, the proposed procedure performs well in simulations even with a small number of clusters, at different distributions and against different alternative hypotheses, thus making it an omnibus test. We also explain how to extend the ICS test to a regression setting and thereby enhancing its practical utility. The methodologies are illustrated using the periodontal data set mentioned earlier. Copyright © 2017 John Wiley & Sons, Ltd.

Number of times cited according to CrossRef: 5

  • Variance estimation in tests of clustered categorical data with informative cluster size, Statistical Methods in Medical Research, 10.1177/0962280220928572, (096228022092857), (2020).
  • Within-cluster resampling for multilevel models under informative cluster size, Biometrika, 10.1093/biomet/asz035, 106, 4, (965-972), (2019).
  • Robust Nonparametric Inference, Annual Review of Statistics and Its Application, 10.1146/annurev-statistics-031017-100247, 5, 1, (473-500), (2018).
  • Comparison of model- and design-based approaches to detect the treatment effect and covariate by treatment interactions in three-level models for multisite cluster-randomized trials, Behavior Research Methods, 10.3758/s13428-018-1080-1, (2018).
  • Diagnostic methods for uncovering outcome dependent visit processes, Biostatistics, 10.1093/biostatistics/kxy068, (2018).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.