Volume 76, Issue 2
BIOMETRIC METHODOLOGY

Sequential adaptive variables and subject selection for GEE methods

Zimu Chen

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, China

Search for more papers by this author
Zhanfeng Wang

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, China

Search for more papers by this author
Yuan‐chin Ivan Chang

Corresponding Author

E-mail address: ycchang@sinica.edu.tw

Institute of Statistical Science, Academia Sinica, Taipei, Taiwan

Correspondence Yuan‐chin Ivan Chang, Institute of Statistical Science, Academia Sinica, 11529 Taipei, Taiwan.

Email: ycchang@sinica.edu.tw

Search for more papers by this author
First published: 10 October 2019

Abstract

Modeling correlated or highly stratified multiple‐response data is a common data analysis task in many applications, such as those in large epidemiological studies or multisite cohort studies. The generalized estimating equations method is a popular statistical method used to analyze these kinds of data, because it can manage many types of unmeasured dependence among outcomes. Collecting large amounts of highly stratified or correlated response data is time‐consuming; thus, the use of a more aggressive sampling strategy that can accelerate this process—such as the active‐learning methods found in the machine‐learning literature—will always be beneficial. In this study, we integrate adaptive sampling and variable selection features into a sequential procedure for modeling correlated response data. Besides reporting the statistical properties of the proposed procedure, we also use both synthesized and real data sets to demonstrate the usefulness of our method.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.