Volume 74, Issue 2
BIOMETRIC METHODOLOGY

Risk prediction for heterogeneous populations with application to hospital admission prediction

Jared D. Huling

Department of Statistics, University of Wisconsin‐Madison, Wisconsin 53706, U.S.A.

Search for more papers by this author
Menggang Yu

Corresponding Author

E-mail address: meyu@biostat.wisc.edu

Department of Biostatistics and Medical Informatics, University of Wisconsin‐Madison, Wisconsin 53792, U.S.A.

email: meyu@biostat.wisc.edu

Search for more papers by this author
Muxuan Liang

Department of Statistics, University of Wisconsin‐Madison, Wisconsin 53706, U.S.A.

Search for more papers by this author
Maureen Smith

Department of Population Health Sciences, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin 53792, U.S.A.

Search for more papers by this author
First published: 26 October 2017
Citations: 2

Summary

This article is motivated by the increasing need to model risk for large hospital and health care systems that provide services to diverse and complex patients. Often, heterogeneity across a population is determined by a set of factors such as chronic conditions. When these stratifying factors result in overlapping subpopulations, it is likely that the covariate effects for the overlapping groups have some similarity. We exploit this similarity by imposing structural constraints on the importance of variables in predicting outcomes such as hospital admission. Our basic assumption is that if a variable is important for a subpopulation with one of the chronic conditions, then it should be important for the subpopulation with both conditions. However, a variable can be important for the subpopulation with two particular chronic conditions but not for the subpopulations of people with just one of those two conditions. This assumption and its generalization to more conditions are reasonable and aid greatly in borrowing strength across the subpopulations. We prove an oracle property for our estimation method and show that even when the structural assumptions are misspecified, our method will still include all of the truly nonzero variables in large samples. We demonstrate impressive performance of our method in extensive numerical studies and on an application in hospital admission prediction and validation for the Medicare population of a large health care provider.

Number of times cited according to CrossRef: 2

  • A practical model for research with learning health systems: Building and implementing effective complex case management, Applied Ergonomics, 10.1016/j.apergo.2019.103023, 84, (103023), (2020).
  • Anticipatory Care in Potentially Preventable Hospitalizations: Making Data Sense of Complex Health Journeys, Frontiers in Public Health, 10.3389/fpubh.2018.00376, 6, (2019).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.