Volume 9, Issue 5
ADVANCED REVIEW

Subgroup identification for precision medicine: A comparative review of 13 methods

Wei‐Yin Loh

Corresponding Author

E-mail address: loh@stat.wisc.edu

Department of Statistics, University of Wisconsin, Madison, Wisconsin

Correspondence

Wei‐Yin Loh, Department of Statistics, University of Wisconsin, Madison, WI.

Email: loh@stat.wisc.edu

Search for more papers by this author
Luxi Cao

Department of Statistics, University of Wisconsin, Madison, Wisconsin

Search for more papers by this author
Peigen Zhou

Department of Statistics, University of Wisconsin, Madison, Wisconsin

Search for more papers by this author
First published: 09 June 2019
Citations: 1
Funding information National Science Foundation, Grant/Award Number: DMS‐1305725; University of Wisconsin Graduate School

Abstract

Natural heterogeneity in patient populations can make it very hard to develop treatments that benefit all patients. As a result, an important goal of precision medicine is identification of patient subgroups that respond to treatment at a much higher (or lower) rate than the population average. Despite there being many subgroup identification methods, there is no comprehensive comparative study of their statistical properties. We review 13 methods and use real‐world and simulated data to compare the performance of their publicly available software using seven criteria: (a) bias in selection of subgroup variables, (b) probability of false discovery, (c) probability of identifying correct predictive variables, (d) bias in estimates of subgroup treatment effects, (e) expected subgroup size, (f) expected true treatment effect of subgroups, and (g) subgroup stability. The results show that many methods fare poorly on at least one criterion.

This article is categorized under:

  • Technologies > Machine Learning
  • Algorithmic Development > Hierarchies and Trees
  • Algorithmic Development > Statistics
  • Application Areas > Health Care

Abstract

Subgroup (in green) for breast cancer data; sample sizes and estimated treatment effects (log relative risks) beside and below nodes

Number of times cited according to CrossRef: 1

  • Machine Learning for Precision Health Economics and Outcomes Research (P-HEOR): Conceptual Review of Applications and Next Steps, Journal of Health Economics and Outcomes Research, 10.36469/jheor.2020.12698, (35-42), (2020).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.