Volume 30, Issue 21
Research Article

Subgroup identification based on differential effect search—A recursive partitioning method for establishing response to treatment in patient subpopulations

Ilya Lipkovich

Corresponding Author

E-mail address: lipkovichia@lilly.com

Eli Lilly and Company, Indianapolis, IN, USA

Ilya Lipkovich, Eli Lilly and Company, Indianapolis, IN, USA.

E‐mail: lipkovichia@lilly.com

Search for more papers by this author
Alex Dmitrienko

E-mail address: alexei@lilly.com

Eli Lilly and Company, Indianapolis, IN, USA

Search for more papers by this author
Jonathan Denne

E-mail address: jon_denne@lilly.com

Eli Lilly and Company, Indianapolis, IN, USA

Search for more papers by this author
Gregory Enas

E-mail address: gge@lilly.com

Eli Lilly and Company, Indianapolis, IN, USA

Search for more papers by this author
First published: 22 July 2011
Citations: 90

Abstract

We propose a novel recursive partitioning method for identifying subgroups of subjects with enhanced treatment effects based on a differential effect search algorithm. The idea is to build a collection of subgroups by recursively partitioning a database into two subgroups at each parent group, such that the treatment effect within one of the two subgroups is maximized compared with the other subgroup. The process of data splitting continues until a predefined stopping condition has been satisfied. The method is similar to ‘interaction tree’ approaches that allow incorporation of a treatment‐by‐split interaction in the splitting criterion. However, unlike other tree‐based methods, this method searches only within specific regions of the covariate space and generates multiple subgroups of potential interest. We develop this method and provide guidance on key topics of interest that include generating multiple promising subgroups using different splitting criteria, choosing optimal values of complexity parameters via cross‐validation, and addressing Type I error rate inflation inherent in data mining applications using a resampling‐based method. We evaluate the operating characteristics of the procedure using a simulation study and illustrate the method with a clinical trial example. Copyright © 2011 John Wiley & Sons, Ltd.

Number of times cited according to CrossRef: 90

  • Building Health Application Recommender System Using Partially Penalized Regression, Statistical Modeling in Biomedical Research, 10.1007/978-3-030-33416-1_6, (105-123), (2020).
  • Exploratory Subgroup Identification for Biopharmaceutical Development, Design and Analysis of Subgroups with Biopharmaceutical Applications, 10.1007/978-3-030-40105-4_12, (245-270), (2020).
  • Biostatistics in Clinical Trials, Quantitative Methods in Pharmaceutical Research and Development, 10.1007/978-3-030-48555-9, (1-70), (2020).
  • Subgroup analysis with a nonparametric unimodal symmetric error distribution, Communications in Statistics - Theory and Methods, 10.1080/03610926.2019.1710754, (1-22), (2020).
  • A semiparametric Bayesian approach to population finding with time‐to‐event and toxicity data in a randomized clinical trial, Biometrics, 10.1111/biom.13289, 0, 0, (2020).
  • Subgroup analysis based on structured mixed-effects models for longitudinal data, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2020.1730867, (1-16), (2020).
  • Inference on Selected Subgroups in Clinical Trials, Journal of the American Statistical Association, 10.1080/01621459.2020.1740096, (1-18), (2020).
  • Precision medicine: Subgroup identification in longitudinal trajectories, Statistical Methods in Medical Research, 10.1177/0962280220904114, (096228022090411), (2020).
  • A multiple comparison procedure for dose‐finding trials with subpopulations, Biometrical Journal, 10.1002/bimj.201800111, 62, 1, (53-68), (2019).
  • Exploratory identification of predictive biomarkers in randomized trials with normal endpoints, Statistics in Medicine, 10.1002/sim.8452, 39, 7, (923-939), (2019).
  • Sodium Hexametaphosphate‐Modified Thermoplastic Starch Materials Prepared with the Assistance of Supercritical CO2, Starch - Stärke, 10.1002/star.201900055, 72, 1-2, (2019).
  • Joint model for recurrent event data with a cured fraction and a terminal event, Biometrical Journal, 10.1002/bimj.201800321, 62, 1, (24-33), (2019).
  • Structural Characterization of Controlled Decrystallization of Cassava Starch, Starch - Stärke, 10.1002/star.201900049, 72, 1-2, (2019).
  • Chitooligosaccharide Biguanidine Alleviates Liver Injury and Insulin Resistance in Type 2 Diabetic Rats, Starch - Stärke, 10.1002/star.201900203, 72, 1-2, (2019).
  • Stability and Thermodynamic Attributes of Starch Hydrolyzing α‐Amylase of Anoxybacillus rupiensis TS‐4, Starch - Stärke, 10.1002/star.201900105, 72, 1-2, (2019).
  • Preparation and Characterization of Composites of Hydroxypropyl Tapioca Starch and Zein, Starch - Stärke, 10.1002/star.201900204, 72, 1-2, (2019).
  • Latent variable models for harmonization of test scores: A case study on memory, Biometrical Journal, 10.1002/bimj.201800146, 62, 1, (34-52), (2019).
  • Physicochemical and Functional Characterization of Potato Starch (Solanum Tuberosum ssp. Andigenum) from the Quebrada De Humahuaca, Argentina, Starch - Stärke, 10.1002/star.201900069, 72, 1-2, (2019).
  • Joint mean–covariance random effect model for longitudinal data, Biometrical Journal, 10.1002/bimj.201800311, 62, 1, (7-23), (2019).
  • Influence of Storage Period on the Physicochemical Properties and In Vitro Digestibility of Starch in Packaged Cooked Chestnut Kernel, Starch - Stärke, 10.1002/star.201900080, 72, 1-2, (2019).
  • Complex Formation between Soy Proteins and Potato Starch: Effect of pH, Biopolymer Ratio, and Biopolymer Concentration, Starch - Stärke, 10.1002/star.201900020, 72, 1-2, (2019).
  • Germinated Wheat Starch as a Substrate to Produce Cyclodextrins: Application in Inclusion Complex to Improve the Thermal Stability of Orange Essential Oil, Starch - Stärke, 10.1002/star.201900083, 72, 1-2, (2019).
  • SIMEX for correction of dietary exposure effects with Box‐Cox transformed data, Biometrical Journal, 10.1002/bimj.201900066, 62, 1, (221-237), (2019).
  • A comparison of subgroup identification methods in clinical drug development: Simulation study and regulatory considerations, Pharmaceutical Statistics, 10.1002/pst.1951, 18, 5, (600-626), (2019).
  • Composite interaction tree for simultaneous learning of optimal individualized treatment rules and subgroups, Statistics in Medicine, 10.1002/sim.8105, 38, 14, (2632-2651), (2019).
  • A model‐based multithreshold method for subgroup identification, Statistics in Medicine, 10.1002/sim.8136, 38, 14, (2605-2631), (2019).
  • Subgroup identification via homogeneity pursuit for dense longitudinal/spatial data, Statistics in Medicine, 10.1002/sim.8192, 38, 17, (3256-3271), (2019).
  • PSICA: Decision trees for probabilistic subgroup identification with categorical treatments, Statistics in Medicine, 10.1002/sim.8308, 38, 22, (4436-4452), (2019).
  • Estimating individual treatment effects by gradient boosting trees, Statistics in Medicine, 10.1002/sim.8357, 38, 26, (5146-5159), (2019).
  • Subgroup identification for precision medicine: A comparative review of 13 methods, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10.1002/widm.1326, 9, 5, (2019).
  • Case‐only trees and random forests for exploring genotype‐specific treatment effects in randomized clinical trials with dichotomous end points, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/rssc.12366, 68, 5, (1371-1391), (2019).
  • Bibliography, Exploratory Subgroup Analyses in Clinical Research, 10.1002/9781119536734, (197-215), (2019).
  • ASIED: a Bayesian adaptive subgroup-identification enrichment design, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2019.1696356, (1-16), (2019).
  • Subgroup analysis and interpretation for phase 3 confirmatory trials: White paper of the EFSPI/PSI working group on subgroup analysis, Pharmaceutical Statistics, 10.1002/pst.1919, 18, 2, (126-139), (2018).
  • Subgroups from regression trees with adjustment for prognostic effects and postselection inference, Statistics in Medicine, 10.1002/sim.7677, 38, 4, (545-557), (2018).
  • Development of predictive signatures for treatment selection in precision medicine with survival outcomes, Pharmaceutical Statistics, 10.1002/pst.1842, 17, 2, (105-116), (2018).
  • Subgroup identification in dose‐finding trials via model‐based recursive partitioning, Statistics in Medicine, 10.1002/sim.7594, 37, 10, (1608-1624), (2018).
  • A recursive partitioning approach for subgroup identification in individual patient data meta‐analysis, Statistics in Medicine, 10.1002/sim.7609, 37, 9, (1550-1561), (2018).
  • Subgroup analysis with semiparametric models toward precision medicine, Statistics in Medicine, 10.1002/sim.7638, 37, 11, (1830-1845), (2018).
  • Random forests of interaction trees for estimating individualized treatment effects in randomized trials, Statistics in Medicine, 10.1002/sim.7660, 37, 17, (2547-2560), (2018).
  • A Bayesian Approach to Intervention-Based Clustering, ACM Transactions on Intelligent Systems and Technology, 10.1145/3156683, 9, 4, (1-23), (2018).
  • Methylation differences reveal heterogeneity in preterm pathophysiology: results from bipartite network analyses, Journal of Perinatal Medicine, 10.1515/jpm-2017-0126, 46, 5, (509-521), (2018).
  • Towards Team-Centered Informatics: Accelerating Innovation in Multidisciplinary Scientific Teams Through Visual Analytics, The Journal of Applied Behavioral Science, 10.1177/0021886318794606, (002188631879460), (2018).
  • Subgroup identification by recursive segmentation, Journal of Applied Statistics, 10.1080/02664763.2018.1444152, (1-24), (2018).
  • Hierarchical Bayes approach for subgroup analysis, Statistical Methods in Medical Research, 10.1177/0962280217721782, 28, 1, (275-288), (2017).
  • Optimal treatment assignment to maximize expected outcome with multiple treatments, Biometrics, 10.1111/biom.12811, 74, 2, (506-516), (2017).
  • Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: With an Application to Treating Type 2 Diabetes Patients With Insulin Therapies, Journal of the American Statistical Association, 10.1080/01621459.2017.1303386, 113, 521, (1-13), (2017).
  • Estimates of subgroup treatment effects in overall nonsignificant trials: To what extent should we believe in them?, Pharmaceutical Statistics, 10.1002/pst.1810, 16, 4, (280-295), (2017).
  • Enhancement of the adaptive signature design for learning and confirming in a single pivotal trial, Pharmaceutical Statistics, 10.1002/pst.1811, 16, 5, (312-321), (2017).
  • Patient subgroup identification for clinical drug development, Statistics in Medicine, 10.1002/sim.7236, 36, 9, (1414-1428), (2017).
  • Subgroup finding via Bayesian additive regression trees, Statistics in Medicine, 10.1002/sim.7276, 36, 15, (2391-2403), (2017).
  • Multiplicity considerations in subgroup analysis, Statistics in Medicine, 10.1002/sim.7416, 36, 28, (4446-4454), (2017).
  • Subgroup detection and sample size calculation with proportional hazards regression for survival data, Statistics in Medicine, 10.1002/sim.7441, 36, 29, (4646-4659), (2017).
  • Bayesian population finding with biomarkers in a randomized clinical trial, Biometrics, 10.1111/biom.12677, 73, 4, (1355-1365), (2017).
  • Sparse Estimation and Uncertainty with Application to Subgroup Analysis, Political Analysis, 10.1017/pan.2016.14, 25, 1, (1-40), (2017).
  • Evaluating the impact of treating the optimal subgroup, Statistical Methods in Medical Research, 10.1177/0962280217708664, 26, 4, (1630-1640), (2017).
  • Comparing Four Methods for Estimating Tree-Based Treatment Regimes, The International Journal of Biostatistics, 10.1515/ijb-2016-0068, 13, 1, (2017).
  • Differential Treatment Response to Insulin Intensification Therapy: A Post Hoc Analysis of a Randomized Trial Comparing Premixed and Basal-Bolus Insulin Regimens, Diabetes Therapy, 10.1007/s13300-017-0286-z, 8, 4, (915-928), (2017).
  • A comparative study of subgroup identification methods for differential treatment effect: Performance metrics and recommendations, Statistical Methods in Medical Research, 10.1177/0962280217710570, (096228021771057), (2017).
  • Identification of predicted individual treatment effects in randomized clinical trials, Statistical Methods in Medical Research, 10.1177/0962280215623981, 27, 1, (142-157), (2016).
  • Model averaging for treatment effect estimation in subgroups, Pharmaceutical Statistics, 10.1002/pst.1796, 16, 2, (133-142), (2016).
  • Tutorial in biostatistics: data‐driven subgroup identification and analysis in clinical trials, Statistics in Medicine, 10.1002/sim.7064, 36, 1, (136-196), (2016).
  • A subgroup cluster‐based Bayesian adaptive design for precision medicine, Biometrics, 10.1111/biom.12613, 73, 2, (367-377), (2016).
  • Greedy outcome weighted tree learning of optimal personalized treatment rules, Biometrics, 10.1111/biom.12593, 73, 2, (391-400), (2016).
  • Ensemble survival trees for identifying subpopulations in personalized medicine, Biometrical Journal, 10.1002/bimj.201500075, 58, 5, (1151-1163), (2016).
  • Exploratory subgroup analysis in clinical trials by model selection, Biometrical Journal, 10.1002/bimj.201500147, 58, 5, (1217-1228), (2016).
  • Identification of subpopulations with distinct treatment benefit rate using the Bayesian tree, Biometrical Journal, 10.1002/bimj.201500180, 58, 6, (1357-1375), (2016).
  • Estimating optimal treatment regimes via subgroup identification in randomized control trials and observational studies, Statistics in Medicine, 10.1002/sim.6920, 35, 19, (3285-3302), (2016).
  • Personalized Effective Dose Selection in Dose Ranging Studies, Statistical Applications from Clinical Trials and Personalized Medicine to Finance and Business Analytics, 10.1007/978-3-319-42568-9_8, (91-104), (2016).
  • Individualizing drug dosage with longitudinal data, Statistics in Medicine, 10.1002/sim.7016, 35, 24, (4474-4488), (2016).
  • Identification of subgroups with differential treatment effects for longitudinal and multiresponse variables, Statistics in Medicine, 10.1002/sim.7020, 35, 26, (4837-4855), (2016).
  • A Bayesian credible subgroups approach to identifying patient subgroups with positive treatment effects, Biometrics, 10.1111/biom.12522, 72, 4, (1026-1036), (2016).
  • Subgroup analyses in confirmatory clinical trials: time to be specific about their purposes, BMC Medical Research Methodology, 10.1186/s12874-016-0122-6, 16, 1, (2016).
  • Model-Based Recursive Partitioning for Subgroup Analyses, The International Journal of Biostatistics, 10.1515/ijb-2015-0032, 12, 1, (45-63), (2016).
  • Identifying back pain subgroups: developing and applying approaches using individual patient data collected within clinical trials, Programme Grants for Applied Research, 10.3310/pgfar04100, 4, 10, (1-278), (2016).
  • Methods for identification and confirmation of targeted subgroups in clinical trials: A systematic review, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2015.1092034, 26, 1, (99-119), (2015).
  • A regression tree approach to identifying subgroups with differential treatment effects, Statistics in Medicine, 10.1002/sim.6454, 34, 11, (1818-1833), (2015).
  • Identification of drug‐induced toxicity biomarkers for treatment determination, Pharmaceutical Statistics, 10.1002/pst.1684, 14, 4, (284-293), (2015).
  • Regularized outcome weighted subgroup identification for differential treatment effects, Biometrics, 10.1111/biom.12322, 71, 3, (645-653), (2015).
  • Tree-based methods for individualized treatment regimes, Biometrika, 10.1093/biomet/asv028, 102, 3, (501-514), (2015).
  • Subgroup identification for treatment selection in biomarker adaptive design, BMC Medical Research Methodology, 10.1186/s12874-015-0098-7, 15, 1, (2015).
  • Comment, Statistics in Biopharmaceutical Research, 10.1080/19466315.2015.1095795, 7, 4, (305-308), (2015).
  • A PRIM approach to predictive‐signature development for patient stratification, Statistics in Medicine, 10.1002/sim.6343, 34, 2, (317-342), (2014).
  • Fair treatment comparisons in observational research, Statistical Analysis and Data Mining: The ASA Data Science Journal, 10.1002/sam.11235, 7, 5, (376-384), (2014).
  • Rejoinder, International Statistical Review, 10.1111/insr.12057, 82, 3, (367-370), (2014).
  • Binary classification with pFDR‐pFNR losses, Biometrical Journal, 10.1002/bimj.201200054, 55, 3, (463-477), (2014).
  • Qualitative interaction trees: a tool to identify qualitative treatment–subgroup interactions, Statistics in Medicine, 10.1002/sim.5933, 33, 2, (219-237), (2013).
  • Local control for identifying subgroups of interest in observational research: persistence of treatment for major depressive disorder, International Journal of Methods in Psychiatric Research, 10.1002/mpr.1390, 22, 3, (185-194), (2013).
  • Treatment Benefit and Treatment Harm Rate to Characterize Heterogeneity in Treatment Effect, Biometrics, 10.1111/biom.12038, 69, 3, (724-731), (2013).
  • A nonparametric method for value function guided subgroup identification via gradient tree boosting for censored survival data, Statistics in Medicine, 10.1002/sim.8714, 0, 0, (undefined).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.