Subgroup identification based on differential effect search—A recursive partitioning method for establishing response to treatment in patient subpopulations
Abstract
We propose a novel recursive partitioning method for identifying subgroups of subjects with enhanced treatment effects based on a differential effect search algorithm. The idea is to build a collection of subgroups by recursively partitioning a database into two subgroups at each parent group, such that the treatment effect within one of the two subgroups is maximized compared with the other subgroup. The process of data splitting continues until a predefined stopping condition has been satisfied. The method is similar to ‘interaction tree’ approaches that allow incorporation of a treatment‐by‐split interaction in the splitting criterion. However, unlike other tree‐based methods, this method searches only within specific regions of the covariate space and generates multiple subgroups of potential interest. We develop this method and provide guidance on key topics of interest that include generating multiple promising subgroups using different splitting criteria, choosing optimal values of complexity parameters via cross‐validation, and addressing Type I error rate inflation inherent in data mining applications using a resampling‐based method. We evaluate the operating characteristics of the procedure using a simulation study and illustrate the method with a clinical trial example. Copyright © 2011 John Wiley & Sons, Ltd.
Citing Literature
Number of times cited according to CrossRef: 90
- Eun Jeong Oh, Min Qian, Ken Cheung, David C. Mohr, Building Health Application Recommender System Using Partially Penalized Regression, Statistical Modeling in Biomedical Research, 10.1007/978-3-030-33416-1_6, (105-123), (2020).
- Xin Huang, Yihua Gu, Yan Sun, Ivan S. F. Chan, Exploratory Subgroup Identification for Biopharmaceutical Development, Design and Analysis of Subgroups with Biopharmaceutical Applications, 10.1007/978-3-030-40105-4_12, (245-270), (2020).
- Olga V. Marchenko, Lisa M. LaVange, Natallia V. Katenka, Biostatistics in Clinical Trials, Quantitative Methods in Pharmaceutical Research and Development, 10.1007/978-3-030-48555-9, (1-70), (2020).
- Ao Yuan, Yizhao Zhou, Ming T. Tan, Subgroup analysis with a nonparametric unimodal symmetric error distribution, Communications in Statistics - Theory and Methods, 10.1080/03610926.2019.1710754, (1-22), (2020).
- Satoshi Morita, Peter Müller, Hiroyasu Abe, A semiparametric Bayesian approach to population finding with time‐to‐event and toxicity data in a randomized clinical trial, Biometrics, 10.1111/biom.13289, 0, 0, (2020).
- Juan Shen, Annie Qu, Subgroup analysis based on structured mixed-effects models for longitudinal data, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2020.1730867, (1-16), (2020).
- Xinzhou Guo, Xuming He, Inference on Selected Subgroups in Clinical Trials, Journal of the American Statistical Association, 10.1080/01621459.2020.1740096, (1-18), (2020).
- Yishu Wei, Lei Liu, Xiaogang Su, Lihui Zhao, Hongmei Jiang, Precision medicine: Subgroup identification in longitudinal trajectories, Statistical Methods in Medical Research, 10.1177/0962280220904114, (096228022090411), (2020).
- Marius Thomas, Björn Bornkamp, Martin Posch, Franz König, A multiple comparison procedure for dose‐finding trials with subpopulations, Biometrical Journal, 10.1002/bimj.201800111, 62, 1, (53-68), (2019).
- Julia Krzykalla, Axel Benner, Annette Kopp‐Schneider, Exploratory identification of predictive biomarkers in randomized trials with normal endpoints, Statistics in Medicine, 10.1002/sim.8452, 39, 7, (923-939), (2019).
- Si‐yu Mi, Liang‐shuang Sun, James Runt, Mu‐chen Kuo, Kuo‐shien Huang, Jen‐taut Yeh, Sodium Hexametaphosphate‐Modified Thermoplastic Starch Materials Prepared with the Assistance of Supercritical CO2, Starch - Stärke, 10.1002/star.201900055, 72, 1-2, (2019).
- Yang‐Jin Kim, Joint model for recurrent event data with a cured fraction and a terminal event, Biometrical Journal, 10.1002/bimj.201800321, 62, 1, (24-33), (2019).
- Ri‐Hui Lin, Yan‐Ye Fan, Tao Liu, Hui Yang, Li‐Juan Ma, Xia‐Jie Huang, Yue Liu, Structural Characterization of Controlled Decrystallization of Cassava Starch, Starch - Stärke, 10.1002/star.201900049, 72, 1-2, (2019).
- Liyan Zhao, Qifang Zheng, Yalu Zou, Yuanyuan Wang, Yuntang Wu, Xiaofei Liu, Chitooligosaccharide Biguanidine Alleviates Liver Injury and Insulin Resistance in Type 2 Diabetic Rats, Starch - Stärke, 10.1002/star.201900203, 72, 1-2, (2019).
- Bhavtosh A. Kikani, Susen Kourien, Upasna Rathod, Stability and Thermodynamic Attributes of Starch Hydrolyzing α‐Amylase of Anoxybacillus rupiensis TS‐4, Starch - Stärke, 10.1002/star.201900105, 72, 1-2, (2019).
- Jie Liu, Rui Lai, Xiangli Wang, Haiyang Wang, Yawei Liu, Preparation and Characterization of Composites of Hydroxypropyl Tapioca Starch and Zein, Starch - Stärke, 10.1002/star.201900204, 72, 1-2, (2019).
- Edwin R. Heuvel, Lauren E. Griffith, Nazmul Sohel, Isabel Fortier, Graciela Muniz‐Terrera, Parminder Raina, Latent variable models for harmonization of test scores: A case study on memory, Biometrical Journal, 10.1002/bimj.201800146, 62, 1, (34-52), (2019).
- Sonia Calliope, Jorge Wagner, Norma Samman, Physicochemical and Functional Characterization of Potato Starch (Solanum Tuberosum ssp. Andigenum) from the Quebrada De Humahuaca, Argentina, Starch - Stärke, 10.1002/star.201900069, 72, 1-2, (2019).
- Yongxin Bai, Manling Qian, Maozai Tian, Joint mean–covariance random effect model for longitudinal data, Biometrical Journal, 10.1002/bimj.201800311, 62, 1, (7-23), (2019).
- Tianyi Ding, Lina Kan, Yanwen Wu, Yun Bai, Jie Ouyang, Influence of Storage Period on the Physicochemical Properties and In Vitro Digestibility of Starch in Packaged Cooked Chestnut Kernel, Starch - Stärke, 10.1002/star.201900080, 72, 1-2, (2019).
- Die Dong, Zhengliang Qi, Bo Cui, Complex Formation between Soy Proteins and Potato Starch: Effect of pH, Biopolymer Ratio, and Biopolymer Concentration, Starch - Stärke, 10.1002/star.201900020, 72, 1-2, (2019).
- Dianini Hüttner Kringel, Julia Baranzelli, Jéssie Da Natividade Schöffer, Shanise Lisie Mello El Halal, Martha Zavariz De Miranda, Alvaro Renato Guerra Dias, Elessandra Da Rosa Zavareze, Germinated Wheat Starch as a Substrate to Produce Cyclodextrins: Application in Inclusion Complex to Improve the Thermal Stability of Orange Essential Oil, Starch - Stärke, 10.1002/star.201900083, 72, 1-2, (2019).
- Timm Intemann, Kirsten Mehlig, Stefaan De Henauw, Alfonso Siani, Tassos Constantinou, Luis A. Moreno, Dénes Molnár, Toomas Veidebaum, Iris Pigeot, SIMEX for correction of dietary exposure effects with Box‐Cox transformed data, Biometrical Journal, 10.1002/bimj.201900066, 62, 1, (221-237), (2019).
- Cynthia Huber, Norbert Benda, Tim Friede, A comparison of subgroup identification methods in clinical drug development: Simulation study and regulatory considerations, Pharmaceutical Statistics, 10.1002/pst.1951, 18, 5, (600-626), (2019).
- Xin Qiu, Yuanjia Wang, Composite interaction tree for simultaneous learning of optimal individualized treatment rules and subgroups, Statistics in Medicine, 10.1002/sim.8105, 38, 14, (2632-2651), (2019).
- Jingli Wang, Jialiang Li, Yaguang Li, Weng Kee Wong, A model‐based multithreshold method for subgroup identification, Statistics in Medicine, 10.1002/sim.8136, 38, 14, (2605-2631), (2019).
- Jialiang Li, Mu Yue, Wenyang Zhang, Subgroup identification via homogeneity pursuit for dense longitudinal/spatial data, Statistics in Medicine, 10.1002/sim.8192, 38, 17, (3256-3271), (2019).
- Oleg Sysoev, Krzysztof Bartoszek, Eva‐Charlotte Ekström, Katarina Ekholm Selling, PSICA: Decision trees for probabilistic subgroup identification with categorical treatments, Statistics in Medicine, 10.1002/sim.8308, 38, 22, (4436-4452), (2019).
- Shonosuke Sugasawa, Hisashi Noma, Estimating individual treatment effects by gradient boosting trees, Statistics in Medicine, 10.1002/sim.8357, 38, 26, (5146-5159), (2019).
- Wei‐Yin Loh, Luxi Cao, Peigen Zhou, Subgroup identification for precision medicine: A comparative review of 13 methods, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10.1002/widm.1326, 9, 5, (2019).
- James Y. Dai, Michael LeBlanc, Case‐only trees and random forests for exploring genotype‐specific treatment effects in randomized clinical trials with dichotomous end points, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/rssc.12366, 68, 5, (1371-1391), (2019).
- Gerd Rosenkranz, Bibliography, Exploratory Subgroup Analyses in Clinical Research, 10.1002/9781119536734, (197-215), (2019).
- Yanxun Xu, Florica Constantine, Yuan Yuan, Yili L. Pritchett, ASIED: a Bayesian adaptive subgroup-identification enrichment design, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2019.1696356, (1-16), (2019).
- Aaron Dane, Amy Spencer, Gerd Rosenkranz, Ilya Lipkovich, Tom Parke, Subgroup analysis and interpretation for phase 3 confirmatory trials: White paper of the EFSPI/PSI working group on subgroup analysis, Pharmaceutical Statistics, 10.1002/pst.1919, 18, 2, (126-139), (2018).
- Wei‐Yin Loh, Michael Man, Shuaicheng Wang, Subgroups from regression trees with adjustment for prognostic effects and postselection inference, Statistics in Medicine, 10.1002/sim.7677, 38, 4, (545-557), (2018).
- Yu‐Chuan Chen, Un Jung Lee, Chen‐An Tsai, James J. Chen, Development of predictive signatures for treatment selection in precision medicine with survival outcomes, Pharmaceutical Statistics, 10.1002/pst.1842, 17, 2, (105-116), (2018).
- Marius Thomas, Björn Bornkamp, Heidi Seibold, Subgroup identification in dose‐finding trials via model‐based recursive partitioning, Statistics in Medicine, 10.1002/sim.7594, 37, 10, (1608-1624), (2018).
- Dipesh Mistry, Nigel Stallard, Martin Underwood, A recursive partitioning approach for subgroup identification in individual patient data meta‐analysis, Statistics in Medicine, 10.1002/sim.7609, 37, 9, (1550-1561), (2018).
- Ao Yuan, Xiaofei Chen, Yizhao Zhou, Ming T. Tan, Subgroup analysis with semiparametric models toward precision medicine, Statistics in Medicine, 10.1002/sim.7638, 37, 11, (1830-1845), (2018).
- Xiaogang Su, Annette T. Peña, Lei Liu, Richard A. Levine, Random forests of interaction trees for estimating individualized treatment effects in randomized trials, Statistics in Medicine, 10.1002/sim.7660, 37, 17, (2547-2560), (2018).
- Igor Kulev, Pearl Pu, Boi Faltings, A Bayesian Approach to Intervention-Based Clustering, ACM Transactions on Intelligent Systems and Technology, 10.1145/3156683, 9, 4, (1-23), (2018).
- Suresh K. Bhavnani, Bryant Dang, Varun Kilaru, Maria Caro, Shyam Visweswaran, George Saade, Alicia K. Smith, Ramkumar Menon, Methylation differences reveal heterogeneity in preterm pathophysiology: results from bipartite network analyses, Journal of Perinatal Medicine, 10.1515/jpm-2017-0126, 46, 5, (509-521), (2018).
- Suresh K. Bhavnani, Shyam Visweswaran, Rohit Divekar, Allan R. Brasier, Towards Team-Centered Informatics: Accelerating Innovation in Multidisciplinary Scientific Teams Through Visual Analytics, The Journal of Applied Behavioral Science, 10.1177/0021886318794606, (002188631879460), (2018).
- Alexander Hapfelmeier, Kurt Ulm, Bernhard Haller, Subgroup identification by recursive segmentation, Journal of Applied Statistics, 10.1080/02664763.2018.1444152, (1-24), (2018).
- Yu-Yi Hsu, Jyoti Zalkikar, Ram C Tiwari, Hierarchical Bayes approach for subgroup analysis, Statistical Methods in Medical Research, 10.1177/0962280217721782, 28, 1, (275-288), (2017).
- Zhilan Lou, Jun Shao, Menggang Yu, Optimal treatment assignment to maximize expected outcome with multiple treatments, Biometrics, 10.1111/biom.12811, 74, 2, (506-516), (2017).
- Yuanjia Wang, Haoda Fu, Donglin Zeng, Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: With an Application to Treating Type 2 Diabetes Patients With Insulin Therapies, Journal of the American Statistical Association, 10.1080/01621459.2017.1303386, 113, 521, (1-13), (2017).
- Julien Tanniou, Ingeborg Tweel, Steven Teerenstra, Kit C.B. Roes, Estimates of subgroup treatment effects in overall nonsignificant trials: To what extent should we believe in them?, Pharmaceutical Statistics, 10.1002/pst.1810, 16, 4, (280-295), (2017).
- Gu Mi, Enhancement of the adaptive signature design for learning and confirming in a single pivotal trial, Pharmaceutical Statistics, 10.1002/pst.1811, 16, 5, (312-321), (2017).
- Xin Huang, Yan Sun, Paul Trow, Saptarshi Chatterjee, Arunava Chakravartty, Lu Tian, Viswanath Devanarayan, Patient subgroup identification for clinical drug development, Statistics in Medicine, 10.1002/sim.7236, 36, 9, (1414-1428), (2017).
- Siva. Sivaganesan, Peter Müller, Bin Huang, Subgroup finding via Bayesian additive regression trees, Statistics in Medicine, 10.1002/sim.7276, 36, 15, (2391-2403), (2017).
- Alex Dmitrienko, Brian Millen, Ilya Lipkovich, Multiplicity considerations in subgroup analysis, Statistics in Medicine, 10.1002/sim.7416, 36, 28, (4446-4454), (2017).
- Suhyun Kang, Wenbin Lu, Rui Song, Subgroup detection and sample size calculation with proportional hazards regression for survival data, Statistics in Medicine, 10.1002/sim.7441, 36, 29, (4646-4659), (2017).
- Satoshi Morita, Peter Müller, Bayesian population finding with biomarkers in a randomized clinical trial, Biometrics, 10.1111/biom.12677, 73, 4, (1355-1365), (2017).
- Marc Ratkovic, Dustin Tingley, Sparse Estimation and Uncertainty with Application to Subgroup Analysis, Political Analysis, 10.1017/pan.2016.14, 25, 1, (1-40), (2017).
- Alexander R Luedtke, Mark J van der Laan, Evaluating the impact of treating the optimal subgroup, Statistical Methods in Medical Research, 10.1177/0962280217708664, 26, 4, (1630-1640), (2017).
- Aniek Sies, Iven Van Mechelen, Comparing Four Methods for Estimating Tree-Based Treatment Regimes, The International Journal of Biostatistics, 10.1515/ijb-2016-0068, 13, 1, (2017).
- Li Xin Shi, Peng Fei Li, Jia Ning Hou, Differential Treatment Response to Insulin Intensification Therapy: A Post Hoc Analysis of a Randomized Trial Comparing Premixed and Basal-Bolus Insulin Regimens, Diabetes Therapy, 10.1007/s13300-017-0286-z, 8, 4, (915-928), (2017).
- Demissie Alemayehu, Yang Chen, Marianthi Markatou, A comparative study of subgroup identification methods for differential treatment effect: Performance metrics and recommendations, Statistical Methods in Medical Research, 10.1177/0962280217710570, (096228021771057), (2017).
- Andrea Lamont, Michael D Lyons, Thomas Jaki, Elizabeth Stuart, Daniel J Feaster, Kukatharmini Tharmaratnam, Daniel Oberski, Hemant Ishwaran, Dawn K Wilson, M Lee Van Horn, Identification of predicted individual treatment effects in randomized clinical trials, Statistical Methods in Medical Research, 10.1177/0962280215623981, 27, 1, (142-157), (2016).
- Björn Bornkamp, David Ohlssen, Baldur P. Magnusson, Heinz Schmidli, Model averaging for treatment effect estimation in subgroups, Pharmaceutical Statistics, 10.1002/pst.1796, 16, 2, (133-142), (2016).
- Ilya Lipkovich, Alex Dmitrienko, Ralph B., Tutorial in biostatistics: data‐driven subgroup identification and analysis in clinical trials, Statistics in Medicine, 10.1002/sim.7064, 36, 1, (136-196), (2016).
- Wentian Guo, Yuan Ji, Daniel V. T. Catenacci, A subgroup cluster‐based Bayesian adaptive design for precision medicine, Biometrics, 10.1111/biom.12613, 73, 2, (367-377), (2016).
- Ruoqing Zhu, Ying‐Qi Zhao, Guanhua Chen, Shuangge Ma, Hongyu Zhao, Greedy outcome weighted tree learning of optimal personalized treatment rules, Biometrics, 10.1111/biom.12593, 73, 2, (391-400), (2016).
- Yu‐Chuan Chen, James J. Chen, Ensemble survival trees for identifying subpopulations in personalized medicine, Biometrical Journal, 10.1002/bimj.201500075, 58, 5, (1151-1163), (2016).
- Gerd K. Rosenkranz, Exploratory subgroup analysis in clinical trials by model selection, Biometrical Journal, 10.1002/bimj.201500147, 58, 5, (1217-1228), (2016).
- Changyu Shen, Yang Hu, Xiaochun Li, Yadong Wang, Peng‐Sheng Chen, Alfred E. Buxton, Identification of subpopulations with distinct treatment benefit rate using the Bayesian tree, Biometrical Journal, 10.1002/bimj.201500180, 58, 6, (1357-1375), (2016).
- Haoda Fu, Jin Zhou, Douglas E. Faries, Estimating optimal treatment regimes via subgroup identification in randomized control trials and observational studies, Statistics in Medicine, 10.1002/sim.6920, 35, 19, (3285-3302), (2016).
- Xiwen Ma, Wei Zheng, Yuefeng Lu, Personalized Effective Dose Selection in Dose Ranging Studies, Statistical Applications from Clinical Trials and Personalized Medicine to Finance and Business Analytics, 10.1007/978-3-319-42568-9_8, (91-104), (2016).
- Xiaolu Zhu, Annie Qu, Individualizing drug dosage with longitudinal data, Statistics in Medicine, 10.1002/sim.7016, 35, 24, (4474-4488), (2016).
- Wei‐Yin Loh, Haoda Fu, Michael Man, Victoria Champion, Menggang Yu, Identification of subgroups with differential treatment effects for longitudinal and multiresponse variables, Statistics in Medicine, 10.1002/sim.7020, 35, 26, (4837-4855), (2016).
- Patrick M. Schnell, Qi Tang, Walter W. Offen, Bradley P. Carlin, A Bayesian credible subgroups approach to identifying patient subgroups with positive treatment effects, Biometrics, 10.1111/biom.12522, 72, 4, (1026-1036), (2016).
- Julien Tanniou, Ingeborg van der Tweel, Steven Teerenstra, Kit C. B. Roes, Subgroup analyses in confirmatory clinical trials: time to be specific about their purposes, BMC Medical Research Methodology, 10.1186/s12874-016-0122-6, 16, 1, (2016).
- Heidi Seibold, Achim Zeileis, Torsten Hothorn, Model-Based Recursive Partitioning for Subgroup Analyses, The International Journal of Biostatistics, 10.1515/ijb-2015-0032, 12, 1, (45-63), (2016).
- Shilpa Patel, Siew Wan Hee, Dipesh Mistry, Jake Jordan, Sally Brown, Melina Dritsaki, David R Ellard, Tim Friede, Sarah E Lamb, Joanne Lord, Jason Madan, Tom Morris, Nigel Stallard, Colin Tysall, Adrian Willis, Martin Underwood, Identifying back pain subgroups: developing and applying approaches using individual patient data collected within clinical trials, Programme Grants for Applied Research, 10.3310/pgfar04100, 4, 10, (1-278), (2016).
- Thomas Ondra, Alex Dmitrienko, Tim Friede, Alexandra Graf, Frank Miller, Nigel Stallard, Martin Posch, Methods for identification and confirmation of targeted subgroups in clinical trials: A systematic review, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2015.1092034, 26, 1, (99-119), (2015).
- Wei‐Yin Loh, Xu He, Michael Man, A regression tree approach to identifying subgroups with differential treatment effects, Statistics in Medicine, 10.1002/sim.6454, 34, 11, (1818-1833), (2015).
- Tzu‐Pin Lu, James J. Chen, Identification of drug‐induced toxicity biomarkers for treatment determination, Pharmaceutical Statistics, 10.1002/pst.1684, 14, 4, (284-293), (2015).
- Yaoyao Xu, Menggang Yu, Ying‐Qi Zhao, Quefeng Li, Sijian Wang, Jun Shao, Regularized outcome weighted subgroup identification for differential treatment effects, Biometrics, 10.1111/biom.12322, 71, 3, (645-653), (2015).
- E. B. Laber, Y. Q. Zhao, Tree-based methods for individualized treatment regimes, Biometrika, 10.1093/biomet/asv028, 102, 3, (501-514), (2015).
- Tzu-Pin Lu, James J. Chen, Subgroup identification for treatment selection in biomarker adaptive design, BMC Medical Research Methodology, 10.1186/s12874-015-0098-7, 15, 1, (2015).
- Robert Hemmings, Comment, Statistics in Biopharmaceutical Research, 10.1080/19466315.2015.1095795, 7, 4, (305-308), (2015).
- Gong Chen, Hua Zhong, Anton Belousov, Viswanath Devanarayan, A PRIM approach to predictive‐signature development for patient stratification, Statistics in Medicine, 10.1002/sim.6343, 34, 2, (317-342), (2014).
- Kenneth K. Lopiano, Robert L. Obenchain, S. Stanley Young, Fair treatment comparisons in observational research, Statistical Analysis and Data Mining: The ASA Data Science Journal, 10.1002/sam.11235, 7, 5, (376-384), (2014).
- Wei‐Yin Loh, Rejoinder, International Statistical Review, 10.1111/insr.12057, 82, 3, (367-370), (2014).
- Thorsten Dickhaus, Benjamin Blankertz, Frank C. Meinecke, Binary classification with pFDR‐pFNR losses, Biometrical Journal, 10.1002/bimj.201200054, 55, 3, (463-477), (2014).
- Elise Dusseldorp, Iven Van Mechelen, Qualitative interaction trees: a tool to identify qualitative treatment–subgroup interactions, Statistics in Medicine, 10.1002/sim.5933, 33, 2, (219-237), (2013).
- Douglas E. Faries, Yi Chen, Ilya Lipkovich, Anthony Zagar, Xianchen Liu, Robert L. Obenchain, Local control for identifying subgroups of interest in observational research: persistence of treatment for major depressive disorder, International Journal of Methods in Psychiatric Research, 10.1002/mpr.1390, 22, 3, (185-194), (2013).
- Changyu Shen, Jaesik Jeong, Xiaochun Li, Peng‐Sheng Chen, Alfred Buxton, Treatment Benefit and Treatment Harm Rate to Characterize Heterogeneity in Treatment Effect, Biometrics, 10.1111/biom.12038, 69, 3, (724-731), (2013).
- Pingye Zhang, Junshui Ma, Xinqun Chen, Yue Shentu, A nonparametric method for value function guided subgroup identification via gradient tree boosting for censored survival data, Statistics in Medicine, 10.1002/sim.8714, 0, 0, (undefined).




