Nonparametric survival analysis using Bayesian Additive Regression Trees (BART)
Abstract
Bayesian additive regression trees (BART) provide a framework for flexible nonparametric modeling of relationships of covariates to outcomes. Recently, BART models have been shown to provide excellent predictive performance, for both continuous and binary outcomes, and exceeding that of its competitors. Software is also readily available for such outcomes. In this article, we introduce modeling that extends the usefulness of BART in medical applications by addressing needs arising in survival analysis. Simulation studies of one‐sample and two‐sample scenarios, in comparison with long‐standing traditional methods, establish face validity of the new approach. We then demonstrate the model's ability to accommodate data from complex regression models with a simulation study of a nonproportional hazards scenario with crossing survival functions and survival function estimation in a scenario where hazards are multiplicatively modified by a highly nonlinear function of the covariates. Using data from a recently published study of patients undergoing hematopoietic stem cell transplantation, we illustrate the use and some advantages of the proposed method in medical investigations. Copyright © 2016 John Wiley & Sons, Ltd.
Citing Literature
Number of times cited according to CrossRef: 28
- Alexis Bellot, Mihaela Van Der Schaar, Flexible Modelling of Longitudinal Medical Data, ACM Transactions on Computing for Healthcare, 10.1145/3377164, 1, 1, (1-15), (2020).
- David Watson, The Rhetoric and Reality of Anthropomorphism in Artificial Intelligence, The 2019 Yearbook of the Digital Ethics Lab, 10.1007/978-3-030-29145-7_4, (45-65), (2020).
- Jennifer Hill, Antonio Linero, Jared Murray, Bayesian Additive Regression Trees: A Review and Look Forward, Annual Review of Statistics and Its Application, 10.1146/annurev-statistics-031219-041110, 7, 1, (251-278), (2020).
- Yanxun Xu, Daniel Scharfstein, Peter Müller, Michael Daniels, A Bayesian nonparametric approach for evaluating the causal effect of treatment in randomized trials with semi-competing risks, Biostatistics, 10.1093/biostatistics/kxaa008, (2020).
- Guangzhi Ma, Xuejing Zhao, Regression of survival data via twin support vector regression, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2020.1757710, (1-13), (2020).
- Satoshi Morita, Peter Müller, Hiroyasu Abe, A semiparametric Bayesian approach to population finding with time‐to‐event and toxicity data in a randomized clinical trial, Biometrics, 10.1111/biom.13289, 0, 0, (2020).
- Yushu Shi, Purushottam Laud, Joan Neuner, A dependent Dirichlet process model for survival data with competing risks, Lifetime Data Analysis, 10.1007/s10985-020-09506-0, (2020).
- Antonio R. Linero, Debajyoti Sinha, Stuart R. Lipsitz, Semiparametric mixed‐scale models using shared Bayesian forests, Biometrics, 10.1111/biom.13107, 76, 1, (131-144), (2019).
- Shannon Wongvibulsin, Katherine C. Wu, Scott L. Zeger, Clinical risk prediction with random forests for survival, longitudinal, and multivariate (RF-SLAM) data analysis, BMC Medical Research Methodology, 10.1186/s12874-019-0863-0, 20, 1, (2019).
- Rodney Sparapani, Noura M. Dabbouseh, David Gutterman, Jun Zhang, Haiying Chen, David A. Bluemke, Joao A. C. Lima, Gregory L. Burke, Elsayed Z. Soliman, Detection of Left Ventricular Hypertrophy Using Bayesian Additive Regression Trees: The MESA, Journal of the American Heart Association, 10.1161/JAHA.118.009959, 8, 5, (2019).
- Yaoyuan Vincent Tan, Jason Roy, Bayesian additive regression trees and the General BART model, Statistics in Medicine, 10.1002/sim.8347, 38, 25, (5048-5069), (2019).
- Kazeem Adesina Dauda, Biswabrata Pradhan, B. Uma Shankar, Sushmita Mitra, Decision tree for modeling survival data with competing risks, Biocybernetics and Biomedical Engineering, 10.1016/j.bbe.2019.05.001, 39, 3, (697-708), (2019).
- Daniele Bottigliengo, Paola Berchialla, Corrado Lanera, Danila Azzolina, Giulia Lorenzoni, Matteo Martinato, Daniela Giachino, Ileana Baldi, Dario Gregori, The Role of Genetic Factors in Characterizing Extra-Intestinal Manifestations in Crohn’s Disease Patients: Are Bayesian Machine Learning Methods Improving Outcome Predictions?, Journal of Clinical Medicine, 10.3390/jcm8060865, 8, 6, (865), (2019).
- Rodney Sparapani, Brent R Logan, Robert E McCulloch, Purushottam W Laud, Nonparametric competing risks analysis using Bayesian Additive Regression Trees, Statistical Methods in Medical Research, 10.1177/0962280218822140, (096228021882214), (2019).
- David Watson, The Rhetoric and Reality of Anthropomorphism in Artificial Intelligence, Minds and Machines, 10.1007/s11023-019-09506-6, (2019).
- Yanxun Xu, Peter F. Thall, William Hua, Borje S. Andersson, Bayesian non‐parametric survival regression for optimizing precision dosing of intravenous busulfan in allogeneic stem cell transplantation, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/rssc.12331, 68, 3, (809-828), (2018).
- Yanxun Xu, Peter Müller, Apostolia M. Tsimberidou, Donald Berry, A nonparametric Bayesian basket trial design, Biometrical Journal, 10.1002/bimj.201700162, 61, 5, (1160-1174), (2018).
- Antonio R. Linero, Yun Yang, Bayesian regression tree ensembles that adapt to smoothness and sparsity, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12293, 80, 5, (1087-1110), (2018).
- Rashelle J. Musci, Hadi Kharrazi, Renee F. Wilson, Ryoko Susukida, Fardad Gharghabi, Allen Zhang, Lawrence Wissow, Karen A. Robinson, Holly C. Wilcox, The study of effect moderation in youth suicide-prevention studies, Social Psychiatry and Psychiatric Epidemiology, 10.1007/s00127-018-1574-2, 53, 12, (1303-1310), (2018).
- Rodney A Sparapani, Lisa E Rein, Sergey S Tarima, Tourette A Jackson, John R Meurer, Non-parametric recurrent events analysis with BART and an application to the hospital admissions of patients with diabetes, Biostatistics, 10.1093/biostatistics/kxy032, (2018).
- Nicholas C Henderson, Thomas A Louis, Gary L Rosner, Ravi Varadhan, Individualized treatment effects with censored data via fully nonparametric Bayesian accelerated failure time models, Biostatistics, 10.1093/biostatistics/kxy028, (2018).
- Thomas A. Murray, Ying Yuan, Peter F. Thall, A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes, Journal of the American Statistical Association, 10.1080/01621459.2017.1340887, 113, 523, (1255-1267), (2017).
- Antonio R. Linero, A review of tree-based Bayesian methods, Communications for Statistical Applications and Methods, 10.29220/CSAM.2017.24.6.543, 24, 6, (543-559), (2017).
- Brent R Logan, Rodney Sparapani, Robert E McCulloch, Purushottam W Laud, Decision making and uncertainty quantification for individualized treatments using Bayesian Additive Regression Trees, Statistical Methods in Medical Research, 10.1177/0962280217746191, (096228021774619), (2017).
- Munish Bhatia, Sandeep K. Sood, Internet of Things based activity surveillance of defence personnel, Journal of Ambient Intelligence and Humanized Computing, 10.1007/s12652-017-0507-3, (2017).
- Rodney Sparapani, Robert McCulloch, Bayesian Additive Regression Trees (BART), Wiley StatsRef: Statistics Reference Online, 10.1002/9781118445112, (1-9), (2014).
- Matthias Schmid, Moritz Berger, Competing risks analysis for discrete time‐to‐event data, WIREs Computational Statistics , 10.1002/wics.1529, 0, 0, (undefined).
- Arman Oganisian, Jason A. Roy, A practical introduction to Bayesian estimation of causal effects: Parametric and nonparametric approaches, Statistics in Medicine, 10.1002/sim.8761, 0, 0, (undefined).




