Survival analysis using auxiliary variables via non‐parametric multiple imputation
Abstract
We develop an approach, based on multiple imputation, that estimates the marginal survival distribution in survival analysis using auxiliary variables to recover information for censored observations. To conduct the imputation, we use two working survival models to define a nearest neighbour imputing risk set. One model is for the event times and the other for the censoring times. Based on the imputing risk set, two non‐parametric multiple imputation methods are considered: risk set imputation, and Kaplan–Meier imputation. For both methods a future event or censoring time is imputed for each censored observation. With a categorical auxiliary variable, we show that with a large number of imputes the estimates from the Kaplan–Meier imputation method correspond to the weighted Kaplan–Meier estimator. We also show that the Kaplan–Meier imputation method is robust to mis‐specification of either one of the two working models. In a simulation study with time independent and time‐dependent auxiliary variables, we compare the multiple imputation approaches with an inverse probability of censoring weighted method. We show that all approaches can reduce bias due to dependent censoring and improve the efficiency. We apply the approaches to AIDS clinical trial data comparing ZDV and placebo, in which CD4 count is the time‐dependent auxiliary variable. Copyright © 2005 John Wiley & Sons, Ltd.
Citing Literature
Number of times cited according to CrossRef: 37
- Ismaïl Ahmed, Philippe Flandre, Weighted Kaplan‐Meier estimators motivating to estimate HIV‐1 RNA reduction censored by a limit of detection, Statistics in Medicine, 10.1002/sim.8455, 39, 7, (968-983), (2020).
- Tian Dai, Ying Guo, Limin Peng, Amita Manatunga, Nonparametric estimation of broad sense agreement between ordinal and censored continuous outcomes, Statistics in Medicine, 10.1002/sim.8523, 39, 14, (1952-1964), (2020).
- Meng Xia, Susan Murray, Nabihah Tayob, Regression analysis of recurrent‐event‐free time from multiple follow‐up windows, Statistics in Medicine, 10.1002/sim.8385, 39, 1, (1-15), (2019).
- Ying Ding, Shengchun Kong, Shan Kang, Wei Chen, A semiparametric imputation approach for regression with censored covariate with application to an AMD progression study, Statistics in Medicine, 10.1002/sim.7816, 37, 23, (3293-3308), (2018).
- Raphaël Porcher, Justine Jacot, Jay S Wunder, David J Biau, Identifying treatment responders using counterfactual modeling and potential outcomes, Statistical Methods in Medical Research, 10.1177/0962280218804569, (096228021880456), (2018).
- Yuan Luo, Peter Szolovits, Anand S Dighe, Jason M Baron, 3D-MICE: integration of cross-sectional and longitudinal imputation for multi-analyte longitudinal clinical data, Journal of the American Medical Informatics Association, 10.1093/jamia/ocx133, 25, 6, (645-653), (2017).
- Alain Vandormael, Adrian Dobra, Till Bärnighausen, Tulio de Oliveira, Frank Tanser, Incidence rate estimation, periodic testing and the limitations of the mid-point imputation approach, International Journal of Epidemiology, 10.1093/ije/dyx134, 47, 1, (236-245), (2017).
- Nabihah Tayob, Susan Murray, Statistical consequences of a successful lung allocation system – recovering information and reducing bias in models for urgency, Statistics in Medicine, 10.1002/sim.7283, 36, 15, (2435-2451), (2017).
- Yi Deng, Changgee Chang, Moges Seyoum Ido, Qi Long, Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data, Scientific Reports, 10.1038/srep21689, 6, 1, (2016).
- Chanelle J. Howe, Stephen R. Cole, Bryan Lau, Sonia Napravnik, Joseph J. Eron, Selection Bias Due to Loss to Follow Up in Cohort Studies, Epidemiology, 10.1097/EDE.0000000000000409, 27, 1, (91-97), (2016).
- Kemmawadee Preedalikit, Ivy Liu, Yuichi Hirose, Nokuthaba Sibanda, Daniel Fernández, Joint Modeling of Survival and Longitudinal Ordered Data Using a Semiparametric Approach, Australian & New Zealand Journal of Statistics, 10.1111/anzs.12153, 58, 2, (153-172), (2016).
- Bo Hu, Liang Li, Tom Greene, Joint multiple imputation for longitudinal outcomes and clinical events that truncate longitudinal follow‐up, Statistics in Medicine, 10.1002/sim.6590, 35, 17, (2991-3006), (2015).
- Chiu‐Hsieh Hsu, Jeremy M. G. Taylor, Chengcheng Hu, Analysis of accelerated failure time data with dependent censoring using auxiliary variables via nonparametric multiple imputation, Statistics in Medicine, 10.1002/sim.6534, 34, 19, (2768-2780), (2015).
- Chanelle J. Howe, Lauren E. Cain, Joseph W. Hogan, Are All Biases Missing Data Problems?, Current Epidemiology Reports, 10.1007/s40471-015-0050-8, 2, 3, (162-171), (2015).
- Ritesh Ramchandani, Dianne M. Finkelstein, David A. Schoenfeld, A model‐informed rank test for right‐censored data with intermediate states, Statistics in Medicine, 10.1002/sim.6417, 34, 9, (1454-1466), (2015).
- Tim P Morris, Ian R White, Patrick Royston, Tuning multiple imputation by predictive mean matching and local residual draws, BMC Medical Research Methodology, 10.1186/1471-2288-14-75, 14, 1, (2014).
- Fang Xiang, Susan Murray, Xiaohong Liu, Analysis of transplant urgency and benefit via multiple imputation, Statistics in Medicine, 10.1002/sim.6250, 33, 26, (4655-4670), (2014).
- Richard E Kennedy, Kofi P Adragni, Hemant K Tiwari, Jenifer H Voeks, Thomas G Brott, George Howard, Risk-stratified imputation in survival analysis, Clinical Trials: Journal of the Society for Clinical Trials, 10.1177/1740774513493150, 10, 4, (530-539), (2013).
- Nigel Melville, Michael McQuaid, Research Note —Generating Shareable Statistical Databases for Business Value: Multiple Imputation with Multimodal Perturbation , Information Systems Research, 10.1287/isre.1110.0361, 23, 2, (559-574), (2012).
- Stef Buuren, References, Flexible Imputation of Missing Data, 10.1201/b11826-16, (2012).
- James R. Carpenter, Michael G. Kenward, Bibliography, Multiple Imputation and its Application, 10.1002/9781119942283, (316-326), (2012).
- Anna SC Conlon, Jeremy MG Taylor, Daniel J Sargent, Greg Yothers, Using cure models and multiple imputation to utilize recurrence as an auxiliary variable for overall survival, Clinical Trials: Journal of the Society for Clinical Trials, 10.1177/1740774511414741, 8, 5, (581-590), (2011).
- Lyrica Xiaohong Liu, Susan Murray, Alex Tsodikov, Multiple imputation based on restricted mean model for censored data, Statistics in Medicine, 10.1002/sim.4163, 30, 12, (1339-1350), (2011).
- Wenqin Pan, Donglin Zeng, Estimating Mean Cost Using Auxiliary Covariates, Biometrics, 10.1111/j.1541-0420.2010.01540.x, 67, 3, (996-1006), (2011).
- Lori E. Dodd, Edward L. Korn, Boris Freidlin, Robert Gray, Suman Bhattacharya, An Audit Strategy for Progression‐Free Survival, Biometrics, 10.1111/j.1541-0420.2010.01539.x, 67, 3, (1092-1099), (2011).
- Chanelle J. Howe, Stephen R. Cole, Joan S. Chmiel, Alvaro Muñoz, Limitation of Inverse Probability-of-Censoring Weights in Estimating Survival in the Presence of Strong Selection Bias, American Journal of Epidemiology, 10.1093/aje/kwq385, 173, 5, (569-577), (2011).
- Wen Ye, Jeremy M.G. Taylor, Xihong Lin, The authors replied as follows:, Biometrics, 10.1111/j.1541-0420.2009.01324_2.x, 66, 3, (987-991), (2010).
- Noémie Soullier, Elise de La Rochebrochard, Jean Bouyer, Multiple imputation for estimation of an occurrence rate in cohorts with attrition and discrete follow-up time points: a simulation study, BMC Medical Research Methodology, 10.1186/1471-2288-10-79, 10, 1, (2010).
- Chiu‐Hsieh Hsu, Jeremy M. G. Taylor, A robust weighted Kaplan–Meier approach for data with dependent censoring using linear combinations of prognostic covariates, Statistics in Medicine, 10.1002/sim.3969, 29, 21, (2215-2223), (2010).
- Y. Li, J. M. G. Taylor, Predicting treatment effects using biomarker data in a meta‐analysis of clinical trials, Statistics in Medicine, 10.1002/sim.3931, 29, 18, (1875-1889), (2010).
- Wen Ye, Jeremy M.G. Taylor, Xihong Lin, The authors replied as follows: , Biometrics, 10.1111/j.1541-0420.2009.01325.x, (2009).
- Chiu-Hsieh Hsu, Qi Long, David S Alberts, Estimation of colorectal adenoma recurrence with dependent censoring, BMC Medical Research Methodology, 10.1186/1471-2288-9-66, 9, 1, (2009).
- Chiu‐Hsieh Hsu, Jeremy M. G. Taylor, Nonparametric comparison of two survival functions with dependent censoring via nonparametric multiple imputation, Statistics in Medicine, 10.1002/sim.3480, 28, 3, (462-475), (2008).
- Ping K. Ruan, Robert J. Gray, Analyses of cumulative incidence functions via non‐parametric multiple imputation, Statistics in Medicine, 10.1002/sim.3402, 27, 27, (5709-5724), (2008).
- P. Royston, M. K. B. Parmar, D. G. Altman, Visualizing Length of Survival in Time-to-Event Studies: A Complement to Kaplan Meier Plots, JNCI Journal of the National Cancer Institute, 10.1093/jnci/djm265, 100, 2, (92-97), (2008).
- Filia Vonta, Alex Karagrigoriou, Variable selection strategies in survival models with multiple imputations, Lifetime Data Analysis, 10.1007/s10985-007-9050-4, 13, 3, (295-315), (2007).
- Chiu‐Hsieh Hsu, Jeremy M. G. Taylor, Susan Murray, Daniel Commenges, Multiple imputation for interval censored data with auxiliary variables, Statistics in Medicine, 10.1002/sim.2581, 26, 4, (769-781), (2006).




