Volume 25, Issue 20
Research Article

Survival analysis using auxiliary variables via non‐parametric multiple imputation

Chiu‐Hsieh Hsu

Corresponding Author

E-mail address: phsu@azcc.arizona.edu

Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health and Arizona Cancer Center, University of Arizona, Tucson, AZ 85724‐5024, U.S.A.

Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health and Arizona Cancer Center, University of Arizona, 1515 N Campbell, PO Box 245024, Tucson, AZ 85724‐5024, U.S.A.Search for more papers by this author
Jeremy M. G. Taylor

Department of Biostatistics, School of Public Health, University of Michigan, 1420 Washington Heights, Ann Arbor, MI 48109, U.S.A.

Search for more papers by this author
Susan Murray

Department of Biostatistics, School of Public Health, University of Michigan, 1420 Washington Heights, Ann Arbor, MI 48109, U.S.A.

Search for more papers by this author
Daniel Commenges

INSERM E0338 Biostatistics, ISPED, Bordeaux 2 University, Bordeaux 33000, France

Search for more papers by this author
First published: 12 December 2005
Citations: 37

Abstract

We develop an approach, based on multiple imputation, that estimates the marginal survival distribution in survival analysis using auxiliary variables to recover information for censored observations. To conduct the imputation, we use two working survival models to define a nearest neighbour imputing risk set. One model is for the event times and the other for the censoring times. Based on the imputing risk set, two non‐parametric multiple imputation methods are considered: risk set imputation, and Kaplan–Meier imputation. For both methods a future event or censoring time is imputed for each censored observation. With a categorical auxiliary variable, we show that with a large number of imputes the estimates from the Kaplan–Meier imputation method correspond to the weighted Kaplan–Meier estimator. We also show that the Kaplan–Meier imputation method is robust to mis‐specification of either one of the two working models. In a simulation study with time independent and time‐dependent auxiliary variables, we compare the multiple imputation approaches with an inverse probability of censoring weighted method. We show that all approaches can reduce bias due to dependent censoring and improve the efficiency. We apply the approaches to AIDS clinical trial data comparing ZDV and placebo, in which CD4 count is the time‐dependent auxiliary variable. Copyright © 2005 John Wiley & Sons, Ltd.

Number of times cited according to CrossRef: 37

  • Weighted Kaplan‐Meier estimators motivating to estimate HIV‐1 RNA reduction censored by a limit of detection, Statistics in Medicine, 10.1002/sim.8455, 39, 7, (968-983), (2020).
  • Nonparametric estimation of broad sense agreement between ordinal and censored continuous outcomes, Statistics in Medicine, 10.1002/sim.8523, 39, 14, (1952-1964), (2020).
  • Regression analysis of recurrent‐event‐free time from multiple follow‐up windows, Statistics in Medicine, 10.1002/sim.8385, 39, 1, (1-15), (2019).
  • A semiparametric imputation approach for regression with censored covariate with application to an AMD progression study, Statistics in Medicine, 10.1002/sim.7816, 37, 23, (3293-3308), (2018).
  • Identifying treatment responders using counterfactual modeling and potential outcomes, Statistical Methods in Medical Research, 10.1177/0962280218804569, (096228021880456), (2018).
  • 3D-MICE: integration of cross-sectional and longitudinal imputation for multi-analyte longitudinal clinical data, Journal of the American Medical Informatics Association, 10.1093/jamia/ocx133, 25, 6, (645-653), (2017).
  • Incidence rate estimation, periodic testing and the limitations of the mid-point imputation approach, International Journal of Epidemiology, 10.1093/ije/dyx134, 47, 1, (236-245), (2017).
  • Statistical consequences of a successful lung allocation system – recovering information and reducing bias in models for urgency, Statistics in Medicine, 10.1002/sim.7283, 36, 15, (2435-2451), (2017).
  • Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data, Scientific Reports, 10.1038/srep21689, 6, 1, (2016).
  • Selection Bias Due to Loss to Follow Up in Cohort Studies, Epidemiology, 10.1097/EDE.0000000000000409, 27, 1, (91-97), (2016).
  • Joint Modeling of Survival and Longitudinal Ordered Data Using a Semiparametric Approach, Australian & New Zealand Journal of Statistics, 10.1111/anzs.12153, 58, 2, (153-172), (2016).
  • Joint multiple imputation for longitudinal outcomes and clinical events that truncate longitudinal follow‐up, Statistics in Medicine, 10.1002/sim.6590, 35, 17, (2991-3006), (2015).
  • Analysis of accelerated failure time data with dependent censoring using auxiliary variables via nonparametric multiple imputation, Statistics in Medicine, 10.1002/sim.6534, 34, 19, (2768-2780), (2015).
  • Are All Biases Missing Data Problems?, Current Epidemiology Reports, 10.1007/s40471-015-0050-8, 2, 3, (162-171), (2015).
  • A model‐informed rank test for right‐censored data with intermediate states, Statistics in Medicine, 10.1002/sim.6417, 34, 9, (1454-1466), (2015).
  • Tuning multiple imputation by predictive mean matching and local residual draws, BMC Medical Research Methodology, 10.1186/1471-2288-14-75, 14, 1, (2014).
  • Analysis of transplant urgency and benefit via multiple imputation, Statistics in Medicine, 10.1002/sim.6250, 33, 26, (4655-4670), (2014).
  • Risk-stratified imputation in survival analysis, Clinical Trials: Journal of the Society for Clinical Trials, 10.1177/1740774513493150, 10, 4, (530-539), (2013).
  • Research Note —Generating Shareable Statistical Databases for Business Value: Multiple Imputation with Multimodal Perturbation , Information Systems Research, 10.1287/isre.1110.0361, 23, 2, (559-574), (2012).
  • References, Flexible Imputation of Missing Data, 10.1201/b11826-16, (2012).
  • Bibliography, Multiple Imputation and its Application, 10.1002/9781119942283, (316-326), (2012).
  • Using cure models and multiple imputation to utilize recurrence as an auxiliary variable for overall survival, Clinical Trials: Journal of the Society for Clinical Trials, 10.1177/1740774511414741, 8, 5, (581-590), (2011).
  • Multiple imputation based on restricted mean model for censored data, Statistics in Medicine, 10.1002/sim.4163, 30, 12, (1339-1350), (2011).
  • Estimating Mean Cost Using Auxiliary Covariates, Biometrics, 10.1111/j.1541-0420.2010.01540.x, 67, 3, (996-1006), (2011).
  • An Audit Strategy for Progression‐Free Survival, Biometrics, 10.1111/j.1541-0420.2010.01539.x, 67, 3, (1092-1099), (2011).
  • Limitation of Inverse Probability-of-Censoring Weights in Estimating Survival in the Presence of Strong Selection Bias, American Journal of Epidemiology, 10.1093/aje/kwq385, 173, 5, (569-577), (2011).
  • The authors replied as follows:, Biometrics, 10.1111/j.1541-0420.2009.01324_2.x, 66, 3, (987-991), (2010).
  • Multiple imputation for estimation of an occurrence rate in cohorts with attrition and discrete follow-up time points: a simulation study, BMC Medical Research Methodology, 10.1186/1471-2288-10-79, 10, 1, (2010).
  • A robust weighted Kaplan–Meier approach for data with dependent censoring using linear combinations of prognostic covariates, Statistics in Medicine, 10.1002/sim.3969, 29, 21, (2215-2223), (2010).
  • Predicting treatment effects using biomarker data in a meta‐analysis of clinical trials, Statistics in Medicine, 10.1002/sim.3931, 29, 18, (1875-1889), (2010).
  • The authors replied as follows: , Biometrics, 10.1111/j.1541-0420.2009.01325.x, (2009).
  • Estimation of colorectal adenoma recurrence with dependent censoring, BMC Medical Research Methodology, 10.1186/1471-2288-9-66, 9, 1, (2009).
  • Nonparametric comparison of two survival functions with dependent censoring via nonparametric multiple imputation, Statistics in Medicine, 10.1002/sim.3480, 28, 3, (462-475), (2008).
  • Analyses of cumulative incidence functions via non‐parametric multiple imputation, Statistics in Medicine, 10.1002/sim.3402, 27, 27, (5709-5724), (2008).
  • Visualizing Length of Survival in Time-to-Event Studies: A Complement to Kaplan Meier Plots, JNCI Journal of the National Cancer Institute, 10.1093/jnci/djm265, 100, 2, (92-97), (2008).
  • Variable selection strategies in survival models with multiple imputations, Lifetime Data Analysis, 10.1007/s10985-007-9050-4, 13, 3, (295-315), (2007).
  • Multiple imputation for interval censored data with auxiliary variables, Statistics in Medicine, 10.1002/sim.2581, 26, 4, (769-781), (2006).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.