Statistical Methods for Analyzing Right‐Censored Length‐Biased Data under Cox Model
Abstract
Length‐biased time‐to‐event data are commonly encountered in applications ranging from epidemiological cohort studies and cancer prevention trials to studies in labor economics. A longstanding statistical problem is how to assess the association of risk factors with survival in the target population given only the observed length‐biased data. In this article, we demonstrate how to estimate these effects under the semiparametric Cox proportional hazards model. In general, the structure of the Cox model is not preserved under length‐biased sampling. Although the existing partial likelihood approach for left‐truncated data can be used to estimate covariate effects, it may not be efficient for analyzing length‐biased data. We propose two estimating equation approaches for estimating the covariate coefficients under the Cox model. We use modern stochastic process and martingale theory to develop the asymptotic properties of the estimators. We evaluate the empirical performance and efficiency of the two methods through extensive simulation studies. We use data from a dementia study to illustrate the proposed methodology, and demonstrate computational algorithms for the point estimates that can be directly linked to existing functions in S‐PLUS or R.
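To make the notion of length bias concrete, the following is a minimal sketch and not part of the article's methodology. It assumes a hypothetical target population of exponential survival times with mean 1 and draws a sample in which each duration is selected with probability proportional to its length. The helper name `length_biased_sample` and all parameter values are illustrative assumptions.

```python
import random


def length_biased_sample(times, k, rng):
    """Draw k durations, each chosen with probability proportional to its length."""
    return rng.choices(times, weights=times, k=k)


rng = random.Random(0)

# Hypothetical target population: exponential survival times with mean 1.
population = [rng.expovariate(1.0) for _ in range(100_000)]

# Observed sample under length-biased selection (longer durations are over-represented).
biased = length_biased_sample(population, 100_000, rng)

pop_mean = sum(population) / len(population)
biased_mean = sum(biased) / len(biased)

print(f"target population mean: {pop_mean:.2f}")
print(f"length-biased sample mean: {biased_mean:.2f}")
```

For a unit-rate exponential distribution, the length-biased mean is E[T^2]/E[T] = 2, so the second printed value should be roughly twice the first. A naive analysis of the observed durations would therefore overstate survival in the target population, which is why methods that account for the sampling scheme are needed.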