Abstract
The proportional hazards (PH) model is, arguably, the most popular model for the analysis of lifetime data arising from epidemiological studies, among many others. In such applications, analysts may be faced with censored outcomes and/or studies which institute enrollment criterion leading to left truncation. Censored outcomes arise when the event of interest is not observed but rather is known relevant to an observation time(s). Left truncated data occur in studies that exclude participants who have experienced the event prior to being enrolled in the study. If not accounted for, both of these features can lead to inaccurate inferences about the population under study. Thus, to overcome this challenge, herein we propose a novel unified PH model that can be used to accommodate both of these features. In particular, our approach can seamlessly analyze exactly observed failure times along with interval-censored observations, while aptly accounting for left truncation. To facilitate model fitting, an expectation–maximization algorithm is developed through the introduction of carefully structured latent random variables. To provide modeling flexibility, a monotone spline representation is used to approximate the cumulative baseline hazard function. The performance of our methodology is evaluated through a simulation study and is further illustrated through the analysis of two motivating data sets; one that involves child mortality in Nigeria and the other prostate cancer.
Similar content being viewed by others
References
Afzal AR, Dong C, Lu X (2017) Estimation of partly linear additive hazards model with left-truncated and right-censored data. Stat Model 17(6):423–448
Alioum A, Commenges D (1996) A proportional hazards model for arbitrarily censored and truncated data. Biometrics 52:512–524
Anderson-Bergman C (2017) icenReg: regression models for interval censored data in R. J Stat Softw 81(1):1–23
Andriole GL, Crawford ED, Grubb RL III, Buys SS, Chia D, Church TR, Fouad MN, Gelmann EP, Kvale PA, Reding DJ et al (2009) Mortality results from a randomized prostate-cancer screening trial. N Engl J Med 360(13):1310–1319
Andriole GL, Crawford ED, Grubb RL III, Buys SS, Chia D, Church TR, Fouad MN, Isaacs C, Kvale PA, Reding DJ et al (2012) Prostate cancer screening in the randomized prostate, lung, colorectal, and ovarian cancer screening trial: mortality results after 13 years of follow-up. J Natl Cancer Inst 104(2):125–132
Cai T, Betensky RA (2003) Hazard regression for interval-censored data with penalized spline. Biometrics 59(3):570–579
Cai B, Lin X, Wang L (2011) Bayesian proportional hazards model for current status data with monotone splines. Comput Stat Data Anal 55(9):2644–2651
Chen CM, Shen PS (2018) Conditional maximum likelihood estimation in semiparametric transformation model with LTRC data. Lifetime Data Anal 24(2):250–272
Chen DGD, Sun J, Peace KE (2012) Interval-censored time-to-event data: methods and applications. CRC Press, Boca Raton
Chen CM, Shen PS, Wei JCC, Lin L (2017) A semiparametric mixture cure survival model for left-truncated and right-censored data. Biom J 59(2):270–290
Cox DR (1972) Regression models and life-tables. J R Stat Soc: Ser B (Methodol) 34(2):187–202
Cox DR (1975) Partial likelihood. Biometrika 62(2):269–276
Datta S, Satten GA, Williamson JM (2000) Consistency and asymptotic normality of estimators in a proportional hazards model with interval censoring and left truncation. Ann Inst Stat Math 52(1):160–172
Doehler K, Davidian M (2008) ‘Smooth’inference for survival functions with arbitrarily censored data. Stat Med 27(26):5421–5439
Finkelstein DM (1986) A proportional hazards model for interval-censored failure time data. Biometrics 42:845–854
Goggins WB, Finkelstein DM, Schoenfeld DA, Zaslavsky AM (1998) A Markov chain Monte Carlo EM algorithm for analyzing interval-censored data under the cox proportional hazards model. Biometrics 54:1498–1507
Gross ST, Lai TL (1996) Nonparametric estimation and regression analysis with left-truncated and right-censored data. J Am Stat Assoc 91(435):1166–1180
Harrell FE (2015) Cox proportional hazards regression model. In: Regression modeling strategies. Springer, pp 475–519
Huang CY, Qin J (2013) Semiparametric estimation for the additive hazards model with left-truncated and right-censored data. Biometrika 100(4):877–888
Huber-Carol C, Vonta I (2004) Frailty models for arbitrarily censored and truncated data. Lifetime Data Anal 10(4):369–388
Johnson ME, Tolley HD, Bryson MC, Goldman AS (1982) Covariate analysis of survival data: a small-sample study of cox’s model. Biometrics 38:685–698
Joly P, Commenges D, Letenneur L (1998) A penalized likelihood approach for arbitrarily censored and truncated data: application to age-specific incidence of dementia. Biometrics 54:185–194
Kay R (1979) Some further asymptotic efficiency calculations for survival data regression models. Biometrika 66(1):91–96
Kim JS (2003) Efficient estimation for the proportional hazards model with left-truncated and" case 1" interval-censored data. Stat Sin 13:519–537
Kim M, Paik MC, Jang J, Cheung YK, Willey J, Elkind MS, Sacco RL (2017) Cox proportional hazards models with left truncation and time-varying coefficient: application of age at event as outcome in cohort studies. Biom J 59(3):405–419
Komárek A, Lesaffre E, Hilton JF (2005) Accelerated failure time model for arbitrarily censored data with smoothed error distribution. J Comput Graph Stat 14(3):726–745
Li J, Ma S (2013) Survival analysis in medicine and genetics. CRC Press, Boca Raton
Li C, Pak D, Todem D (2020) Adaptive lasso for the cox regression with interval censored and possibly left truncated data. Stat Methods Med Res 29(4):1243–1255
Lin X (2017) A Bayesian semiparametric accelerated failure time model for arbitrarily censored data with covariates subject to measurement error. Commun. Stat. Simul. Comput. 46(1):747–756
Lin X, Cai B, Wang L, Zhang Z (2015) A Bayesian proportional hazards model for general interval-censored data. Lifetime Data Anal 21(3):470–490
Link CL (1984) Confidence intervals for the survival function using cox’s proportional-hazard model with covariates. Biometrics 40:601–609
Liu X (2012) Survival analysis: models and applications. Wiley, New York
Liu H, Shen Y (2009) A semiparametric regression cure model for interval-censored data. J Am Stat Assoc 104(487):1168–1178
Lu M, McMahan CS (2018) A partially linear proportional hazards model for current status data. Biometrics 74(4):1240–1249
McMahan CS, Wang L, Tebbs JM (2013) Regression analysis for current status data using the EM algorithm. Stat Med 32(25):4452–4466
Pan W (1999) Extending the iterative convex minorant algorithm to the cox model for interval-censored data. J Comput Graph Stat 8(1):109–120
Pan W (2000) A multiple imputation approach to cox regression with interval-censored data. Biometrics 56(1):199–203
Pan W, Chappell R (1998) A nonparametric estimator of survival functions for arbitrarily truncated and censored data. Lifetime Data Anal 4(2):187–202
Pan C, Cai B, Wang L (2020) A Bayesian approach for analyzing partly interval-censored data under the proportional hazards model. Stat Methods Med Res 29(11):3192–3204
Pu Z, Li L (1999) Regression models with arbitrarily interval-censored observations. Commun Stat Theory Methods 28(7):1547–1563
Ramsay JO (1988) Monotone regression splines in action. Stat Sci 3:425–441
Rondeau V, Mazroui Y, Gonzalez JR (2012) frailtypack: an R package for the analysis of correlated survival data with frailty models using penalized likelihood estimation or parametrical estimation. J Stat Softw 47(4):1–28
Satten GA (1996) Rank-based inference in the proportional hazards model for interval censored data. Biometrika 83(2):355–370
Shen PS (2009) Semiparametric analysis of survival data with left truncation and right censoring. Comput Stat Data Anal 53(12):4417–4432
Shen PS (2020) Nonparametric estimators of survival function under the mixed case interval-censored model with left truncation. Lifetime Data Anal 26(3):624–637
Shen PS, Weng LN (2019) The Cox–Aalen model for left-truncated and mixed interval-censored data. Statistics 53(5):1152–1167
Shen PS, Chen HJ, Pan WH, Chen CM (2019) Semiparametric regression analysis for left-truncated and interval-censored data without or with a cure fraction. Comput Stat Data Anal 140:74–87
Singh RS, Totawattage DP (2013) The statistical analysis of interval-censored failure time data with applications. Open J Stat 3(2):12
Sun J (2007) The statistical analysis of interval-censored failure time data. Springer, Berlin
Therneau TM, Grambsch PM (2000) The Cox model. In: Modeling survival data: extending the Cox model. Springer, pp 39–77
Tsiatis AA (1981) A large sample study of cox’s regression model. Ann Stat 9(1):93–108
Turnbull BW (1976) The empirical distribution function with arbitrarily grouped, censored and truncated data. J R Stat Soc Ser B (Methodol) 38(3):290–295
Wang L, Wang L (2021) Regression analysis of arbitrarily censored survival data under the proportional odds model. Stat Med 40(16):3724–3739
Wang N, Wang L, McMahan CS (2015a) Regression analysis of bivariate current status data under the gamma-frailty proportional hazards model using the EM algorithm. Comput Stat Data Anal 83:140–150
Wang P, Tong X, Zhao S, Sun J (2015b) Regression analysis of left-truncated and case i interval-censored data with the additive hazards model. Commun Stat Theory Methods 44(8):1537–1551
Wang L, McMahan CS, Hudgens MG, Qureshi ZP (2016) A flexible, computationally efficient method for fitting the proportional hazards model to interval-censored data. Biometrics 72(1):222–231
Wang P, Li D, Sun J (2021) A pairwise pseudo-likelihood approach for left-truncated and interval-censored data under the Cox model. Biometrics 77(4):1303–1314
Zhang M, Davidian M (2008) “Smooth’’ semiparametric regression analysis for arbitrarily censored time-to-event data. Biometrics 64(2):567–576
Zhang Z, Sun J (2010) Interval censoring. Stat Methods Med Res 19(1):53–70
Zhou H, Hanson T (2018) A unified framework for fitting Bayesian semiparametric models to arbitrarily censored survival data, including spatially referenced data. J Am Stat Assoc 113(522):571–581
Zhou H, Hanson T, Zhang J (2017a) Generalized accelerated failure time spatial frailty model for arbitrarily censored data. Lifetime Data Anal 23(3):495–515
Zhou H, Hanson T, Zhang J (2017b) spBayesSurv: fitting Bayesian spatial survival models using R. ar**v:1705.04584
Acknowledgements
Dr. McMahan acknowledges support of Grant R01 AI121351 from the National Institutes of Health, Grant OIA-1826715 from the National Science Foundation, and Grant N00014-19-1-2295 from the Office of Naval Research. Dr. Wang acknowledges support of R01CA218578 from the National Institutes of Health.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Withana Gamage, P.W., McMahan, C.S. & Wang, L. A flexible parametric approach for analyzing arbitrarily censored data that are potentially subject to left truncation under the proportional hazards model. Lifetime Data Anal 29, 188–212 (2023). https://doi.org/10.1007/s10985-022-09579-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10985-022-09579-z