Abstract
Times between consecutive events are often of interest in medical studies. Usually the events represent different states of the disease process and are modeled using multi-state models. This paper introduces and studies a feasible estimation method for the transition probabilities in a progressive three-state model. We assume that the vector of gap times \((T_1,T_2)\) satisfies a nonparametric location-scale regression model \(T_2=m(T_1)+\sigma (T_1)\epsilon \), where the functions \(m\) and \(\sigma \) are ‘smooth’ and \(\epsilon \) is independent of \(T_1\). Under this model, Van Keilegom et al. (J Stat Plan Inference 141:1118–1131, 2011) proposed estimators of the transition probabilities. However, the important issue of automatic bandwidth choice in this setting has not been examined, which makes the analysis of real datasets rather difficult. In this paper we study the performance of their estimator in practice, propose some modifications, and address practical issues in its implementation, in particular the choice of an appropriate bandwidth. An extensive simulation study shows that the method performs well and that the proposed estimator compares favorably with alternative estimators. Finally, the proposed methodology is illustrated with a real dataset on breast cancer.
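To make the role of the bandwidth concrete, the following minimal Python sketch simulates uncensored gap times from a location-scale model of the form \(T_2=m(T_1)+\sigma (T_1)\epsilon \) and picks a bandwidth for a kernel estimate of \(m\) by leave-one-out cross-validation. Everything specific here is an assumption made for illustration: the functions `m` and `sigma`, the uniform distributions, the Gaussian kernel, the bandwidth grid, and the cross-validation criterion itself. It ignores censoring and is not the estimator or the bandwidth selector studied in the paper.

```python
import math
import random


def m(t1):
    """Hypothetical location function (an assumption for illustration)."""
    return 2.0 + t1


def sigma(t1):
    """Hypothetical scale function (an assumption for illustration)."""
    return 0.5 + 0.5 * t1


def simulate(n, seed=0):
    """Draw uncensored pairs (T1, T2) with T2 = m(T1) + sigma(T1) * eps."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        t1 = rng.uniform(0.0, 1.0)
        eps = rng.uniform(-1.0, 1.0)  # error drawn independently of T1
        pairs.append((t1, m(t1) + sigma(t1) * eps))
    return pairs


def nw_mean(pairs, x, h, skip=None):
    """Nadaraya-Watson estimate of m(x) with a Gaussian kernel and bandwidth h."""
    num = den = 0.0
    for j, (t1, t2) in enumerate(pairs):
        if j == skip:  # leave this observation out when requested
            continue
        w = math.exp(-0.5 * ((t1 - x) / h) ** 2)
        num += w * t2
        den += w
    return num / den if den > 0.0 else float("nan")


def loo_cv_score(pairs, h):
    """Mean squared leave-one-out prediction error for bandwidth h."""
    errors = [(t2 - nw_mean(pairs, t1, h, skip=i)) ** 2
              for i, (t1, t2) in enumerate(pairs)]
    score = sum(errors) / len(errors)
    # A bandwidth so small that all weights underflow is treated as unusable.
    return math.inf if math.isnan(score) else score


def select_bandwidth(pairs, grid):
    """Return the grid value with the smallest leave-one-out score."""
    return min(grid, key=lambda h: loo_cv_score(pairs, h))


if __name__ == "__main__":
    data = simulate(150, seed=1)
    h_best = select_bandwidth(data, [0.02, 0.05, 0.1, 0.2, 0.5])
    print("selected bandwidth:", h_best)
    print("estimate of m(0.5):", nw_mean(data, 0.5, h_best))
```

A small bandwidth tracks local fluctuations at the cost of higher variance, while a large one smooths more heavily and can bias the fit, which is why a data-driven choice matters when no obvious value is available.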
References
Aalen OO, Johansen S (1978) An empirical transition matrix for nonhomogeneous Markov chains based on censored observations. Scand J Stat 5:141–150
Amorim AP, de Uña-Álvarez J, Meira-Machado LF (2011) Presmoothing the transition probabilities in the illness-death model. Stat Probab Lett 81:797–806
Andersen PK, Borgan O, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer, New York
Beran R (1981) Nonparametric regression with randomly censored survival data. Technical report. University of California, Berkeley
Cadarso-Suárez C, Meira-Machado LF, Kneib T, Gude F (2010) Flexible hazard ratio curves for continuous predictors in multi-state models: a P-spline approach. Stat Model 10:291–314
Chavez-Uribe E, Cameselle-Teijeiro J, Vinuela JE et al (2007) Hypoploidy defines patients with poor prognosis in breast cancer. Oncol Rep 17:1109–1114
Dabrowska DM (1992) Variable bandwidth conditional Kaplan-Meier estimate. Scand J Stat 19:351–361
Hougaard P (2000) Analysis of multivariate survival data. Springer, New York
Kaplan EL, Meier P (1958) Nonparametric estimation from incomplete observations. J Am Stat Assoc 53:457–481
Li G, Datta S (2001) A bootstrap approach to nonparametric regression for right censored data. Ann Inst Stat Math 53:708–729
Meira-Machado LF, de Uña-Álvarez J, Cadarso-Suárez C (2006) Nonparametric estimation of transition probabilities in a non-Markov illness-death model. Lifetime Data Anal 12:325–344
Van Keilegom I, Akritas M (1999) Transfer of tail information in censored regression models. Ann Stat 27:1745–1784
Van Keilegom I, Akritas M, Veraverbeke N (2001) Estimation of the conditional distribution in regression with censored data: a comparative study. Comput Stat Data Anal 35:487–500
Van Keilegom I, de Uña-Álvarez J, Meira-Machado LF (2011) Nonparametric location-scale models for successive survival times under dependent censoring. J Stat Plan Inference 141:1118–1131
Acknowledgments
Luís Meira-Machado acknowledges financial support from FEDER Funds through “Programa Operacional Factores de Competitividade—COMPETE” and by Portuguese Funds through FCT—“Fundação para a Ciência e a Tecnologia”, in the form of grants PTDC/MAT/104879/2008 and Est-C/MAT/UI0013/2011. Luís Meira-Machado and Carmen Cadarso-Suárez acknowledge the support received by the Spanish Ministry of Industry and Innovation, Grant MTM2011-28285-C02-01. Roca-Pardiñas acknowledges financial support from research Grants MTM2011-23204 (FEDER support included) of the Spanish Ministry of Industry and Innovation, and Galician Regional Authority (Xunta de Galicia) grant 10PXIB300068PR. Van Keilegom acknowledges financial support from the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007-2013) / ERC Grant agreement No. 203650, from IAP research network P7/06 of the Belgian Government (Belgian Science Policy), and from the contract ‘Projet d’Actions de Recherche Concertées’ (ARC) 11/16-039 of the ‘Communauté française de Belgique’, granted by the ‘Académie universitaire Louvain’. We are also grateful to both the associate editor and the two peer referees for their valuable comments and suggestions, which served to make a substantial improvement to this paper.
Cite this article
Meira-Machado, L., Roca-Pardiñas, J., Van Keilegom, I. et al. Bandwidth selection for the estimation of transition probabilities in the location-scale progressive three-state model. Comput Stat 28, 2185–2210 (2013). https://doi.org/10.1007/s00180-013-0402-0
DOI: https://doi.org/10.1007/s00180-013-0402-0