1 Introduction
Driven by applications, there is a constant interest in timetoevent analysis to extend the predictive models to situations where the lifetimes of interest suffer from complex censoring mechanisms. Here we consider the case where instead of the lifetime of interest one observes independent copies of a finite nonnegative duration and of a discrete variable such that
(1.1) 
Depending on the application, the inequality signs in (1.1) could be strict or not. Let us point out that the limit case where the event (resp.
) has zero probability corresponds to the usual random rightcensoring (resp. leftcensoring) setup, while the case where the probability of the event
is null corresponds to the current status framework.Let us assume that and let
be a vector of random covariates. All the random variables we consider are defined on some probability space
Although takes values only on the real line, we allow a positive probability for the event that is we allow for cured individuals (see, for instance Fang et al. (2005) and Zheng et al. (2006) and the references therein for the applications where infinity lifetimes could occur). Symmetrically, we also allow the zero lifetime to have positive probability, that is a zeroinflated law for could be taken into account (see Braekers & Growels (2015) for some motivations).Let be the support of
The conditional probability distribution of
given is characterized by the subdistributions functionsLet denote the associated measures. Moreover, let
be the unconditional versions of these subdistributions. Clearly,
The conditional distribution function of given is then
It is important to understand that, based on the data, one could only identify the conditional subdistributions . For identifying and consistently estimating the conditional law of the lifetime of interest , one should introduce some assumptions on the censoring mechanism. In other words, one has to consider a latent model. Several censoring mechanisms have proposed in the case without covariates. Turnbull (1974) considered two censoring times such that the case (resp. ) (resp. ) corresponds to the event and (resp. and ) (resp. and ). Patilea & Rolin (2006b) relaxed the condition and proposed two models that could be easily illustrated using simple electric circuits with three components connected in series and/or parallel. Patilea and Rolin (2006a) extended the standard rightcensoring (resp. leftcensoring) model by allowing uncensored lifetimes for which one only knows that are smaller (resp. larger) than the observation . This corresponds, for instance, to the case of a medical study where a disease is detected for a patient, but the onset time could not be determined from medical records, personal information, etc, while for other patients with the disease detected the onset time is available. The model of Turnbull does not allow to express the law of the lifetime of interest as an explicit function of the subdistributions , as it is the case for the models proposed by Patilea & Rolin (2006a, 2006b). Thus a numerical algorithm is necessary to compute Turnbull’s estimator. It is important to keep in mind that any of these latent models could be correct and useful for a specific application. The data does not allow to check the validity of the model. Turnbull’s model, perhaps the most popular model for data structures as we consider here, is not necessarily justified in applications where there is no natural interpretation of the variables and .
The aim of this paper is to extend the modeling of data as in equation (1.1) to the case where some covariates are available. Kim et al. (2010) extended Turnbull’s model to the case with covariates using a proportional hazard approach. Here we consider the extension of the approaches proposed by Patilea & Rolin (2006a) imposing the same proportional hazard assumption. More precisely, we propose two novel latent models for observed lifetimes as in (1.1) in the presence of covariates. Both models are well suited for data as in (1.1), and hence could be used in applications. The decision to use one of them, or the one proposed by Kim et al. (2010), could be made only on the basis of additional information on the application. Current status data corresponds to . Right (resp. left) censored data corresponds to (resp. ). This explains the terminology we propose for our models: modified Cox regressions with current status lifetimes. For each of the new models, we introduce a semiparametric estimator for the finitedimensional parameters, together with the corresponding baseline cumulative hazard functions estimators. Our estimators are easy to implement.
The paper is organized as follows. Our semiparametric models are introduced in section 2. They extend the standard right, respectively left, random censoring proportional hazard models. In section 3 we introduce the semiparametric estimators of the covariates coefficients, and the estimators of the cumulative hazard and survival functions. In particular, we provide an estimator for the cure rate and the zerolifetime probability. The theoretical results are presented
2 Censored and current status lifetimes
In our models we follow the idea of Cox’s semiparametric proportional hazard model. In both models we are able to express the baseline cumulative hazard function as a functional of distribution of the observations, characterized by the conditional subdistributions and the law of , and the coefficients of the covariates. This makes that the coefficients of the covariates could be estimated by maximizing an likelihoodtype criterion that is build as an explicit function of the observations. Thus the numerical aspects are very much simplified, compared to the model considered by Kim et al. (2010). With at hand the estimate of the finitedimensional parameters, we could easily build the estimator of the baseline cumulative hazard function. In particular, using the estimate of total mass of the baseline cumulative hazard, we provide a simple estimate of the conditional cure rate . Similarly, we could provide an estimator for the conditional zerolifetime probability . The extension to the case of mixture models, such as considered by Fang et al. (2005), where the cure rate or the zerolifetime could depend on possibly different set of covariates, is left for future work.
2.1 Rightcensoring case
Let be a random censoring time and be a Bernoulli random variable with success probability . Let and be the conditional distribution function and survivor function of given . Similarly, and denote the distribution function and the survivor function of . Following Patilea & Rolin (2006a), the latent model for is defined by:
Let us notice that is the classical rightcensoring limit case, while would correspond to the pure current status setup. The later limit case is not included in what follows since we assume In the case the observed outcome is not be influenced by the value of
For identification purposes we consider the following assumption.
A1: Assume that:

a) conditionally on , the latent variables and are independent;

b) and are independent.
The independence assumptions allow to write
(2.2) 
The system could be solved for the quantities and First, let us write
Since , we deduce
Integrating out the covariate and taking we could derive the simple representation
(2.3) 
Let us point out that one could replace the condition A1b) by the weaker condition that and are independent given and still write the equations (2.2) with replaced by some function of the covariates In this case one would derive the conditional version of the representation (2.3), but then the estimation of would require the estimation of the conditional versions of and For the sake of a simpler setup we suppose that does not depend on the covariates.
Next, we solve (2.2) for the conditional distribution of For this purpose we follow a proportional hazards model approach and we suppose that the risk function of given could be written as
(2.4) 
where is some unknown baseline hazard function and is a vector of unknown regression parameters. (Herein the vectors are matrix columns and denotes the transposed of ) With this assumption, for each and we could write
Hence,
Moreover,
As a consequence, for any such that ,
(2.5) 
Thus, the baseline cumulative hazard function could be expressed as a functional of the observed variables and the finitedimensional parameters of the model :
(2.6) 
The conditional survival function of the lifetime of interest can be expressed as
Herein, the notation means the productintegral over the interval , as formally defined in Gill & Johansen (1990). In particular, the conditional cure probability can be expressed as
2.2 Leftcensoring case
Let be a random censoring time and be a Bernoulli random variable with success probability . In this case the latent model for is defined by:
The case corresponds to the classical leftcensored data situation. Consider the assumptions A1a) and A1b). Then we can write
(2.7) 
This system also could be solved for the quantities and First, combining the first and the third equation, deduce
so that
Moreover, for each and we could write
where
is the conditional reverse hazard measure. The quantity could be interpreted as the conditional probability that the event occurs in the interval , given that the event occurs no later than . This measure has the property
In particular,
Inspired by the proportional hazards approach, let us consider that the conditional reverse hazard function of given could be written as
(2.8) 
where is some unknown baseline reverse hazard function and is a vector of unknown regression parameters.
Similar to the rightcensoring case, one can deduce
(2.9) 
and the baseline cumulative reverse hazard is obtained as .
3 Semiparametric likelihood estimation
Let , denote the observations that are independent copies of In the following, we consider with some positive integer. With observations of the covariates and of lifetimes as in (1.1), a natural likelihoodtype criterion is the one considered by Kim et al. (2010) :
(3.1) 
In this criterion, the factors involving the distribution of are dropped, as they are supposed uninformative.
To write the likelihoodtype criterion , we only used a hazard rate as in (2.4), without specifying any censoring mechanism or latent model. Alternatively, one could write the likelihood in terms of the cumulative reverse hazard we defined in section 2.2, using only the assumption (2.8). The two criteria are equivalent and would be valid for the type of data we consider. Next, one could follow the profiling idea. In the case where this leads to Cox’s partial likelihood with rightcensored data. See Murphy & van der Vaart (2000). A similar situation, Cox’s partial likelihood with leftcensored data, occurs when . Unfortunately, given a value , the maximization with respect to (or ) of does not have a nondegenerate, explicit solution when both and are positive. See Kim et al. (2010), the Remark on page 1341. A possible solution, proposed by Kim et al., would be to consider a numerical approximation. Here we propose an alternative, more convenient and sound route. To estimate the parameters of interest, one has to consider a model for the censoring mechanism. In the model considered by Kim et al. (2010), there is no way to connect the infinitedimensional parameter (or ) to the quantities that could be easily estimated from the data, such as . This makes the profiling approach complicated. The profiling approach is very appealing in the standard rightcensoring (resp. leftcensoring) case because there could be easily expressed in terms of , and (resp. ).
In the two models we propose, the relationship between quantities that could be estimated by sample means from the data and the infinitedimensional parameter (or ) is explicit and this allows us to build a userfriendly approximated likelihood. These models does not only make the optimization of the likelihoodtype criteria simpler. First of all, they induce censoring mechanisms that make sense in some applications. See Patilea & Rolin (2006a) for a discussion.
3.1 The rightcensoring and current status data case
The parameters of our first model are and the hazard function . Let and denote the true values of the parameters. Using the notation from equation (2.6) we can also write .
In view of equation (2.3) let us consider
as estimator of For estimating we shall use a partial likelihood approach. With at hand an estimate of , we will use an empirical version of equation (2.6) and build an estimate of . For these purposes let us define empirical quantities
For a column vector , and Let
and
(3.2) 
Consider
as the empirical version of the cumulative hazard function , as defined in (2.6).
Using these empirical quantities, and recalling that , we can write the following approximation of the criterion defined in (3.1) :
where is some threshold that prevents from dividing by zero, it will be specified below. Hence, let us define the approximate loglikelihood function
and The regression parameter is then estimated by
where is a set of parameters and is fixed by the statistician. For theoretical results, one needs conditions allowing to control for small values of This is technical condition that is usually ignored in practice where one would simply take equal to the largest uncensored observation. Next, the cumulative hazard function is estimated by
and the conditional survival function of the lifetime of interest is estimated by
The conditional cure probability is then estimated by
3.2 The leftcensoring and current status data case
In the case of the model for leftcensored and current status data the estimate of is
Next, using the same notation as above, let us define
Let us denote
and, for any such that , consider
Let us fix some (small) value such that and, similarly to the construction presented in section 3.1, define the approximated loglikelihood function
where The regression parameter is then estimated by
where is a set of parameters and is fixed by the statistician. Like in the previous model, imposing a bound , here it should be a lower one, is a technical condition usually ignored in applications. Next, the conditional distribution function of the lifetime of interest is estimated by
The zero lifetime conditional probability is then estimated by
and the baseline cumulative reverse hazard is estimated by .
4 Asymptotic results
For the asymptotic results we only consider the investigation of the rightcensored and current status data case. For the leftcensored and current status data case the results are similar and could be obtained after obvious modifications.
Let be the probability distribution of and for any integrable function let Let
be the empirical distribution function and
Let us introduce the following additional assumptions.

The vector of covariates lies in , with
fixed, has a positive definite variance and is bounded, that is
a.s. Moreover, is an interior point of the parameter set that is a compact subset of , and ; 
The value is such that
For simplicity we rule out the case because in this case and a.s., that is we are exactly in the classical PH model under rightcensoring. Since is strictly positive, Assumption A3) is equivalent to Also for simplicity, in the sequel we assume that the lifetime of interest and the censoring time are almost surely different. Let us notice that the construction we propose in sections 2.1 and 2.2 adapts to the case where depends on the sample size, or to the case where is an infinitedimensional space. The study of the properties of the estimators defined in such cases is left for future work.
Theorem 4.1 (Consistency).
Let . Assume and Assumptions A1–A3 hold true. Then:

, in probability;

in probability.
Theorem 4.2 (I.i.d. representation).
Under the assumptions of Theorem 4.1 we have:
where is some squares integrable function and is a reminder term that is uniformly negligible, that is
Corollary 4.3 (Clt).
Under the assumptions of Theorem 4.2
where is a tight, zeromean Gaussian process with covariance function
We could also derive the asymptotic law of the estimator of the survivor function for an arbitrary value in the support of the covariates. The following result is a straightforward extension of classical results for Cox PH model, see Link (1984).
Corollary 4.4 (CLT for the conditional survivor).
Under the assumptions of Theorem 4.2 and for any fixed
where is a tight, zeromean Gaussian process.
Let us now investigate the estimator of the cure rate. Suppose that has a bounded support and let be its right endpoint. Assume that Then in our model we necessarily have and Since one cannot identify the law of beyond the last uncensored observation, by an usual convention, These quantities could be estimated by The following corollary is a direct consequence of Corollary 4.4.
Corollary 4.5 (CLT for the conditional cure rate).
Suppose that the assumptions A1, A2 hold true. Moreover, has a bounded support with right endpoint . Assume that Then
where is the largest uncensored observation and with from Corollary 4.4.
The estimation of the covariance functions of the processes and , and of the variance is quite difficult. Therefore we propose an alternative route, based on the weighted bootstrap, for estimating the asymptotic law of our estimators. Let us consider that is an uniformly consistent estimator of . Next, let us define
where are i.i.d., with zero mean and unit variance random variables, for instance gaussian, independent of the data.
Theorem 4.6 (Asymptotic law approximation).
References
 [1] Braekers, R. & Grouwels, Y. (2015). A semiparametric Cox’s regression model for zeroinflated leftcensored time to event data. Communications in Statistics – Theory and Methods 45(7), 1969–1988.
 [2] Cox, D.R. (1972). Regression models and life tables (with discussion). J. Roy. Statist. Soc. Ser. B 34, 187–220.
 [3] Cox, D.R. (1975). Partial likelihood. Biometrika 62, 269–276.
 [4] Fang, H.B., Li, G., & Sun, J. (2005). Maximum likelihood estimation in a semiparametric Logistic/proportionalhazards mixture model. Scand. J. Statist. 32, 59–75.

[5]
Gill, R.D. (1994). Lectures on survival analysis
. Lectures on probability theory: Ecole d’été de probabilités de SaintFlour XXII. Lecture notes in mathematics 1581. Springer.
 [6] Gill, R.D., Johansen, S. (1990). A Survey of ProductIntegration with a View Toward Application in Survival Analysis. Ann. Statist. 18(4), 1501–1555.
 [7] Huang, J. (1999). Asymptotic properties of nonparametric estimation based on partly intervalcensored data. Statistica Sinica 9, 501–519.
 [8] Kim, J.S. (2003). Maximum likelihood estimation for the proportional hazards models with partly intervalcensored data. J. Royal Stat. Soc. B 65, 489–502.
 [9] Kim, Y., Kim, B., Jang, W. (2010). Asymptotic properties of the maximum likelihood estimator for the proportional hazards model with doubly censored data. J. Multivar. Anal. 101, 1339–1351.
 [10] Kosorok, M.D. (2008). Introduction to empirical process and semiparametric inference. Springer Series in Statistics, Springer: NewYork.

[11]
Link, C.L.
(1984). Confidence intervals for the survival function using Cox’s proportionalhazard model with covariates.
Biometrics 40, 601–609.  [12] Murphy, S. A., & A. W. van der Vaart (2000). On profile likelihood. J. Amer. Statist. Assoc. 95(450), 449–465.
 [13] Patilea, V., & Rolin, J.M. (2006a). Productlimit estimators of the survival function for two modified forms of currentstatus data. Bernoulli 12, 801–819.
 [14] Patilea, V., & Rolin, J.M. (2006b). Productlimit estimators of the survival function with twice censored data. Ann. Statist. 34, 925–938.
 [15] Turnbull, B.W. (1974). Nonparametric estimation of a survivorship function with doubly censored data. J. Amer. Statist. Assoc. 69, 169–173.
 [16] van der Vaart, A.D. (1998). Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge.
 [17] van der Vaart, A.D., & Wellner, J.A. (1996). Weak convergence and empirical processes. Springer Series in Statistics. SpringerVerlag, NewYork.
 [18] van der Vaart, A.D., & Wellner, J.A. (2007). Empirical processes indexed by estimated functions. In Asymptotics: Particles, Processes and Inverse Problems, IMS Lecture Notes–Monograph Series, Vol. 55 (2007) 234–252.
 [19] Zheng, D., Yin, G. & Ibrahim, J.G. (2006). Semiparametric Transformation Models for Survival Data With a Cure Fraction. J. Amer. Statist. Assoc. 101(474), 670–684.
5 Appendix
5.1 Notation
For any matrix , we denote by Let us recall that vectors are considered as column matrices. The spaces of functions we consider are endowed with the uniform (supremum) norm that is denoted by Let and denote the partial derivation operators with respect to and respectively.
Let
and
Let
The criterion is expected to be the limit of the approximated loglikelihood function . Let us recall that denotes the probability distribution of and for any integrable function let Moreover,
is the empirical measure, and Finally, define
5.2 Proof of Theorem 4.1
To prove consistency for , it suffices, for instance, to use the results from section 5.2 of van der Vaart (1998). This means to check that
Comments
There are no comments yet.