Biostatistics Advance Access first published online on April 24, 2006
This version published online on February 19, 2007
Biostatistics, doi:10.1093/biostatistics/kxj040
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Parametric regression on cumulative incidence function
Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261, USA jeong{at}nsabp.pitt.edu
Department of Statistics and Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI 53706, USA
* To whom correspondence should be addressed.
| SUMMARY |
|---|
|
|
|---|
We propose parametric regression analysis of cumulative incidence function with competing risks data. A simple form of Gompertz distribution is used for the improper baseline subdistribution of the event of interest. Maximum likelihood inferences on regression parameters and associated cumulative incidence function are developed for parametric models, including a flexible generalized odds rate model. Estimation of the long-term proportion of patients with cause-specific events is straightforward in the parametric setting. Simple goodness-of-fit tests are discussed for evaluating a fixed odds rate assumption. The parametric regression methods are compared with an existing semiparametric regression analysis on a breast cancer data set where the cumulative incidence of recurrence is of interest. The results demonstrate that the likelihood-based parametric analyses for the cumulative incidence function are a practically useful alternative to the semiparametric analyses.
Keywords: Breast cancer; Clinical trial; Competing risks; Cumulative incidence; Cure model; Improper distribution; Regression; Transformation model
| 1. INTRODUCTION |
|---|
|
|
|---|
Competing risks data are encountered frequently in medical research. In breast cancer trials, like Protocol B-19 (Fisher and others, 1989
When analyzing cause-specific failure patterns, investigators may be interested in the effects of covariates on the event-specific failure probabilities. Such analyses may involve testing the effects of treatment adjusted for important prognostic factors, in addition to testing the effects of the prognostic factors. For patient management, it is helpful to characterize the proportion of events over time for causes of interest conditionally on covariates, including treatment. Gray (1988)
proposed a nonparametric inference procedure to compare the cumulative incidence curves among two groups, with alternative tests discussed in Pepe (1991)
. Fine and Gray (1999)
and Fine (2001)
adapted the proportional hazards model (Cox, 1972
, 1975
) to the cumulative incidence function and proposed inferences for the effects of treatment and other continuous prognostic factors. In breast cancer trials at NSABP, one may wish to quantify the effect of radiation therapy adjusted for known risk factors, such as age, tumor size, number of positive lymph nodes, and estrogen receptor level. Regression models are critical when adjusting for such variables, which are usually collected as either count or continuous measurements.
Unlike with traditional survival endpoints, the proportion of recurrences in breast cancer tends to increase for a period of time and then plateau (Karrison and others, 1999
). Those patients who do not experience recurrences can be viewed as a cured population (Boag, 1949
) and inferences about the recurrence "distribution" can be conceptualized in a cure model framework. The asymptote of the cumulative distribution function of recurrences is less than 1, that is, the "distribution" is improper. A key issue in regression modeling of recurrences is the effect of covariates on the leveling off point of the cumulative distribution function.
The most widely used analyses of competing risks data in practical applications, like the breast cancer data, are nonparametric and semiparametric. A major advantage of these approaches is that there is no need to assume an underlying distributional form for the cumulative incidence function, which is difficult in the competing risks setting, owing to the impropriety of this function. Of course, such flexibility arises at the cost of efficiency loss relative to parametric models, especially with small sample sizes (Miller, 1983
; Jeong and Oakes, 2003
). The trade-off for additional parametric assumptions is the potential bias associated with model misspecification (Meier and others, 2004
). On the other hand, the parametric models permit extrapolation of long-term event probabilities, which are of inherent interest and which cannot generally be identified from nonparametric and semiparametric models. Moreover, parametric regression models are amenable to formal maximum likelihood (and Bayesian) inferences, unlike the semiparametric analyses of Fine and Gray (1999)
and Fine (2001)
.
Jeong and Fine (2006)
proposed a direct parameterization of the cumulative incidence function without covariates. Empirical studies using NSABP breast cancer data showed that a simple form of the improper Gompertz distribution (Gompertz, 1825
) provided better fits than more complex parametric mixture models (Larson and Dinse, 1985
), as measured by the agreement of the fits with nonparametric estimates of the cumulative incidence functions. The direct parameterization is more parsimonious and has a more straightforward interpretation than do indirect parameterizations, where the cause-specific hazard function (Bryant and Dignam, 2004
) and/or the overall survival function (Benichou and Gail, 1990
) are modeled with proper distributions. This is particularly true if long-term event probabilities are of interest, which cannot be inferred directly from the component models in the indirect parameterization.
In this paper, we extend Jeong and Fine (2006)
to the regression setting. Maximum likelihood inferences are developed in which parametric models for the cumulative incidence functions for all causes are fit simultaneously. Inferences are based on standard asymptotic results for the maximum likelihood estimators. Our general parametric framework encompasses models which are parametric specializations of the models in Fine and Gray (1999)
and Fine (2001)
. In the analysis of the Protocol B-19 data set, we employ generalized odds rate regression models (Dabrowska and Doksum, 1998
), with Gompertz baseline distributions, including both proportional hazards and proportional odds models as special cases. The proposed parametric procedure permits goodness-of-fit tests for the proportional hazards and proportional odds assumptions, assuming that the parametric model for the base distribution is correctly specified.
In Section 2, we introduce the competing risks data and the associated notation. In Section 3, general parametric regression models are formulated for the cumulative incidence function. The Gompertz distribution is presented and its suitability as a model for the baseline distribution is discussed. In Section 4, maximum likelihood estimation is presented. In Section 5, the variances of the estimated regression parameters and cumulative incidence probabilities are derived. In Section 6, the parametric procedure is compared with the semiparametric FineGray (1999) model on NSABP Protocol B-19.
| 2. CUMULATIVE INCIDENCE FUNCTIONONE SAMPLE CASE |
|---|
|
|
|---|
The basic identifiable quantities from competing risk data (T,K) are the cause-specific hazard and cumulative incidence functions, where T is time to the first event and K
(1,...,nK) is the event type, where nK is the number of event types. The cause-specific hazard function for an event K = k at time t is
k(t) = lim
0Pr(t < T
t +
,K = k|T
t)/
. For small
, |
| (2.1) |
As noted in Jeong and Fine (2006)
, the left-hand side in (2.1) approximates the probability density function for the kth cause-specific event as
approaches 0. This implies that the cumulative incidence function for the kth cause-specific event is
|
| (2.2) |
where S(t) = Pr(T > t) and
k(t) = 

k(u)du is the cumulative hazard function for the kth cause-specific event. In (2.1), the cause-specific hazard function
k(t) on the right-hand side makes the probability density function for cause-specific events of type k improper whenever
k <
k
k. Therefore, the cumulative incidence function in (2.2) may also be improper.
In practice, T is typically subject to additional independent right censoring. To nonparametrically estimate the cumulative incidence function, the overall survival function S(·) may be replaced by the KaplanMeier estimator (Kaplan and Meier, 1958
) and the cause-specific cumulative hazard function
k(·) may be replaced by a NelsonAalen estimator (Nelson, 1972
; Aalen, 1978
). Bryant and Dignam (2004)
recently proposed a semiparametric inference procedure by parameterizing only the cause-specific hazard function in the integral (2.2), with S(·) being estimated nonparametrically. They noticed an efficiency gain over the nonparametric estimator of Fk. Benichou and Gail (1990)
considered fully parametric inference on the cumulative incidence function by parameterizing the cause-specific hazard function and the overall survival function. Larson and Dinse (1985)
considered parametric inference for a mixture model representation of the joint distribution of (T,K). Jeong and Fine (2006)
proposed direct parametric modeling of Fk(·), and suggested the Gompertz distribution, which is tailored to the unique features of Fk(·). They showed that the direct parametric approach may provide a better fit than either the cause-specific hazard or the mixture model approaches when there is a plateau in the tail of the cumulative incidence function.
| 3. REGRESSION MODELS |
|---|
|
|
|---|
For direct regression modeling of the cumulative incidence function, it is convenient to consider a transformation model structure (Fine and Gray, 1999
|
| (3.1) |
where uk(t) is an invertible and monotonically increasing function, ßk is a P x 1 parameter vector, and Z is a time-independent P x 1 covariate vector. For two individuals with covariate vectors Z1 and Z2, the conditional cumulative incidence functions satisfy a vertical shift model
|
| (3.2) |
Specifying gk(·) to be the logit function gives a proportional odds model for cause k events, with the regression parameters in ßk corresponding to time-independent log odds ratios per unit increases in the covariates.
Fine and Gray (1999)
considered the proportional hazards model to directly infer the effects of covariates on the cumulative incidence of type k events. Their model was originally posited in terms of the subdistribution hazard function (Gray, 1988
) specifying that
|
| (3.3) |
where 
(t;Z) = lim
0Pr{t
T
t +
,K = k|T
t
(T
t
K
k);Z}/
. The subdistribution hazard function 
(·) is the hazard function for the improper random variable T* = I(K = k)xT + {1 I(K = k)}x
, where I(·) is an indicator function. The model (3.3) corresponds to the transformation model (3.1) where gk(v) = log{ log(1 v)}. Equivalently, the cumulative probability of a type k event is given by
|
| (3.4) |
where uk(t) = logk{

k0(s)ds}. For semiparametric inference about ßk separately from
k0(t) and separately from the models for Fj(t;Z) (j
k), Fine and Gray (1999)
constructed a partial likelihood in which the risk set for type k events is constructed so that subjects having already experienced events other than type k are always at future "risk" of a type k event. This differs from the traditional cause-specific hazard analysis where the occurrence of an event other than type k removes an individual from future risk sets (Kalbfleisch and Prentice, 1980
).
Extending Fine (2001)
, we propose a general parametric class of transformation models, in which there may be unknown parameters in gk(·). Each event type has its own model, with distinct parameters for gk(·), uk(·), and ßk. The link function gk(·) in (3.3) may have arbitrary parametric form gk(vk;
k), where
k may be unknown. In the Protocol B-19 analysis, we employ the odds rate transformation
|
| (3.5) |
which includes the proportional hazards and proportional odds models (Dabrowska and Doksum, 1998
) when 
0 and
= 1, respectively. Adopting a flexible link function is useful for assessing the goodness-of-fit of the proportional hazards model and other fully specified models for gk(·). Under model (3.5), the cumulative probability of a type k event is given by
|
| (3.6) |
In this paper, the Gompertz (1825)
distribution is used to parameterize the log baseline cumulative subdistribution hazard function, uk(t), which permits the asymptote of the cumulative distribution function to be < 1. The cumulative distribution function can be written
|
| (3.7) |
where
<
<
and 0 <
<
. The hazard function {dB(t;
,
)/dt}{1 B(t;
,
)} 1 is
|
| (3.8) |
and hence the cumulative hazard function is given by uG(t;
,
) =
{exp(
t) 1}/
. An improper distribution occurs when
< 0 and
<
. The hazard function (3.8) can fit either increasing or decreasing hazards, depending on the signs of the parameters.
The implied parametric regression model for the cumulative incidence function is
|
| (3.9) |
where
k = (
k,ß
,
k,
k). For large t, one minus the cumulative distribution function in (3.9) equals the proportion of patients cured, given a set of covariate values. With competing risks data, this cured fraction is the proportion of individuals never experiencing event k, for example, breast cancer recurrence. Note that, when
< 0, as t
, the formula reduces to
|
| (3.10) |
so that the proportion never experiencing a type k event is 1 Fk(
;
k,Z).
| 4. MAXIMUM LIKELIHOOD ESTIMATION |
|---|
|
|
|---|
Let Ti and Ci be the potential failure time and the potential censoring time, respectively, for the ith subject. Define Xi = min(Ti,Ci). The indicators for the competing events are
![]() |
for k = 1,...,nK. In the breast cancer example where nK = 2,
1i = 1 if the ith patient experiences a recurrence as a first event and 0 otherwise, and
2i is similarly defined for the ith patient experiencing events other than recurrences. The observable data are denoted as (Xi,
1i,...,
nKi,Zi)(i = 1,
,n).
Following similar arguments for direct inference for Fk in the one-sample case (Jeong and Fine, 2006
), given covariate Zi = zi, the likelihood function is
![]() | (4.1) |
where fk(x,
k;zi) = dFk(x,
k;zi)/dx (k = 1,...,nK). Note that the likelihood involves information from all failure types and does not factor into separate pieces for each type. This differs from the parameterization based on the cause-specific hazard functions (Prentice and others, 1978
), where the likelihood factors so that inferences about cause 1 may be carried out separately from the models for other causes. Under this formulation, misspecification of cause-specific hazard models for other causes does not lead to bias in the estimated model for cause 1. A limitation is that direct inference about the cumulative incidence functions is not possible.
In (4.1), the cumulative probability of failure from any event is the sum of the nK cumulative incidence functions, F1(·),...,FnK(·). Under the cause-specific hazard formulation, this cumulative probability is obtained from the overall hazard rate, which is the sum of the corresponding nK cause-specific hazard functions (Benichou and Gail, 1990
; Bryant and Dignam, 2004
).
When the proportional subdistribution hazard model (Fine and Gray, 1999
) is assumed for type k events, Fk(x,
k;zi) may be replaced with Fk(x;z) in (3.4) after parameterizing uk(x) by u
(x;
k,
k). In this paper, our focus is a model with general gk(vk;
k), including the odds rate model (3.9), which accommodates a range of nonproportional hazards models.
From (4.1), the log-likelihood function is given by
![]() | (4.2) |
Differentiating (4.2) and setting the resulting score function equal to 0 with respect to
k, the maximum likelihood estimator
, k = 1,...,nK, can be obtained. The maximum likelihood estimator of Fk(t;
k,z) is
, k = 1,...,nK.
| 5. LARGE SAMPLE INFERENCES |
|---|
|
|
|---|
The observed information matrix can be derived by taking the second derivatives of the log-likelihood function. Given Z = z, the variance of
can be evaluated by the multivariate delta method as |
| (5.1) |
where
Fk(t,
k;z)/
k is a vector of the first derivatives of the cumulative incidence function for the kth cause-specific event with respect to
k. The matrix
is a submatrix of the inverse of the observed information matrix corresponding to the variance of
, evaluated at
. A pointwise 95% confidence interval for Fk(t;z) is
|
| (5.2) |
Confidence intervals may also be based on inverting the likelihood ratio test, which may have better small sample properties than (5.2). A disadvantage is that these intervals cannot be calculated using output from the fitted model and additional computations are needed.
One may evaluate the regression parameters and the form of gk(·) with simple Wald-type test statistics. Under model (3.9), the hypotheses are (i) H0:
k =
k0, k = 1,...,nK, for testing the proportional hazards (
k0 = 0) or odds (
k0 = 1) assumption and (ii) H0:ßkp = 0, k = 1,...,nK, p = 1,...,P (where ßk = (ßk1,...,ßkP)T), for testing the covariate effect on the cumulative incidence of type k events. To test (i), the statistic
|
| (5.3) |
may be used. To test the hypothesis (ii), we consider
|
| (5.4) |
From the theory of maximum likelihood estimation, the statistics Z
k and Zßkp follow the standard normal distribution asymptotically when the corresponding null hypothesis holds. Likelihood ratio and score tests may also be utilized for testing.
| 6. APPLICATION TO BREAST CANCER DATA |
|---|
|
|
|---|
The data comes from Protocol B-19, one of the earliest clinical trials on breast cancer treatment at NSABP. Two adjuvant chemotherapy regimens, methotrexate and 5-fluorouracil (MF) and cyclophosphamide plus MF (CMF), were compared among breast cancer patients with negative axillary lymph nodes and negative estrogen receptors. The patients have been followed 15 + years for cancer recurrence and mortality. Fisher and others (2004)
In the analysis, we define recurrence as any breast cancer recurrence in local, regional, or distant sites as first events. Other competing first events include second primary cancer other than breast and deaths without evidence of any disease. This definition implies disease-specific versusnon-disease-specific events in terms of breast cancer. In this cohort, the numbers of recurrences and other events are 211 and 161, respectively.
The focus of the analysis is the effects of treatment and other baseline prognostic factors on the cumulative incidence functions of recurrence and other events, which quantify long-term disease burden. Since all patients on NSABP B-19 are axillary lymph node negative and estrogen receptor negative, we only consider treatment group (trt), tumor size (tsize), and age at randomization (age) as potential covariates in our models. We analyze two models: a simple model with one covariate for the treatment effect and a full model based on the three covariates.
We begin by analyzing the model with a single treatment effect covariate (trt), coded 0 for the MF group and 1 for the CMF group. For comparison, the method of Fine and Gray (1999)
, denoted as FG, was used to fit a semiparametric transformation model with gk(vk) = log{ log(1 vk)} in (3.3) and uk(·) completely unspecified. Parametric generalized odds rate regression models with Gompertz base distribution were fit with
k = 0 (proportional hazards; PH(G)),
k = 1 (proportional odds; PO(G)), and with
k estimated (GOR(G)). Table 1 summarizes parametric and semiparametric estimates of the regression models for breast cancer recurrence and other competing events.
|
The negative sign of
for recurrence indicates that the estimated distribution is improper. Testing the null hypothesis H0:
0 in this case gives a one-sided p-value less than 0.0001. Interestingly, for other events
and testing the null hypothesis H0:
0 gives a one-sided p-value of 0.20, indicating the estimated cumulative incidence function is proper. These results make intuitive sense if one considers that the cumulative incidence functions are estimated using data from the observation period of Protocol B-19, which only spans 14 years. Few breast cancer recurrences occur after 10 years and the cumulative incidence function for recurrence plateaus between 10 and 14 years; see Figure 1. An improper distribution is clearly warranted. On the other hand, other events occur at a fairly steady rate over the entire time period, with the cumulative incidence increasing linearly up to 14 years; see Figure 1. While one would expect that this incidence curve would plateau at later times, over the first 14 years, the curve is better described by a proper distribution. Of course, it would be inappropriate to extrapolate the fitted Gompertz model to estimate long-term probabilities of other events.
|
The proportional hazards and proportional odds assumptions were tested using the statistic (5.3). The p-values from testing the proportional hazards (odds) assumptions are 0.41 (0.16) and 0.45 (0.46) for recurrence and other events, respectively. Neither of the models is rejected for either event type, reflecting the large variances of
.
Given the weak evidence against proportional hazards and odds models, we now consider analyses fixing
= 0 or 1, which may have greater efficiency and greater interpretability than analyses in which
is estimated (see Table 1). Under the proportional hazards model, Wald tests based on the FG semiparametric estimates of the treatment effects give p-values of 0.0001 for recurrences and 0.52 for the other events. After fixing
k = 1 under the proportional odds model with Gompertz baseline, Wald tests for the treatment effects result in p-values of 0.0001 and 0.50, for recurrence and other events. Under the parametric proportional hazards model with Gompertz baseline and
k = 0, the corresponding p-values are 0.0002 and 0.51. The estimated treatment effects under the semiparametric and parametric proportional hazards models are almost identical. Under the proportional odds model, the odds of recurrence on CMF is exp( 0.6) = 0.55 that on MF, at all times t, based on the cumulative incidence functions. This interpretation may be more natural than that based on the proportional hazards model with
= 0, where the regression parameters denote subdistribution hazard ratios.
Figure 1(a) shows a comparative plot of the nonparametric (dotted and dashed) estimates, semiparametric estimates from the proportional hazards model (dashed), and parametric estimates from the generalized odds rate model (solid) and the proportional odds model (dotted) of the cumulative incidence curves for recurrence up to year 14. Figure 1(b) shows a similar plot for the other events. To check the baseline Gompertz assumption, the parametric and semiparametric estimates of the baseline cumulative hazard functions were compared under the proportional hazards model (Figure 2). The parametric estimates of u0(t) are calculated as
, where
and
are the estimates from the parametric proportional hazards model with Gompertz baseline. The semiparametric estimates are calculated using
, where
is the semiparametric estimate of the baseline cumulative subdistribution from the FG model. In both Figures1 and 2, the parametric curves agree reasonably well with the nonparametric and semiparametric estimates, although there is some evidence of lack of fit in the first few years of the follow-up period for breast cancer recurrence in Figure 1(a). There do not appear to be substantial differences between the different parametric fits. These findings suggest that the cumulative incidence functions may be well approximated by simple Gompertz models. Moreover, the estimated long-term failure probabilities are fairly insensitive to
in the odds rate model.
|
Next, we fit a regression model with age and tsize, as well as trt. The parametric and semiparametric estimates are summarized in Table 2. P-values for testing the proportional hazards (odds) assumptions are 0.73 (0.32) and 0.36 (0.27) for recurrence and other events, respectively. Again, there is large variability in estimation of
, which makes it difficult to differentiate between different transformations in (3.1).
|
For breast cancer recurrence, Wald tests for ßtrt, ßage, and ßtsize give p-values of 0.0001, 0.321, and 0.010 from the parametric proportional odds model, 0.0001, 0.330, and 0.004 from the semiparametric proportional hazards model, and 0.0001, 0.335, and 0.012 from the parametric proportional hazards model with Gompertz baseline. For other non-breast-cancer-related events, p-values for the regression coefficients are 0.548, 0.003, and 0.839 from the parametric proportional odds model, 0.540, 0.008, and 0.930 from the FG model, and 0.557, 0.003, and 0.808 from the parametric proportional hazards model with Gompertz baseline.
For both recurrence and other events, the results are rather consistent across models. There are significant treatment and tumor size effects on recurrence among node-negative and estrogen receptor-negative breast cancer patients in Protocol B-19. The observation of a positive correlation between tumor size and breast cancer recurrence rate is anticipated, but is still important information for patients and investigators. The decreased risk of recurrence with CMF persists after adjusting for initial disease severity, as measured by tumor size. For non-breast-cancer-related events, only the effect of age is significant. This may be because the etiology of the other causes of failure is connected to the aging process, which is unaffected by either treatment or the size of the original breast tumor.
We consider estimation of the recurrence rates conditionally on treatment group and tumor size, whose effects are statistically significant in all models. The parametric generalized odds rate model with unknown
(smooth lines) and the semiparametric FG model (dashed lines with jumps) are shown in Figure 3 for different values of tumor size in each treatment group. For the odds rate model, the point estimates are supplemented by 95% pointwise confidence intervals. The parametric curves are calculated with
|
|
|
where group = 0 for the MF arm or 1 for the CMF arm and tsize = 15, 30, or 45 mm. The two cumulative incidence estimates agree, on average, with the parametric model tending to underestimate at early time points. The semiparametric curves are within the pointwise confidence intervals from the parametric estimates, except for the first few years of the follow-up period. In each treatment group, the probability of recurrence increases as tumor size increases. Parametric estimates of 5- and 10-year recurrence rates in the MF arm at tumor sizes of 15, 30, and 45 mm are 17.9, 21.0, and 24.5% and 22.4, 26.2, and 30.6%, respectively.
The proportion of breast cancer patients never experiencing breast cancer recurrence, which can be interpreted as a cure fraction, can be estimated by replacing the parameters in (3.10) with their maximum likelihood estimates. Under the generalized odds rate model, this yields
|
|
For tumor size 20 mm (median in Protocol B-19), the estimated long-term cure rates are 74.7 and 84.4% for patients on MF and CMF, respectively. This represents a meaningful decrease in disease burden for breast cancer patients receiving CMF. It is important to recognize that such estimation is not possible under nonparametric and semiparametric models, where uk(t) cannot be estimated beyond the largest observed follow-up time.
| 7. DISCUSSION |
|---|
|
|
|---|
In this paper, we proposed maximum likelihood inferences for a direct parametric regression modeling framework for the cumulative incidence function with competing risks data. A general parametric form of the transformation gk(vk;
k) was considered, which includes proportional hazards and proportional odds models. The baseline distribution was modeled using a Gompertz specification, which accommodates improper distributions, like the cumulative incidence function. In theory, any parametric model could be used for uk(t), with the maximum likelihood estimators giving valid inferences under the usual regularity conditions. The parametric analysis enables estimation of the long-term proportion of individuals experiencing a particular event type. This differs from standard nonparametric and semiparametric analyses, where uk(t) is completely unspecified. Of course, care should be exercised when interpreting parametric extrapolations, particularly when there is no evidence of a plateau in the tail of the estimated cumulative incidence function over the observed time period.
A two-parameter Gompertz distribution was used to parameterize the baseline distribution. It is worth noting that this model only allows monotone hazard shapes. Greater flexibility may be obtained with other parametric models which permit unimodal and bathtub shapes. The additional flexibility may be helpful in obtaining more accurate predictions of the cumulative incidence functions over the entire follow-up period, as in Protocol B-19, where there is some evidence of lack of fit at early time points. There may be other applications where the Gompertz model is seriously deficient and different specifications may be needed to obtain reliable results.
| ACKNOWLEDGMENTS |
|---|
This research was supported in part by the Department of Defense grant W81XWH-04-1-0605 and National Health Institute grants 5-U10-CA69974-09 and 5-U10-CA69651-11. Conflict of Interest: None declared.
| REFERENCES |
|---|
|
|
|---|
-
Aalen OO. (1978) Nonparametric inference for a family of counting processes. Annals of Statistics 6:70126.[Web of Science]
Benichou J and Gail MH. (1990) Estimates of absolute cause-specific risk in cohort studies. Biometrics 46:81326.[CrossRef][Web of Science][Medline]
Boag JW. (1949) Maximum likelihood estimates of the proportion of patients cured by cancer therapy. Journal of the Royal Statistical Society, Series B, Methodological 11:1544.
Bryant J and Dignam JJ. (2004) Semiparametric models for cumulative incidence functions. Biometrics 60:18290.[CrossRef][Web of Science][Medline]
Cox DR. (1972) Regression models and life-tables (with discussion). Journal of the Royal Statistical Society, Series B, Methodological 34:187202.
Cox DR. (1975) Partial likelihood. Biometrika 62:26976.
Dabrowska DM and Doksum KA. (1998) Estimation and testing in a two-sample generalized odds-rate model. Journal of the American Statistical Association 83:7449.[CrossRef]
Fine JP. (2001) Regression modelling of competing crude failure probabilities. Biostatistics 2:8597.[Abstract]
Fine JP and Gray RJ. (1999) A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association 94:496509.[CrossRef]
Fisher B, Jeong J, Anderson S, Wolmark N. (2004) Treatment of lymph node-negative, estrogen receptor-negative breast cancer: updated findings from National Surgical Adjuvant Breast and Bowel Project clinical trials. Journal of the National Cancer Institute 96:182331.
Fisher B, Redmond C, Dimitrov NV, Bowman D, Legault-Poisson S, Wickerham DL, Wolmark N, Fisher ER, Margolese R, Sutherland C. (1989) A randomized clinical trial evaluating sequential methotrexate and fluorouracil in the treatment of patients with node-negative breast cancer who have estrogen-receptor-negative tumors. The New England Journal of Medicine 320:4738.[Abstract]
Gaynor JJ, Feuer EJ, Tan CC, Wu DH, Little CR, Straus DJ, Clarkson BD, Brennan MF. (1993) On the use of cause-specific failure and conditional failure probabilities: examples from clinical oncology data. Journal of the American Statistical Association 88:4009.[CrossRef]
Gompertz B. (1825) On the nature of the function expressive of the law of human mortality, and on the new mode of determining the value of life contingencies. Philosophical Transactions of the Royal Society of London A 115:51380.
Gray RJ. (1988) A class of K-sample tests for comparing the cumulative incidence of a competing risk. Annals of Statistics 16:114154.[Web of Science]
Jeong J and Fine JP. (2006) Direct parametric inference for cumulative incidence function. Journal of the Royal Statistical Society, Series C, Applied Statistics 55:187200.[CrossRef][Web of Science]
Jeong J and Oakes D. (2003) On the asymptotic relative efficiency of estimates from Cox's model. Sankhya 65:41121.
Kalbfleisch JD and Prentice RL. (1980) The Statical Analysis of Failure Time Data(John Wiley & Sons, Inc, New York).
Kaplan EL and Meier P. (1958) Nonparametric estimator from incomplete observations. Journal of the American Statistical Association 53:45781.[CrossRef]
Karrison TG, Ferguson DJ, Meier P. (1999) Dormancy of mammary carcinoma after mastectomy. Journal of the National Cancer Institute 191:805.
Korn EL and Dorey FJ. (1992) Applications of crude incidence curves. Statistics in Medicine 11:81329.[Web of Science][Medline]
Larson MG and Dinse GE. (1985) A mixture model for the regression analysis of competing risks data. Journal of the Royal Statistical Society, Series C, Applied Statistics 34:20111.
Meier P, Karrison T, Chappell R, Xie H. (2004) The price of KaplanMeier. Journal of the American Statistical Association 99:8906.[CrossRef]
Miller RG Jr. (1983) What price KaplanMeier? Biometrics 39:107781.[CrossRef][Web of Science][Medline]
Nelson W. (1972) Theory and applications of hazard plotting for censored failure data. Technometrics 14:94565.[CrossRef][Web of Science]
Pepe MS. (1991) Inference for events with dependent risks in multiple endpoint studies. Journal of the American Statistical Association 86:7708.[CrossRef]
Pepe MS and Mori M. (1993) KaplanMeier, marginal or conditional probability curves in summarizing competing risks failure time data? Statistics in Medicine 12:73751.[Web of Science][Medline]
Prentice RL, Kalbfleisch JD, Peterson AV Jr, Flournoy N, Farewell VT, Breslow NE. (1978) The analysis of failure times in the presence of competing risks. Biometrics 34:54154.[CrossRef][Web of Science][Medline]
Taghian A, Jeong J, Mamounas E, Anderson S, Bryant J, Deutsch M, Wolmark N. (2004) Pattern of loco-regional failure in patients with breast cancer treated by mastectomy and chemotherapy ( + / tamoxifen) without radiation: results from five NSABP randomized trials. Journal of Clinical Oncology 22:424754.
Received January 12, 2006; revised April 19, 2006; accepted for publication April 21, 2006.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J.-H. Jeong and J. P. Fine A note on cause-specific residual life Biometrika, March 1, 2009; 96(1): 237 - 242. [Abstract] [PDF] |
||||
![]() |
J. Beyersmann and M. Schumacher Time-dependent covariates in the proportional subdistribution hazards model for competing risks Biostat., October 1, 2008; 9(4): 765 - 776. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Dignam and M. N. Kocherginsky Choice and Interpretation of Statistical Tests Used When Competing Risks Are Present J. Clin. Oncol., August 20, 2008; 26(24): 4027 - 4034. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||








