Skip Navigation


Biostatistics Advance Access originally published online on February 14, 2006
Biostatistics 2006 7(4):515-529; doi:10.1093/biostatistics/kxj023
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
7/4/515    most recent
kxj023v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Chen, Y. Q.
Right arrow Articles by Wang, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Chen, Y. Q.
Right arrow Articles by Wang, Y.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.

Attributable risk function in the proportional hazards model for censored time-to-event

Ying Qing Chen*

Program in Biostatistics, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA yqchen{at}scharp.org

Chengcheng Hu

Department of Biostatistics, Harvard University, Boston, MA 02115, USA

Yan Wang

Division of Biostatistics, School of Public Health, University of California, Berkeley, CA 94720, USA

* To whom correspondence should be addressed.


    SUMMARY
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 
Time-to-event endpoints are often used in clinical and epidemiological studies to evaluate disease association with hazardous exposures. In the statistical literature of time-to-event analysis, such association is usually measured by the hazard ratio in the proportional hazards model. In public health, it is also of important interest to assess the excess risk attributable to an exposure in a given population. In this article, we extend the notion of ‘population attributable fraction’ for the binary outcomes to the attributable risk function for the event times in prospective studies. A simple estimator of the time-varying attributable risk function is proposed under the proportional hazards model. Its inference procedures are established. Monte-Carlo simulation studies are conducted to evaluate its validity and performance. The proposed methodology is motivated and demonstrated by the data collected in a multicenter acquired immunodeficiency syndrome (AIDS) cohort study to estimate the attributable risk of human immunodeficiency virus type 1 (HIV-1) infections due to several potential risk factors.

Keywords: Attributable fraction; Epidemiologic methods; HIV/AIDS prevention; Population etiologic fraction; Risk assessment


    1. INTRODUCTION
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 
The Multicenter AIDS Cohort Study (MACS) is an ongoing prospective cohort study of the natural history of human immunodeficiency virus type 1 (HIV-1) infection among homosexual and bisexual men to identify their associated risk factors (Kaslow et al., 1987Go). In the MACS and many other prospective studies, various time-to-event endpoints are frequently collected to assess their association with potential risk factors. In particular, the time-to-event outcomes can be the times to HIV seroconversion since enrolment for the MACS HIV uninfected participants. It is thus of important research and public health interest to investigate how the time-to-HIV-seroconversion outcomes are associated with the risk factors, such as needle sharing or having sex with an acquired immunodeficiency syndrome (AIDS) partner. In Figure 1, the Kaplan–Meier estimates of time-to-HIV-seroconversion are plotted for these two risk factors. Apparently, both risk factors are associated with unusually high risk of HIV seroconversion during the observation period.


Figure 1
View larger version (19K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1 Kaplan–Meier estimates for time-to-HIV-seroconversion by risk factors of (a) needle-sharing practice; (b) having sex with an AIDS partner.

 
To measure the association between time-to-HIV-seroconversion and risk factors, the proportional hazards model (Cox, 1972Go),

Formula (1.1)

is often used in the statistical and medical literatures, where Z is the covariate for the risk factors and Formula is the regression parameter. Here, Formula is an unspecified baseline hazard function and Formula is the hazard function for Z. When Z is a risk factor indicator, Formula is the hazards ratio. The parameter Formula hence measures the relative risk in the proportional hazards model. Specifically, for the time-to-HIV-seroconversion in the MACS between 1984 and 1999, we estimate that Formula (Formula) for the risk factor of needle sharing (Formula) against no sharing (Formula), which means that needle-sharing practice is associated with an almost doubled hazard of HIV seroconversion. Similarly, we estimate that Formula (Formula) for the risk factor of having sex with an AIDS partner, implying a 59.9% increase in the hazard of HIV seroconversion for those having sex with an AIDS partner (Formula).

The measure of association itself, however, may be insufficient to determine what public health implication the risk factors would have on the priority of community-level prevention, since it does not take into account the prevalence of risk factors in a given population. In the MACS cohort, although engaging in the needle-sharing practice is associated with a seemingly greater risk of HIV seroconversion than having sex with an AIDS partner, the proportion of this practice in the MACS initially HIV uninfected cohort is much smaller. It is merely 1.3% compared with that of 58.6% for having sex with an AIDS partner. Without taking into account the discrepancy in the prevalence of risk factors, a prevention program that is solely based on the strength of association may not deliver its full potential on the community risk reduction. In fact, since 1980s, there has been growing public health interest in the disease risk attributable to a risk factor, or potentially preventable if the risk factor is eliminated from a given population, as reviewed in Uter and Pfahlberg (2001)Go. An attributable risk would take into account both the strength of association and the prevalence of the risk factors in the population. For example, Silverberg et al. (2004)Go recently studied the attributable risk of the AIDS cases among 525 HIV-1 seroconverted participants due to six host restriction gene variants in the MACS cohort.

In public health sciences, the parameter that characterizes the attributable risk is often referred to as the population attributable fraction or the population etiologic fraction (Benichou, 2000Go, p. 51). When the event outcomes are binary, it is usually defined as (Levin, 1953Go)

Formula (1.2)

where D is a binary event indicator and Z is the binary risk factor indicator. When the HIV seroconversion is considered binary, according to (1.2), the attributable fraction for the HIV seroconversions is calculated as 0.23% for engaging in the needle-sharing practice, while it is 24.1% for having sex with an AIDS partner. That is, in the MACS cohort, a fraction of 0.23% of the HIV seroconversion incidences would be attributed to the needle-sharing practice, but it becomes 24.1% that would be attributed to having sex with an AIDS partner. More discussion on the implication, estimation, and application of the attributable risk can be found in Walter (1976)Go, Greenland and Robins (1988)Go, and Gefeller (1992)Go.

For the attributable risk characterized by the population attributable fractions, statistical methods have been mainly focused on its estimation and inference in various epidemiological study samplings, such as case-control (Drescher and Schill, 1991Go), cross-sectional (Basu and Landis, 1993Go), and cohort designs (Benichou, 2001Go). The recent work of Graubard and Fears (2005)Go further developed more general methodologies to estimate the population attributable fraction across a variety of sample designs. Nevertheless, most of these methods are developed for the binary event outcomes. Only a few of them such as those of Greenland (2001)Go and Silverberg et al. (2004)Go may be potentially extended to the outcomes of censored time-to-event, which are frequently encountered in the studies such as the MACS.

In the sections to follow, we propose some attributable risk measures for the time-to-event outcomes. These measures are functions of time. They allow the attributable risk to be time varying. Under the widely used proportional hazards model, we develop a simple estimator for the hazard-based attributable risk function. Simulation studies are conducted to evaluate its validity and performance. We apply the proposed methodology to the publicly available portion of the MACS data from the study inception in 1984 to 1999. Some technical details are included in the Appendix.


    2. ATTRIBUTABLE RISK FUNCTIONS
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 
Let T be the nonnegative random variable of the time-to-event. A natural extension of Formula for T is, for some Formula,

Formula

where Formula values are the absolute risk functions, i.e. cumulative distribution functions of T. Thus, the attributable fraction of disease risk due to an exposure can be time varying. When t is the end of the follow-up period for a cohort study, Formula, say, Formula is the attributable fraction in (1.2). For rare diseases, when Formula values are usually approximated by their respective cumulative hazard functions of Formula, Formula can also be expressed in Formula. Within an infinitesimal neighborhood of t, an alternative measure of the attributable risk function for T is thus

Formula (2.1)

which is called attributable hazard function, given the nature of its definition. An extended measure is the average attributable hazard function, i.e. Formula on Formula. In particular, Formula is a useful summary measure of Formula. In addition, the range of Formula is Formula. Under the proportional hazards model (1.1), Formula for all Formula if and only if Formula. Since Formula in (1.1), Formula.

To gain some concrete sense of the proposed attributable risk functions of Formula and Formula, we assume that the proportional hazards model (1.1) holds for the exponential baseline hazard functions of 1 and Formula, representing relatively frequent and rare diseases, respectively. Let Formula for the exposed Formula against the unexposed Formula. Three proportions of exposure are considered: 25%, 50%, and 75%, respectively. As shown in Figure 2, the attributable risk function defined by either Formula or Formula is not necessarily constant over time, even when the baseline hazard function itself and the exposure prevalence are constant. When the baseline hazard function is relatively large, the attributable risk functions change more rapidly over time, less otherwise. That is, when the disease is more (less) frequent among the unexposed subjects, the disease risk attributable to the exposure tends to change more (less) rapidly over time. In addition, by comparing Formula with Formula, we find that Formula better approximates Formula for the less frequent disease and the smaller proportion of exposure.


Figure 2
View larger version (25K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2 Attributable fraction functions in the proportional hazards model Formula with constant Formula. Solid lines are Formula. Dashed lines are Formula.

 
Given the popularity of the proportional hazards model in the literature, it is foreseeable that Formula may be easier to be adapted in model-based estimation than Formula, since Formula is expressed directly in hazard functions. Note that Formula in (2.1) is, however, the hazard function of the marginal distribution Formula, by ignoring the heterogeneity among the subjects in the given population. It usually does not equal Formula where Formula is the distribution function of Formula By Bayes Theorem, we have

Formula

where Formula, Formula, and Formula is the conditional distribution function of Z given Formula. As a result,

Formula (2.2)

When the actual timing of the events is ignored, i.e. the event outcomes are binary, under the logistic regression model

Formula

where Formula and Formula are the regression parameters, Drescher and Becher (1997)Go discovered that the attributable fraction could be expressed as

Formula

for the rare diseases. Compared with Formula in (2.2), this would be exactly Formula if the actual occurrences of the events were scaled up to the maximum follow-up time of Formula. Thus, Formula is considered as a natural extension of Formula for the logistic regression model to the time-to-event outcomes for the proportional hazards model.


    3. ESTIMATION AND INFERENCES
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 
We follow standard notation and assumptions to establish the estimation and inference procedures for the proposed Formula under the proportional hazards model, when the time-to-event outcomes are subject to censoring. Suppose that there are n subjects recruited in the cohort study. Let Formula and Formula be the time-to-event and the censoring time, respectively, Formula. The observed data consist of n independent identically distributed (iid) Formula, Formula, where Formula and Formula. Denote the at-risk indicator Formula. Consider Formula and Formula, Formula.

Assume that Formula is the true value of the regression parameter Formula in the semiparametric proportional hazards model (1.1). The baseline hazard function Formula is unspecified. The maximum partial likelihood estimator, Formula, can then be obtained by solving the partial score equations

Formula

where Formula and Formula. Let Formula. Standard martingale theory of counting processes in Andersen and Gill (1982)Go shows that Formula is consistent and Formula is asymptotically equivalent to

Formula (3.1)

where Formula are martingales with respect to the filtration of Formula

Moreover, the baseline hazard function Formula can be estimated by the Breslow estimator of Formula. Considering the Nelson–Aalen estimator for the marginal hazard function of Formula, i.e. Formula, a natural estimator of the attributable fraction function in (2.2) is thus

Formula

Alternatively, denote Formula. As derived in Xu and O'Quigley (2000)Go, when Formula is independent of Formula and Formula, Formula values are the conditional probabilities of the subjects observed to fail at t given that one of the at-risk subjects would fail at the same time. Therefore, the conditional distribution function of Z given Formula, Formula, can be consistently estimated by Formula This fact leads to the same estimator of Formula

To make inferences based on the proposed Formula, we develop its asymptotic properties as shown in the Appendix. Under the specified regularity conditions, Formula is uniformly consistent for Formula for Formula, i.e. Formula In addition, Formula converges weakly to a zero-mean Gaussian process. Its covariance function of Formula, Formula, is consistently estimated by Formula, where Formula is

Formula

and Formula, respectively. As a result, the variance of Formula is approximately Formula, and the pointwise Formula confidence intervals for Formula can be constructed as

Formula

where Formula is the Formulath percentile of the standard normal distribution.

In addition to the pointwise confidence intervals, it is also of practical interest to consider simultaneous Formulath percentile confidence bands, Formula and Formula, say, such that Formula Due to the fact that there is no independent increment structure in the limiting process of Formula, it is not straightforward to be transformed into the standard Brownian bridge in direct confidence bands calculation. To find appropriate confidence bands, however, the simulation approach in Lin et al. (1994)Go can be adapted for ease of implementation. Specifically, consider n iid standard normal deviates Formula in

Formula

For any set of finite number of time points Formula, Formula, the conditional limiting distribution of Formula given the observed Formula is the same as the unconditional distribution of Formula. As a result, Formula and Formula have the same limiting distribution by the tightness of Formula (Lin et al., 1994Go). Therefore, Formulath percentile simultaneous confidence bands can be constructed as Formula, where Formula is computed such that

Formula


    4. NUMERICAL STUDIES
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 

4.1 Simulations

Simulations are conducted to evaluate the validity and performance of the proposed estimator of Formula in Section 3. In addition to assuming that the baseline hazard functions are constant of 0.01 and 1.00, respectively, time-to-events are generated according to the proportional hazards model (1.1) with Formula and Formula, respectively. Sample sizes are selected to be 200 and 500, respectively. Each subject's binary exposure indicator is generated according to the Bernoulli trial with the exposure probability of 25% and 50%, respectively. Censoring times are generated to yield about 30% and 10% of censored observations. The estimators and their associated variances are calculated at the 75 percentile and median of the marginal survival distribution, Formula and Formula, respectively.

Simulation results are listed in Table 1. For each entry in the table, 1000 simulated data sets are generated to calculate the bias and 95% nominal coverage probability. Here the bias is the difference between the average of the 1000 estimates and the true attributable fraction, and the 95% nominal coverage probability is the percentage of 1000 95% confidence intervals containing the true attributable fraction. As shown in the table, the proposed estimators are virtually unbiased and their confidence intervals maintain the desired coverage probabilities. In addition, the sample standard errors of each 1000 Formula and the average of 1000 Formula are calculated, respectively. It is shown that they are close to each other, which suggests the accuracy of the calculated variance.


View this table:
[in this window]
[in a new window]

 
Table 1 Summary of Simulation Studies under the proportional hazards model Formula

 
4.2 Application to the MACS data

We apply the proposed attributable risk functions to the publicly released portion of the MACS data set (http://www.statepi.jhsph.edu/macs/pdt.html). The MACS is an ongoing prospective study of the natural and treatment histories of HIV-1 infection in homosexual and bisexual men in four U.S. cities of Baltimore, Maryland; Chicago, Illinois; Pittsburgh, Pennsylvania and Los Angeles, California since 1984 (Kaslow et al., 1987Go). The full MACS cohort consists of 5622 HIV-1 seropositive and seronegative participants recruited between 1984 and 1985. They are followed up at six-month intervals. In our application, we use a subset of 3341 participants of the original cohort who were in definitive HIV-1 infection-free status at the initial enrolment and followed through 1999. The mean age of the selected participants is 33.5 years (SE=7.7). Among them, 88.0% are white, 10.7% are black, and 1.3% are other races, with about 85% of the participants having more than high school education. By the end of 1999, a total of 508 cases of seroconversions are identified in the data set.

The primary time-to-event outcome of interest is the time to HIV-1 seroconversion since the study enrolment. Due to the follow-up scheme of six-month visit intervals, the exact dates of seroconversion are often unknown. We thus followed the convention in Kingsley et al. (1991)Go, Detels et al. (1998)Go, and Silverberg et al. (2004)Go to calculate the proxy seroconversion date at one-third of the time between the last HIV-1 seronegative study visit and the first HIV-1 seropositive study visit. Censored observations are mainly due to loss-to-followup, HIV-1-unrelated death or the data inclusion cutoff of December 31, 1999.

In addition to the aforementioned two risk factors of needle-sharing practice and having sex with an AIDS partner, several other prominent risk factors are also examined, i.e. unprotected receptive/insertive anal sex and using cocaine. In Figure 3, survival functions of time-to-HIV-seroconversion are plotted for each additional risk factor, respectively. For the same risk factor of anal sex, the plot shows different patterns between being receptive or being insertive: the receptive participants are associated with higher risk of HIV-seroconversion, while the insertive participants do not show strong association. In fact, the estimates of regression parameter in the proportional hazards model show that being unprotected receptive is significantly associated with 72.8% more of hazard but being unprotected insertive is not significantly associated with higher hazard. For the risk factor of using cocaine, the hazard is more than doubled for those using cocaine, which also shows higher risk in the plot of its survival functions.


Figure 3
View larger version (16K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 3 Kaplan–Meier estimates for time-to-HIV-seroconversion by risk factors of (a) unprotected anal sex (receptive); (b) unprotected anal sex (insertive); (c) using cocaine.

 
To assess the attributable risk of HIV-seroconversion due to these risk factors, we first calculate their usual Formula as defined in (1.2). The results are shown in Table 2, along with other summary statistics and the proportional hazards model estimates. Among the five selected risk factors, using cocaine attributes the most to the population risk by 28.1% given its greatest magnitude of association and high prevalence. Having sex with an AIDS partner and unprotected receptive anal sex attributes similarly to the population risk by 24.1% and 23.5%, respectively, given their similar magnitude of association and prevalence. Although unprotected insertive anal sex is the most prevalent risk factor, its attribution to the population risk is by a small fraction of 1.1%. The attribution is even smaller for needle sharing, regardless of its high association, due to merely 43 (out of 3293) participants using shared needles. We also examine the potential interaction between needle sharing and having sex with an AIDS partner. For those engaged in both of the risk factors, their incidence rate is 31.6%. Because of the low prevalence in needle sharing, however, they consist only 1.2% of the total MACS cohort and the associated attributable risk is about 1.3%. According to Miettinen (1974)Go, this may mean a negative interaction, since Formula. More discussion on judging the interaction can be found in Walter (1976)Go. Nevertheless, in a group similar to the MACS cohort, when an HIV-prevention program is planned to reduce the preventable HIV-transmission, top priorities should be given to the effort of reducing cocaine use, having sex with an AIDS partner and unprotected receptive anal sex.


View this table:
[in this window]
[in a new window]

 
Table 2 Summary statistics and estimates of the proportional hazards model Formula in MACS

 
We further calculate and plot the proposed attributable risk function for each of the five risk factors in Figure 4, respectively. As shown in the figure, all the attributable risk functions do not appear constant over time. For having sex with an AIDS partner, unprotected receptive anal sex and using cocaine, their attributable risk functions change from 26%, 28.3%, and 32.3% to 25.4%, 27.5%, and 30.5%, respectively, over a 15-year period. Although they are generally decreasing, their attributions to the population risk are little changed. They remain the highly attributable risk factors for the HIV seroconversion. An effective prevention program targeting these prominent attributable risk factors should be designed and implemented to reduce the population risk. For needle sharing, although its attribution to the population risk changed about 10% over time, its attribution may not be always decreasing over time. Unprotected insertive anal sex also does not show consistent pattern of attribution to the population risk. Although unprotected insertive anal sex starts relative low attribution, it may arise as time progresses.


Figure 4
View larger version (20K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 4 Estimated attributable risk functions of Formula in the proportional hazards model Formula for the MACS by risk factors (a) Formula: needle sharing; (b) Formula: having sex with an AIDS partner; (c) Formula: unprotected anal sex (receptive); (d) Formula: unprotected anal sex (insertive); (e) Formula: using cocaine.

 
For these model-based attributable risk functions, we use the risk factors measured at the baseline and assume constant regression coefficients in the proportional hazards models. Therefore, according to (2.2), the overall time-varying pattern in the attributable risk functions should be determined by Formula, i.e. the time-varying proportion of the risk factor Z in the MACS cohort. In fact, as the time progresses, those already seroconverted prior to t shall not be considered in Formula. The cohort thus tends to be ‘healthier’, which may explain the generally downward pattern of the attributable risk functions.


    5. DISCUSSION
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 
The attributable risk function is introduced in this article to capture the time-varying contribution of both the relative risk and the prevalence of risk factors to a disease progression. It provides a prospective profile of the population attributable risk in time. It can be also used in planning and prioritizing a prevention program. For example, in promoting a community vaccination program, the attributable risk function can be calculated to measure the timely impact on the preventable infections due to the vaccine.

It needs to point out that the attributable risk function in this article is estimated under the proportional hazards model. We conveniently chose Formula for the main development in this article. To rely less on the assumptions of the proportional hazards model, we shall use Formula expressed in Formulas and their nonparametric estimates, which are also more meaningful in terms of the absolute risks. Since Formula by Taylor expansion, the high-order terms of Formula and Formula have additional influence on the exact difference of Formula. Technical investigation can be conducted to evaluate this difference more generally. In addition, we adopted the stronger version of independence assumption in Xu and O'Quigley (2000)Go to estimate Formula. When the risk factors are mostly categorical with a finite number of categories, such as binary in this article, this assumption can be replaced by the usual weaker version of the conditional independence assumption, as seen in Murray and Tsiatis (1996)Go.

Nevertheless, in this article, we establish a framework of the attributable risk function for the time-to-event outcomes with mostly single binary risk factors. It needs to be further extended to the risk factors of the more general types, either multilevel categorical or continuous. When the possible confounding variables, such as age, gender, and socioeconomic status, are collected with the risk factors, an adjusted attributable risk function is also desirable, as argued in Benichou (2001)Go. To avoid unnecessary technical presentation, we refer to a University of Washington Biostatistics Technical Report (http://www.bepress.com/uwbiostat/paper254/) for more details on some of the proposed extensions.


    APPENDIX
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 

Asymptotic properties of Formula

Let Formula and Formula, Formula. To establish the asymptotic properties of Formula, we assume the necessary regularity conditions specified in Theorem 4.1 of Andersen and Gill (1982)Go:

  1. Formula is continuous, nondecreasing, and Formula;
  2. There exists a compact neighborhood Formula of Formula such that

    Formula


  3. Formula;
  4. Formula is positive definite, where Formula.

First, we decompose the estimator Formula into

Formula

By Taylor's theorem, Formula equals

Formula

where Formula is on the line segment connecting Formula and Formula. Following Theorem 4.1 and Corollary III.2 of Andersen and Gill (1982)Go, we know that Formula and Formula are continuous in Formula. In addition, Formula is bounded away from zero on Formula. Since Formula and Formula for Formula, respectively,

Formula

uniformly for Formula. Due to the consistency of Formula for Formula, we know Formula uniformly on Formula. Also,

Formula (A.1)

It thus follows that Formula uniformly for Formula.

To prove the uniform consistency of Formula, we only need to show Formula. Under the proportional hazards model (1.1), Formula equals

Formula

Therefore, Formula. On the other hand, Formula equals

Formula

Hence, Formula. Under the assumption that C is independent of Formula, Formula. Therefore, the uniform consistency of Formula holds.

To prove the asymptotic normality, Formula can be written as

Formula

By the expression of Formula in (A.1) and the martingale representation of Formula, it further equals

Formula

where Formula. Thus, Formula. Here,

Formula

Since Formula and Formula are both monotonic processes, Formula converges weakly to a zero-mean Gaussian process with covariance function Formula, as shown in the Example 2.11.16 of van der Vaart and Wellner (1996)Go.

In order to prove that Formula is consistently estimated by Formula, it suffices to show by Cauchy–Schwarz inequality that

Formula

These can be established by the consistencies of Formula, the uniform consistency of Formula, and Formula, Formula, respectively, following Lemma 1 of Lin et al. (2000)Go.


    ACKNOWLEDGMENTS
 
The authors would like to thank Professor Scott Zeger, an Associate Editor and Dr. Ross Prentice for their constructive comments that lead to great improvement of the original manuscript. Conflict of Interest: None declared.


    REFERENCES
 TOP
 SUMMARY
 1. INTRODUCTION
 2. ATTRIBUTABLE RISK FUNCTIONS
 3. ESTIMATION AND INFERENCES
 4. NUMERICAL STUDIES
 5. DISCUSSION
 APPENDIX
 REFERENCES
 

    Andersen PK and Gill RD. (1982) Cox's regression model for counting processes: a large sample study. Annals of Statistics 4:1100–1120.

    Basu S and Landis JR. (1993) Model-based estimation of population attributable risk under cross-sectional sampling. American Journal of Epidemiology 142:1338–1343.

    Benichou J. (2000) Attributable risk. In Gail MH and Benichou J (Eds.). Encyclopedia of Epidemiologic Methods(Wiley, Chichester) pp. 50–63.

    Benichou J. (2001) A review of adjusted estimates of attributable risk. Statistical Methods in Medical Research 10:195–216.[Abstract/Free Full Text]

    Cox DR. (1972) Regression models and life-tables (with discussion). Journal of Royal Statistical Society, Series B 34:187–220.

    Detels R, Munoz A, McFarlane G, Kingsley LA, Margolick JB, Giorgi J, Scharager LD, Phair JP. (1998) Effectiveness of potent antiretroviral therapy on time to AIDS and death in men with known HIV infection duration. Journal of American Medical Association 280:1497–1503.[Abstract/Free Full Text]

    Drescher K and Becher H. (1997) Estimating the generalized impact fraction from case-control data. Biometrics 53:1170–1176.[CrossRef][ISI][Medline]

    Drescher K and Schill W. (1991) Attributable risk estimation from case-control data via logistic regression. Biometrics 47:1247–1256.[CrossRef][ISI][Medline]

    Gefeller O. (1992) An annotated bibliography on attributable risk. Biometrical Journal 34:1007–1012.

    Graubard BI and Fears T. (2005) Standard errors for attributable risk for simple and complex sample designs. Biometrics 61:847–855.[CrossRef][ISI][Medline]

    Greenland S. (2001) Estimation of population attributable fractions from fitted incidence ratios and exposure survey data, with an application to electromagnetic fields and childhood leukemia. Biometrics 57:182–188.[CrossRef][ISI][Medline]

    Greenland S and Robins JM. (1988) Conceptual problems in the definition and interpretation of attributable fractions. American Journal of Epidemiology 128:1185–1197.[Free Full Text]

    Kaslow RA, Ostrow DG, Detels R, Phair JP, Polk BF, Rinaldo CR. (1987) The multicenter AIDS cohort study: rationale, organization, and selected characteristics of the participants. American Journal of Epidemiology 126:310–318.[ISI][Medline]

    Kingsley LA, Zhou SY, Bacellar H, Rinaldo CR, Chmiel J, Detels R, Saah A, Vanraden M, Ho M, Munoz A. (1991) Temporal trends in human immunodeficiency virus type 1 seroconversion 1984–1989: a report from the Multicenter AIDS Cohort Study (MACS). American Journal of Epidemiology 134:331–339.[Abstract/Free Full Text]

    Levin ML. (1953) The occurrence of lung cancer in man. ACTA Unio Internationalis Contra Cancrum 9:531–541.

    Lin DY, Fleming TR, Wei LJ. (1994) Confidence bands for survival curves under the proportional hazards model. Biometrika 81:73–81.[Abstract/Free Full Text]

    Lin DY, Wei LJ, Yang I, Ying Z. (2000) Semiparametric regression for the mean and rate functions of recurrent events. Journal of Royal Statistical Society, Series B 62:711–730.[CrossRef]

    Miettinen OS. (1974) Proportion of disease caused or prevented by a given exposure, trait or intervention. American Journal of Epidemiology 99:325–332.[Abstract/Free Full Text]

    Murray S and Tsiatis AA. (1996) Nonparametric survival estimation. Biometrics 52:137–151.[CrossRef][ISI][Medline]

    Silverberg MJ, Smith MW, Chmiel JS, Detels R, Margolick JB, Rinaldo CR, O'Brien SJ, Munoz A. (2004) Fraction of cases of acquired immunodeficiency syndrome prevented by the interactions of identified restriction gene variants. American Journal of Epidemiology 159:232–241.[Abstract/Free Full Text]

    Uter W and Pfahlberg A. (2001) The application of methods to quantify attributable risk in medical practice. Statistical Methods in Medical Research 10:231–237.[Abstract/Free Full Text]

    van der Vaart AW and Wellner JA. (1996) Weak Convergence and Empirical Processes(Springer, New York).

    Walter SD. (1976) The estimation and interpretation of attributable risk in health research. Biometrics 32:829–849.[CrossRef][ISI][Medline]

    Xu RH and O'Quigley J. (2000) Proportional hazards estimate of the conditional survival function. Journal of the Royal Statistical Society, Series B 62:667–680.[CrossRef]

    Received August 25, 2005; revised December 5, 2005; revised February 2, 2006; accepted for publication February 9, 2006.


    Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



    This Article
    Right arrow Abstract Freely available
    Right arrow FREE Full Text (PDF) Freely available
    Right arrow All Versions of this Article:
    7/4/515    most recent
    kxj023v1
    Right arrow Alert me when this article is cited
    Right arrow Alert me if a correction is posted
    Services
    Right arrow Email this article to a friend
    Right arrow Similar articles in this journal
    Right arrow Similar articles in PubMed
    Right arrow Alert me to new issues of the journal
    Right arrow Add to My Personal Archive
    Right arrow Download to citation manager
    Right arrowRequest Permissions
    Right arrow Disclaimer
    Google Scholar
    Right arrow Articles by Chen, Y. Q.
    Right arrow Articles by Wang, Y.
    Right arrow Search for Related Content
    PubMed
    Right arrow PubMed Citation
    Right arrow Articles by Chen, Y. Q.
    Right arrow Articles by Wang, Y.
    Social Bookmarking
     Add to CiteULike   Add to Connotea   Add to Del.icio.us  
    What's this?