Skip Navigation


Biostatistics Advance Access originally published online on May 25, 2005
Biostatistics 2005 6(4):615-632; doi:10.1093/biostatistics/kxi032
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
6/4/615    most recent
kxi032v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Johnson, T. D.
Right arrow Articles by Eisbruch, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Johnson, T. D.
Right arrow Articles by Eisbruch, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oupjournals.org.

A Bayesian mixture model relating dose to critical organs and functional complication in 3D conformal radiation therapy

Timothy D. Johnson*

Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA tdjtdj{at}umich.edu

Jeremy M. G. Taylor

Department of Biostatistics, School of Public Health and Department of Radiation Oncology, School of Medicine, University of Michigan, Ann Arbor, MI 48109, USA

Randall K. Ten Haken and Avraham Eisbruch

Department of Radiation Oncology, School of Medicine, University of Michigan, Ann Arbor, MI 48109, USA

* To whom correspondence should be addressed.


    SUMMARY
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
A goal of cancer radiation therapy is to deliver maximum dose to the target tumor while minimizing complications due to irradiation of critical organs. Technological advances in 3D conformal radiation therapy has allowed great strides in realizing this goal; however, complications may still arise. Critical organs may be adjacent to tumors or in the path of the radiation beam. Several mathematical models have been proposed that describe the relationship between dose and observed functional complication; however, only a few published studies have successfully fit these models to data using modern statistical methods which make efficient use of the data. One complication following radiation therapy of head and neck cancers is the patient's inability to produce saliva. Xerostomia (dry mouth) leads to high susceptibility to oral infection and dental caries and is, in general, unpleasant and an annoyance. We present a dose–damage–injury model that subsumes any of the various mathematical models relating dose to damage. The model is a nonlinear, longitudinal mixed effects model where the outcome (saliva flow rate) is modeled as a mixture of a Dirac measure at zero and a gamma distribution whose mean is a function of time and dose. Bayesian methods are used to estimate the relationship between dose delivered to the parotid glands and the observational outcome—saliva flow rate. A summary measure of the dose–damage relationship is modeled and assessed by a Bayesian {chi}2 test for goodness-of-fit.

Keywords: Bayesian analysis; 3D conformal radiation therapy; Dirac measure; MCMC; Mixture model; Goodness-of-fit; Radiotherapy


    1. INTRODUCTION
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
Three-dimensional (3D) conformal radiation therapy (Lichter, 1991Go) is a general term used to describe the planning and delivery of radiation where the dose at every point in a 3D target field is calculated and the alignment and position of the radiation source is chosen to optimize the dose distribution, within the target field, according to certain criteria. The 3D conformal radiation therapy has enabled radiation oncologists to deliver higher, more accurate, and homogeneous doses of radiation to a defined region of cancerous tissue. Despite these advances, adjacent normal tissue and organs may also receive high doses of radiation resulting in impaired functioning of those organs. Furthermore, the dose distribution to healthy tissue/organs can be highly heterogeneous. As is the case with advances in the technology of many medical devices, these advances have preceded the quantitative understanding and statistical evaluation of the relationship between the dose to healthy tissue/organs and the observed functional complications due to the damage caused by ionizing radiation.

Radiation therapy is given in a series of daily small fractionated doses (typical dose fraction sizes are 1.5–3 Gray (Gy), with the most common values being 1.8 and 2.0 Gy) delivered to the target 5 days per week over approximately a 6-week period. The total dose delivered to the tumor is typically on the order of 60–75 Gy and should be homogeneous, in order to avoid ‘cold spots’ from which the tumor could regenerate. The fractionated dose distribution may differ from one day to the next due to designed field size changes and technique changes. However, in practice only one or two changes are made during the course of treatment. The 3D treatment-planning software calculates the dose, at every fraction, at every voxel (volume element) in the path of the radiation beam based on volumetric data sets typically obtained by computed tomography (CT). The goal of the treatment plan is two-fold. First, to deliver the prescribed overall dose of radiation homogeneously to the target (typically a malignant tumor). Second, to minimize the dose delivered to the surrounding healthy tissues and organs and to those in the path of the radiation beam. The development of 3D conformal radiation therapy treatment has allowed radiation oncologists to meet these goals to a larger extent than previously possible.

The dose distribution to a specified region is typically summarized by the dose volume histogram (DVH) based on the total delivered dose to that region. The DVH is analogous to the survivor function—it is one minus the distribution function of the total dose over the region of interest. Throughout this paper, the dose distribution function to the region of interest (in our case an healthy organ) will be denoted by F.

The model we consider in this article can be broken down into two parts. The first part relates the DVH, or equivalently F, to the damage suffered by a specific healthy organ. The damage to the organ is unobserved. The second part of the model relates damage to injury incurred by the organ. Injury is the observed outcome. Several biomathematical models relating dose to damage incurred have been proposed, e.g. Chappell et al. (1995)Go, Dawson et al. (2002)Go, Deasy et al. (2002)Go, Jackson et al. (1995)Go, Lyman (1985)Go and Seppenwoolde et al. (2003)Go. However, few of these have been fit to data using modern statistical methodology, as pointed out by Shultheiss et al. (1983)Go. Some analyses of data from 3D conformal therapy have been published (Boersma et al., 1998Go; Chao et al., 2001Go; Jackson et al., 1993Go, 1995Go; Kwa et al., 1998Go; Roesink et al., 2001Go; Scrimger et al., 2004Go; Theuws et al., 1998Go). In these papers, the authors have attempted to estimate the relationship between the DVH and complications. (However, the analyses, for the most part, have not been very sophisticated from a statistical point of view.) Although the current usage of statistical methods in the field of 3D conformal radiation therapy is limited, there have been some improvements in recent years. Maximum likelihood estimation has been used (Roberts and Hendry, 1993Go). Profile likelihood confidence intervals have been advocated (Schilstra and Meertens, 2001Go). The use of simulation to assess goodness-of-fit has been suggested (Stavrev and Stavrev, 2000Go). The bootstrap has been used to obtain confidence intervals (Chao et al., 2001Go), and generalized linear models/generalized estimating equations have been employed (Eisbruch et al., 1999Go). Shultheiss (2001)Go lists a number of problem areas in modeling normal tissue radiation injury. The list includes (i) too few data for analysis or too few events, (ii) assigning values to parameters that should be estimated, (iii) using inappropriate statistical methods, ad hoc methods, and univariate analysis, and (iv) ignoring model validation.

In this article, we address the aforementioned issues. We use an empirical measure of damage and concentrate on modeling the relationship between damage and injury to critical organs, the parotid glands (the major salivary producing glands), that occurs in head and neck cancer patients. The data come from an ongoing study being conducted in the Department of Radiation Oncology at the University of Michigan's Comprehensive Cancer Center. This article lays the foundation for future work where we will use the proposed model to compare, and develop, different biomathematical models that relate dose to damage.

The paper is organized as follows. The data set is described in Section 2, while our model is presented in Section 3. Details of the Markov chain Monte Carlo (MCMC) estimation procedure are detailed in Section 4. Results of the data analysis are given in Section 5 along with an analysis of the goodness-of-fit of the model. In Section 6, we propose an analysis using off-the-shelf software and argue that our approach is necessary to capture all the structure in the data. We conclude with a discussion of future work in Section 7.


    2. THE PAROTID DATA SET
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
The data set is comprised of 87 head and neck cancer patients. The outcome of interest is saliva flow rate from the parotid glands that are located in the cheek just in front of each ear. Saliva flow rate (ml/min) was obtained by placing a suction cup over the secretory duct of the parotid gland (Heft and Baum, 1984Go). Saliva was collected for 2 min for each of the two parotid glands under two stimulatory conditions: normal (unstimulated) and stimulated. Saliva flow rate was then estimated by taking the volume collected (ml) and dividing by time (2 min). The parotid gland was stimulated by citric acid (lemon juice) placed on the tongue. Subjects entered the study prior to initiation of radiation therapy, typically 1–3 weeks prior to therapy (month 0). The subjects were also scheduled to return for saliva flow rate measurements at 1, 3, 6, 12, 18, and 24 months after termination of radiation therapy. We note here that many of the saliva measurements at each postradiation follow-up are zero (32.5% of the observations are 0). CT treatment-planning software was used to develop the treatment plan for each subject. Each parotid gland from each subject was outlined on each slice of the CT image by a radiation oncologist and the treatment-planning software computed the DVH (based on total dose given during radiation therapy) for each of the parotid glands. Examples of eight DVHs from four patients are shown in Figure 1. The DVH represents the proportion of parotid gland that receives a certain dose or greater. The DVH is important because higher doses of radiation are more lethal than lower doses and the DVH is an estimate of the total dose distribution received by, in this case, the parotid glands. One wishes to create a radiation treatment plan that maximized the dose to the tumor while sparing or minimizing the dose to surrounding, healthy tissues and organs. The parotid gland that receives higher doses, typically due to proximity of the tumor, is called the ipsilateral parotid, while the opposite parotid gland is called the contralateral parotid. Thus, it is typical that the dose distribution function (F = 1 – DVH) of the contralateral parotid is stochastically smaller than that of the ipsilateral parotid gland. Sixty-three of the 87 subjects have DVHs for both parotids. The other 24 subjects had one of their parotid glands surgically removed due to disease involvement and thus have a DVH for only one parotid (the contralateral side). As is the case with most, if not all, longitudinal studies, there are missing observations. In this study, 23% of the data are missing. We assume that the data are missing completely at random (MCAR) (Little and Rubin, 2002Go). We will have more to say about this assumption in Section 4. Percentages of missing data at each time point are given in Table 1.



View larger version (21K):
[in this window]
[in a new window]
 
Fig. 1. Eight examples of the DVH. The dashed lines are DVHs from the contralateral parotid, and the solid lines are from the ipsilateral parotid. The DVH is analogous to the survivor function of a distribution F. It represents the proportion of tissue (in this case, the parotid gland) that received a given dose or greater. For example, about 50% of the ipsilateral parotid in panel C) received greater than 41.5 Gy. These examples were chosen to show the heterogeneity of the DVHs across patients.

 

View this table:
[in this window]
[in a new window]
 
Table 1. Percentages of missing data by month

 
We found from the goodness-of-fit summary (Section 5.1) that a transformation of the data was necessary. The goodness-of-fit (GOF) summary suggested that there were too many larger than expected observed values and that a transformation of the observations was necessary. The log transformation is a common function used to transform strictly positive, skewed data to more symmetrically distributed observations. However, these data have many observations at 0 and we felt it was desirable to treat all observations equally (that is, transform all observations). Further, a log transformation of only the nonzero observations skewed the data in the opposite direction because nonzero saliva flow rates were measured as low as 0.005 ml/min. Therefore, since all but two observations were less than 2 ml/min, we transformed the data by squaring the observed values divided by 2. That is, if y is an observation, we took y* = (y/2)2. This transformation was sufficient to achieve a satisfactory GOF summary (see Section 5.1).


    3. THE DOSEDAMAGEINJURY MODEL
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
The dose–damage–injury model is a radiobiologically based model that is central to the field of radiation therapy and that is amenable to statistical data analysis (Jackson et al., 1993, 1995Go; Niemierko and Goitein, 1992Go; York et al., 1993Go). Conceptually, the model is based on the premise that radiation causes damage to tissues/organs at the cellular level that, for the most part, is unobservable. A consequence of this damage is injury, that is observable. The radiation dose distribution is known and well characterized by 3D treatment-planning software. Damage can be conceptualized as the surviving fraction of critical cells, or functional subunits. More generally, damage can be thought of as a continuous single number summary, a latent variable, that is solely responsible for determining the clinically observed outcome. We assume that damage is monotonically related to dose in the sense that a uniform increase in dose results in an increase in damage.

We present a dose–damage–injury model and focus on the relationship between damage and injury. We take a Bayesian approach to estimating model parameters. This part of the model assumes that the injury incurred by the parotid gland is measured by saliva flow rate. Saliva flow rate is assumed to be a smooth, monotone decreasing function of damage. Random effects for model parameters are used to account for overdispersion. Before introducing this part of the model, we first discuss the dose–damage relationship.

3.1 Dose–damage relation

As mentioned in Section 1, several biomathematical models that relate dose to damage have been proposed. One of the main goals of this ongoing study is to evaluate the appropriateness of these models to these data, modify these models if necessary and develop new models that are appropriate for these data. However, in this article our focus is on the relationship between damage and injury and not on validating the previously proposed dose–damage models.

Nevertheless, the first part of our model relates dose (observed) to damage (unobserved). In general, damage can be written as a functional, T(F), of the dose distribution, F = 1 – DVH, to the parotid and is typically a value between 0 and 1, where 0 represents no damage and 1 represents maximal damage. The biomathematical models mentioned are of this type. More empirically, we might think of damage as a scalar summary of F, such as a functional that maps F to the real line. Two examples of such a functional are the generalized mean: T(F) = [{int}x{lambda} dF(x)]1/{lambda} and the inverse cumulative distribution function (inverse cdf): T(F) = F–1(p). Although in these cases, ‘damage’ is some scalar summary of F, it has the same unit as dose, Gy, and it does not lie in [0, 1].

We take an empirical approach to estimate the damage from the dose distribution. In particular, we use the inverse cdf,

to specify the relationship between dose and damage. Formally, d is the dose at which 100p percentage of the parotid received dose d or less . We take d to be a measure of damage, thus d can take values from 0 to the maximum possible dose in this study, 81.5 Gy. We estimate the posterior distribution of p from the data. A relatively large estimate of p suggests that the dose to a majority of the parotid gland is important in assessing overall damage. Whereas, a relatively small estimate of p suggests that the dose to only a small portion of the parotid gland is necessary. This could have implications as to whether an attempt should be made to reduce the overall dose distribution to the parotid gland, or to attempt to spare a portion of the gland and not worry about the rest. We assign the quantile p, which we assume to be the same for all subjects, a U[0, 1] (uniform) prior distribution.

3.2 Damage–Injury Relation

The second part of our model relates damage to the injury incurred by the parotid gland. Specifically, the (transformed) expected saliva flow rate is modeled as a smooth, monotone decreasing parametric function of damage. In our particular application, saliva flow rate is used to determine the extent of injury to the parotid glands. This part of the model is a nonlinear mixture model with both fixed and random effects. As mentioned in Section 2, there are numerous outcome measures of zero. To account for these zero saliva flow rates, we model saliva flow rate at a given level of damage and at each time, as a mixture of a Dirac measure (point mass) at zero and a gamma distribution. This type of mixture model (a point mass at zero mixed with a standard distribution function) is reminiscent of Lambert's (1992)Go zero-inflated Poisson regression model. The parotid glands are indexed by k (=0 for ipsilateral and =1 for contralateral), time is indexed by j, and the stimulation condition is indexed by {ell} (=0 for normal and =1 for stimulated). Subjects are indexed by i. dik will denote the damage incurred by parotid k of subject i.

Let Zijk{ell} be a latent allocation variable taking values in {0, 1} such that p(Zijk{ell} = 1) = {pi}j{ell}(dik). Let, N(µ, {sigma}2) denote the normal distribution with mean µ and variance {sigma}2 and let G(µ, {nu}) denote the gamma distribution with mean µ and shape parameter {nu}. Thus, if X ~ G(µ, {nu}) the density of X is given by ({nu}/µ){nu}x{nu}–1 e{nu}x/µ/{Gamma}({nu}). Then saliva flow rate Yijk{ell} conditional on the allocation variable, as well as all other model parameters, has the following distribution:

Marginalizing over Zijk{ell}, Yijk{ell} is a mixture of a point mass at zero and a gamma random variable:

We model the mixing parameters, {{pi}j{ell}}, as smooth, monotone decreasing functions of damage allowing for the possibility that the mixing proportion at a damage of zero is less than one ({pi}j{ell}(0) < 1). This allows subjects to have zero saliva flow at baseline, prior to radiation therapy, due to xerostomia (dry mouth). {pi}j{ell}(dik) can also be interpreted as the probability of nonzero saliva flow rate under stimulation condition {ell} at time j for a subject that has incurred damage level dik to parotid k . In particular, we model the logit of {pi}j{ell} as a linear function of damage:

(3.1)

The intercept, {theta}{ell} > 0, is indexed by stimulation condition and the slope, {phi}j > 0, is indexed by time. Indexing the intercept by stimulation condition allows for differing baseline proportions of subjects with no saliva flow between the stimulated and normal conditions. Indexing the slope by time allows for the probability of nonzero saliva flow to change over time at a given level of damage, thus allowing for the possibility that the parotid recovers from damage over time. {theta}{ell} and {phi}j are assigned vague, proper exponential priors with rates 0.01 ({varepsilon}(0.01)). At baseline, prior to radiation therapy, we set {phi}0{equiv}0 since no radiation has been given and no damage has occurred due to radiation exposure. Therefore, we estimate the posterior distribution of {phi}j only at the six posttreatment times j=1, 2, 3, 4, 5, 6 corresponding to 1, 3, 6, 12, 18, and 24 months postradiation therapy, respectively.

When Z = 1, the conditional expected outcome, µj{ell}(dik), is restricted to be a smooth, monotone decreasing, parametrized function of the damage to the parotid: µj{ell}: [0, {infty}) -> [0, {infty}). Let {alpha}ik{ell}, ßij{ell}, {gamma}ik{ell} > 0. We use a scaled version (scaled by {gamma}ik{ell}) of the incomplete gamma function


{biostskxi032fx1_ht}

(3.2)

(Here we have taken a bit of notational liberty. Technically, we should have shown the conditional dependence of µj{ell} on the parameters {alpha}ijk{ell}, ßij{ell}, and {gamma}ik{ell}. Henceforth, this conditional dependence will be implicit.) µj{ell} achieves its conditional maximum, {gamma}ik{ell}, at dik = 0 and has an asymptote of zero as dik {uparrow} {infty}. We chose this function because of its flexible shape. It takes three parameters: {gamma}, which scales the range and {alpha}, ß which together control location and shape. It has a sigmoid shape and inflection point at {alpha}({alpha} – 1)/ß for {alpha} > 1 and is exponential when {alpha} = 1. When 0 < {alpha} < 1, the function approaches it's asymptote of zero at a rate faster than exponential. One could also choose the survivor function of any distribution with nonnegative support in lieu of the incomplete gamma function, for example, the Weibull or lognormal distribution.

We model the priors for the parameters of the mean function, µj{ell}, in a hierarchical fashion, thus accounting for some of the correlation within subjects across parotid glands, stimulation condition and time. In particular, let {gamma}i = ({gamma}i00, {gamma}i01, {gamma}i10, {gamma}i11)T, for all i. Further, let Wd(S) denote the Wishart distribution with d degrees of freedom and symmetric, positive definite scale matrix S for which the dimension will be obvious. To satisfy the positivity constraint on {gamma}i, we use a lognormal prior:

where g = ln(0.1, 0.1, 0.1, 0.1)T, V = 100I4, R = 0.01I4, and Id is the d x d identity matrix. This gives a rather vague, but proper prior on both the random effects and their mean. To account for overdispersion, random parotid and stimulation effects within each subject, {gamma}i, are centered about the population mean, {gamma}, with covariance {Phi}. {Phi} accounts for correlation within subjects between and across parotids and stimulation conditions at baseline. A priori, we assume that the components of the random effects vector are uncorrelated, as well as the components of the population mean vector.

Lognormal priors are also placed on the other two parameters of the mean function. Specifically, let {alpha}ij = ({alpha}ij00, {alpha}ij01, {alpha}ij10, {alpha}ij11)T, for all i and j. Let ßi{ell}=(ßi1{ell},..., ßi6{ell})T for all i, {ell}. Then

(3.3)

where a = ln(3, 3, 3, 3)T, U = I4, and T = 2I4; and

(3.4)

where h = (0.3, , 0.3)T, W = I6, and S = 2I6.

{Theta} accounts for posttreatment correlation between stimulation conditions and parotid glands. We assume that this correlation is constant over time. {Psi} accounts for correlation over time within a subject. We assume that this correlation is the same for both normal and stimulated conditions. We note here that the prior specifications for the mean of the random effects, the prior distributions in (eqn:alpha) and (eqn:beta) are somewhat informative. These priors were chosen, however, in combination to give a broad range of shapes (Figure 2) for the prior mean function µj{ell} which is of particular interest to this study. Thus, we believe that this prior specification does not overly influence the posterior distribution of the mean function. Finally, the shape parameters, {nu}j{ell}, are given an exponential prior distribution with rate 0.1. This completes our model specification.



View larger version (37K):
[in this window]
[in a new window]
 
Fig. 2. Twenty random realizations of the unscaled prior mean function, µj{ell}(dik)/{gamma}ik{ell} (eqn:mean), based on 20 random draws from the priors for {alpha} and ß [priors (eqn:alpha) and (eqn:beta)].

 
Accounting for correlation over time within a parotid gland, between parotid glands and across stimulation conditions, along with numerous outcome measures of zero and any potential overdispersion requires one of two solutions: (1) either make simplifying assumptions and ignore some or all potential correlation within the data or (2) fit the data with a necessarily complex model that accounts for the various forms of correlation, numerous outcome measures of zero, and included random effects to help account for overdispersion. The latter choice is preferred from a statistical point of view.


    4. MCMCESTIMATION
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
Because of the complexity of the model, the posterior distribution of the model parameters is not analytically tractable and so we estimate the posterior distribution through MCMC simulation. The MCMC simulation is a hybrid Metropolis–Hastings and Gibbs algorithm. For those parameters whose full conditional posterior distributions are analytically tractable ({Psi}, {Theta}, {Phi}, {gamma}, {alpha}, ß) we perform a Gibbs update and perform a Metropolis–Hastings step for all other parameters. We update parameters in the following order: {theta}{ell}, {ell} = 0, 1 ; {phi}j, j = 1,..., 6; {gamma}ik, i = 1,..., N, k = 0, 1; {Phi}; {gamma}; ßi{ell}, i = 1,..., N, {ell} = 0, 1; {Psi}; ß; {alpha}i, i = 1,..., N; {Theta}; {alpha}; {nu}j{ell}, j = 0, 1,..., 6, {ell} = 0, 1; and p.

A complication arises in estimating the mean function µj{ell}(dik) for a particular parotid gland from a specific subject under a distinct stimulation condition at a given time in that the observation is obtained at only one level of damage, dik. However, we would like to obtain a smooth, monotone decreasing estimate of saliva flow rate as a function of damage incurred. And we would like to have this estimate for each parotid under both conditions. (Estimation of this mean function is of interest insomuch as it is part of the model, it will allow saliva flow predictions to be made and will allow us to compare various dose–damage relationships.) To circumvent this complication, we use the baseline observation as a second point, which corresponds to the observed saliva flow rate at dik = 0. This assumes that saliva flow rate is not a function of time when no radiation has been delivered, which we believe is a reasonable assumption. A third observation can be obtained as a latent variable evaluated at the estimated damage incurred by the opposing parotid gland within the particular patient of interest. This is accomplished by drawing the latent observation from its posterior predictive density. These draws are easily obtained at each step of the MCMC simulation. First draw from a Bernoulli distribution with parameter {pi}j{ell}(dik*), where k* = |k – 1| (i.e. at the damage received by the other parotid gland). Then if set and if draw from a gamma distribution with mean µj{ell}(dik*) and shape parameter {nu}j{ell}. Here, {pi}j{ell}(dik*), µj{ell}(dik*), and {nu}j{ell} are the current draws from the posterior distribution of the parameters. By the hierarchical structure placed on the parameters of the mean function and assuming that the covariance matrix {Theta} is constant over time and {Psi} is the same regardless of stimulation condition, we borrow strength over time, across stimulation conditions and between parotid glands when drawing these latent variables from the aforementioned gamma distribution.

Missing data are treated as MCAR (Little and Rubin, 2002Go). Since this is an ongoing study, missing data can occur because patients have not completed the study, missed appointments, become lost to follow-up, or die. However, complications to the parotid glands are most likely not the cause of death, nor are they associated with death, and so we feel that this assumption is valid. Further, in a related analysis on a subset of these data, Eisbruch et al. (2001)Go found no relationship between xerostomia questionnaire scores and patterns of missing data. Missing observations are also treated as latent variables and the values are drawn from the posterior predictive distribution as above. Drawing the missing data gives us a complete balanced data set which simplifies computation for some of the model parameters. In particular, it allows us to easily draw the precision matrices {Phi}–1, {Theta}–1, and {Psi}–1 from their conjugate full conditional distributions.

We ran the simulation for 250 000 iterations with a burn-in of 50 000 iterations. We assessed convergence graphically and ran the chain from several different initial states. The posterior distribution was estimated by saving every 10th iteration after burn-in for a total of 20 000 draws from the posterior.


    5. DATA ANALYSIS RESULTS
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
Damage, as we have modeled it, is a function of a random parameter p. As such, damage has a posterior distribution and can cause difficulty in interpreting results of the model, especially for radiation oncologists who are interested in, among other things, the probability of nonzero saliva flow at a specified level of damage. That is, at a point estimate of damage. The estimated marginal posterior density of p has a median of 0.341, a 95% Bayesian credible interval (equal tail area) of (0.295, 0.381) and is quite symmetric, with an estimated skewness coefficient of –0.001. Thus, this distribution is fairly tight around its median value and roughly symmetric. Therefore, we reran our simulation, conditional on p = 0.341. Results of the unconditional simulation and the conditional simulation are virtually indistinguishable. Hence, we report results conditional on p = 0.341.

Two summary measures of interest are the probability of nonzero saliva flow at each time point under both conditions, {pi}j{ell}(d), and the marginal population mean saliva flow rate as a function of damage for each condition and parotid gland, µj{ell}(d). The first of these is obtained from (eqn:logit). The second can be obtained by (eqn:mean) evaluated at the median values of {alpha}ik{ell}, ßij{ell}, and {gamma}ik{ell}. We concentrate on the 12-month results.

The median probability of nonzero saliva flow rate is displayed in Figure 3 for both stimulation conditions along with the Bayesian 95% point-wise credible intervals (Wahba, 1983Go). From this figure, one can see that as the level of damage increases the probability of having nonzero saliva flow decreases (as it should: more damage, less saliva). A patient who has an estimated ‘damage’ of 26.8 Gy will have a 50/50 chance of having no saliva production (under normal conditions at 12 months). This value is called the toxic dose 50, denoted TD50 = 26.8 Gy. According to our model, if the dose distribution to the parotid could be altered, without altering the dose to the tumor, in such a way that the 0.341 quantile of F = 1 – DVH is made smaller, the probability of the patient having nonzero saliva flow rate would be greater. Let {ell} and j denote the median values of the estimated marginal posterior distributions of {theta}{ell} and {phi}j, respectively. An estimate of the TD50 at time j for stimulation condition {ell} is {ell}/j. Under stimulated conditions, the estimated TD50=48.3 Gy at 1 year. The estimated TD50 at each of the postradiation therapy follow-up times are tabulated in Table 2. We see that there is an increase in the TD50 up to 18 months and then a slight drop-off. The increase indicates that the parotid glands recover from ionizing radiation-induced damage, to some degree. The slight drop-off may be due to chance variation or may be due to the pattern of missing observations shown in Table 1.



View larger version (30K):
[in this window]
[in a new window]
 
Fig. 3. The probability of nonzero saliva flow rate as a function of damage at 1 year postradiation therapy. The solid black line is the median probability. The dashed lines are the 95% Bayesian point-wise credible intervals. Observations are shown as dots at 0 on the vertical axis (those with zero saliva flow) and at 1 (those with positive saliva flow). As the level of damage increases the probability of nonzero saliva flow decreases. The gray lines indicate at what level of damage half the patients would retain some saliva output and half would have no saliva output.

 

View this table:
[in this window]
[in a new window]
 
Table 2. Estimated TD50 for each follow-up time with the 95% Bayesian credible intervals

 
The median of the estimated marginal population mean saliva flow rate posterior distribution for each condition by parotid gland is shown in Figure 4 along with the Bayesian 95% credible bands. In this figure, the gray shaded area represents the range of values of damage estimated from the observed data for that particular parotid gland. The solid black dots represent sample means from the observed data at 12 months. The observed data were first grouped into the lower, middle, and upper third quantiles of their estimated damage evaluated at the median value of p. Then for each grouped set of data, the mean saliva flow rate was plotted against the mean damage. The sample baseline mean saliva rate is also depicted in the graphs as a solid dot at a damage of 0. The ‘x’ symbol represents the mean saliva flow rate of the means of the latent observations evaluated by grouping the latent observations according to the grouping performed on the observed observations from the opposing parotid gland. Figure 4 shows that our model does a good job at modeling the marginal mean saliva flow rate. Note the similarity in shapes of the predicted marginal mean saliva flow rates in the four figures; especially the similarity in shape between the ipsilateral and contralateral predicted mean saliva flow rates under normal conditions and between the two parotid glands under stimulated conditions. Also note the positions of the means of the latent variables relative to the positions of the observed means from the opposing parotid gland under each stimulation condition. The latent means track the relative positions of the observed means fairly closely. This is a consequence of borrowed strength across the glands, stimulation condition, and time that was alluded to in Section 4.



View larger version (65K):
[in this window]
[in a new window]
 
Fig. 4. The marginal population mean saliva flow rate at 1 year postradiation therapy. The solid black line is the median of the estimated marginal population mean saliva flow rate. The dashed lines are the 95% Bayesian confidence bands. The solid dot at zero represents the mean of the observed baseline data. The gray shaded region represents the range of estimated damage, evaluated at p = 0.341, from observed data. The solid dots within the gray regions are grouped observed mean saliva flow rates evaluated at the mean level of damage within each group. Groups were formed by taking the lower, middle, and upper 1/3 quantiles. The x symbol represents the mean values of grouped mean latent saliva flow rates.

 
Figure 5 shows the median of the marginal population mean saliva flow rate posterior distribution superimposed on the 12-month, stimulated data. This figure also displays the baseline observations as well as the means of the latent and missing values that were drawn from the estimated posterior predictive distribution. Again, the latent/missing predicted observations mix nicely throughout the observed data, due to the strength borrowed from the observed data. At damage levels larger than about 25, the latent contralateral predicted observations are a bit higher than the observed ipsilateral observations. This is a consequence of the larger predicted baseline population mean (0.51 for the contralateral side, 0.36 for the ipsilateral side, both under stimulated conditions).



View larger version (28K):
[in this window]
[in a new window]
 
Fig. 5. The median of the estimated marginal population mean stimulated saliva flow rate posterior distribution for both contraleral and ipsilateral parotid glands (gray and black lines, respectively) at 12 months. Baseline data are indicated by {bigtriangleup}, observed data by {circ}, missing data (means) by {bigtriangledown}, and latent variables (means) by x. All baseline data are observed and plotted at –1 to assist in visual clarity.

 
5.1 Goodness-of-fit

We assessed model adequacy by a Bayesian {chi}2 goodness-of-fit statistic (Johnson, 2004Go). This goodness-of-fit test is similar to the classical {chi}2 goodness-of-fit test, except in the Bayesian version the bin probabilities pb are held fixed at their null values and the bin counts are considered random.

Let denote a sample from the posterior distribution and F the marginal sampling distribution after integrating out the missing data and latent variables. For this goodness-of-fit test we used 10 equally spaced bins such that the bin counts mb() are determined by counting the number of F(yijk{ell} | ) ((b – 1)/10, b/10] for b = 1,..., 10. When Yijk{ell} | = 0, we use a randomization procedure to determine bin assignment. That is, we draw z~U(0, 1 – {pi}j{ell}(dik)), then if F(z)((b 1)/10, b/10] we increment mb by 1. Johnson's Bayesian {chi}2 statistic is then

where n is the sample size and pb is the theoretical probability of bin assignment, in our case with 10 bins, pb = 0.1. Johnson suggests two summary measures of the distribution of RB(). First is the proportion of times it exceeds the critical value of a distribution, 16.919. That is, the 0.95 quantile of a {chi}2 distribution with nine degrees of freedom. Values much larger than the nominal 0.05 suggest lack of fit. The second suggestion is the probability that RB() is greater than a random deviate. The nominal value of this probability is 0.5 and values much greater than this nominal value suggest lack of fit. For our model, the proportion of times RB() exceeds the critical value of a distribution is 0.052. The probability that RB() is greater than a random deviate is 0.51. Both of these summaries suggest that our model fits the data adequately. Figure 6 depicts the quantile–quantile plot of RB values calculated from 20 000 samples from the posterior distribution.



View larger version (13K):
[in this window]
[in a new window]
 
Fig. 6. Quantile–quantile plot of RB values for 20 000 draws from the posterior distribution of our model. Observed values were sorted into 10 equally spaced bins. The expected order statistics are from a {chi}2 distribution with nine degrees of freedom.

 
As Johnson points out, the Bayesian {chi}2 goodness-of-fit statistic is also a useful tool for code verification. The distribution of RB tends to deviate from its null distribution when the model is incorrectly specified or there are coding errors. It is evident from Figure 6 that RB does not deviate from its null distribution and thus we are confident that we have coded our model correctly and that this is an adequate model for the data.


    6. IS SUCH COMPLEXITY NECESSARY?
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
It could be argued that we have unnecessarily complicated this analysis, and that perhaps a simpler model with less assumptions would fit the data adequately (and one that can be fit using off-the-shelf software). In this section, we relax some of our assumptions and fit the data with a generalized linear mixed model (GLMM). We will relax the assumption that the four saliva flow measurements from each individual are correlated and treat the four measurements as independent measurements. That is, at each time point, the saliva flow rate from the ipsilateral and contralateral parotid glands at the stimulated and unstimulated conditions are all independent measurements. We will still assume that the saliva flow rates from a particular parotid gland under a particular stimulation condition are correlated over time.

To fit the data we use the quasilikelihood approach specifying the mean function and modeling the variance of the outcome, saliva flow rate, as a function of the mean. Specifically, let Yij denote the saliva flow rate from parotid i at time j = 0, 1, 3, 6, 12, 18, 24. Let µij = E(Yij). Further, let C denote the set of contralateral parotid glands and S denote the set of parotid glands that were stimulated. Since the outcome is strictly nonnegative, we model the log of the expected saliva flow as a function of damage di to parotid i. As before, where Fi = 1 – DVHi. In particular,

Above, {alpha} is the model intercept and the ai ~ N(0, {sigma}2) are parotid-specific random effects. I(A) is the indicator function equal to 1 if A is true and 0 otherwise and ßj are dose effect parameters, one for each j > 0. Also, {gamma} and {delta} are the side and stimulation condition fixed effects, respectively. Further,

(6.1)

Note the similarity between this model and a model for count data with a Poisson error and canonical log link. In fact, fitting this model with the function glmmPQL in the MASS library (Venables and Ripley, 2002Go) in R (R Development Core Team, 2004Go), we can specify a Poisson family with log link. The model fits are identical, but since the observations are not integer counts, we encounter several warning messages that the data are not integer count data. Thus, we resort to the quasilikelihood approach. In this model, the mean saliva flow rate is an exponential function of damage. Our mixture model allows mean saliva flow rate to be a more flexible function of damage which contains, as a special case, the exponential function. Several correlation structures were considered. The particular correlation structure specified in (6.1) was chosen because it gave the fewest problems with convergence.

Ideally, p, and hence the p th quantiles di, should be estimated from the data, but since di are nonlinear functions of the DVHs, they cannot be estimated from the data in this quasilikelihood GLMM approach. Thus, we fit the model using several different values for p. In particular, we chose several different values of p between 0 and 1 and fit a separate model for each choice. Assessment of model fit is based on two measures: the sum of the squared Pearson standardized residuals, given as {chi}2, and the Bayesian information criterion (BIC), with smaller values of each denoting better fit. BIC is the sum of two terms: one that measures the goodness-of-fit (–2 times the log-likelihood) and the other that is a penalty term for complexity (number of parameters times the log of the number of observations). Formally, this quasilikelihood approach does not have a likelihood and thus there is no theoretical justification for using the BIC. Further, it is not clear how many parameters are in a random effects model. However, because the equations used in our quasilikelihood estimation are the same as the likelihood equations for count data using a Poisson model with log link and the number of parameters is not changing (no matter how many there are), we feel justified in informally using BIC as a criterion. The following table, Table 3, shows that the 0.55 quantile results in the best fitting model according to {chi}2 and that the 0.90 quantile gives the best fitting model according to BIC. The missing values in the table at p=0.4 and 0.45 are a result of nonconvergence due to numerical instability for those particular values of p. The best fitting model based on each criterion is shown in boldface. It is disconcerting that these criteria chose models that are quite different with regard to the ‘best’ p.


View this table:
[in this window]
[in a new window]
 
Table 3. BIC and {chi}2 values used for model selection when

 
A residual analysis shows a deficiency of this model. Figure 7 displays the Pearson standardized residuals as a function of the fitted values from this model (that chosen by {chi}2) and the Bayesian standardized residuals as a function of the Bayesian predicted values (Gelfand et al., 1992Go) from our mixture model. Note the curved structure seen in the Pearson standardized residuals in Panels A and C. This structure indicates a lack of fit of the model. In particular, there is structure in the data that is not accounted for in the quasilikelihood GLMM approach. This structure is due to the large number of zeros in the observed outcome (508 out of the 1612 observations are 0). The Pearson standardized residuals for the model chosen by BIC show the same nonrandom structure (not shown). We also tried several transformations of the data and various other GLMM models (not shown); however, we were not able to account for this structure found in the data.



View larger version (42K):
[in this window]
[in a new window]
 
Fig. 7. Residual plots from the GLMM and the Mixture Model. Panels C and D are exploded portions of Panels A and B, respectively. Note the curved structure in the Pearson standardized residuals from the GLMM in Panels A and C. This indicates that the model is not fitting the observed zero saliva flow rates adequately. Contrast this with the Bayesian standardized residuals, Panels B and D. The mixture model explicitly models the zero saliva flow rates. Little nonrandom structure remains in the Bayesian standardized residuals. The range of the predicted outcome was truncated at 1.5 in Panels A and B.

 
By contrast, the residuals from our mixture model shown in Panels B and D of Figure 7 show much less residual structure and do not indicate any severe model deficiencies. This is a result of our explicit modeling of the observed zeros.

To summarize, we feel that the level of complexity of our mixture model is justified on several points. First, we had several convergence problems with the GLMM approach. Second, this approach cannot account for the large number of zero outcomes in the data. Third, our decision on which quantile to use is based on a heuristic with no theoretical justification. Fourth, different model selection criteria chose quite different models. Fifth, this approach cannot estimate p nor can it gives measures of uncertainty about p. Lastly, nonnested models cannot be compared using these model comparison measures (this will become important when we compare different dose–damage models—a central goal of this ongoing study, see Section 7).


    7. DISCUSSION
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 
We have developed and implemented a realistic, albeit complicated, model to analyze the parotid data set outlined in Section 2. The need to account for the numerous outcome measures of zero and to account for within parotid gland correlation over time, correlation between stimulation conditions, and correlation between the two parotid glands within a subject requires such a complex model. However, this model did come at a price: time and effort in developing specialized software. In this model, we used an empirical measure of damage, namely the inverse cdf of the dose distribution and concentrated on modeling the damage–injury relationship in the dose–damage–injury model. Further, our analysis shows that the parotid gland recovers with time. A similar result was found by Scrimger et al. (2004)Go. Given that we now have a statistical model that fits the damage–injury relationship and takes into account the correlation inherent in the data (over time, across parotid glands and stimulation conditions), our goal is to concentrate on the dose–damage relationship. Several theoretical models have been proposed (see Section 1) and we will pursue the use of these as well as the modified versions of each. Other empirical relationships between dose and damage will also be explored, such as the generalized mean of the dose distribution. The challenge will be in comparing models. Bayes factors are the traditional method of comparing models, however, except for the simplest of models, they are computationally burdensome. Johnson's goodness-of-fit statistic, RB, could be used to compare models. The model whose RB distribution is closest to its null (in some objective sense) would be chosen as the better model for the data at hand. Other summary measures of model adequacy will also be explored, such as the sum of the logs of the conditional predictive ordinates (Gelfand et al., 1992Go), the deviance information criterion (Spiegelhalter et al., 2002Go), and a minimum posterior predictive loss approach initially proposed by Gelfand and Ghosh (1998)Go. We note that all four of these approaches can be used to compare nonnested models. The last two methods provide a trade-off between goodness-of-fit and model complexity and thus may be preferred over other methods when comparing models. This is important because the different biomathematical models that relate dose to damage require different parameters in their modeling. Johnson's {chi}2 goodness-of-fit statistic, on the other hand, is useful in determining model fit as opposed to model choice.


    ACKNOWLEDGMENTS
 
This work was funded by the National Institutes of Health, grant number RO1 CA095096.


    REFERENCES
 TOP
 SUMMARY
 1. INTRODUCTION
 2. THE PAROTID DATA...
 3. THE DOSE-DAMAGE-INJURY MODEL
 4. MCMCESTIMATION
 5. DATA ANALYSIS RESULTS
 6. IS SUCH COMPLEXITY...
 7. DISCUSSION
 REFERENCES
 

    BOERSMA, L. J., VAN DEN BRINK, M., BRUCE, A. M., SHOUMAN, T., GRAS, L., TE VELDE, A. AND LEBESQUE, J. V. (1998). Estimation of the incidence of late bladder and rectum complications after high-dose (70–85 GY) conformal radiotherapy for prostate cancer, using dose–volume histograms. International Journal of Radiation Oncology, Biology, Physics 41, 83–92.[CrossRef][Web of Science][Medline]

    CHAO, K., DEASY, J., MARKMAN, J., HAYNIE, J., PEREZ, C., PURDY, J. AND LOW, D. (2001). A prospective study of salivary function sparing in patients with head-and-neck cancers receiving intensity-modulated or three-dimensional radiation therapy: initial results. International Journal of Radiation Oncology, Biology, Physics 49, 907–916.[CrossRef][Web of Science][Medline]

    CHAPPELL, R., NONDAHL, D. M. AND FOWLER, J. (1995). Modelling does and local control in radiotherapy. Journal of the American Statistical Association 90, 829–938.[CrossRef]

    DAWSON, L. A., NORMOLLE, D., BALTER, J. M., MCGINN, C. J., LAWRENCE, T. S. AND TEN HAKEN, R. K. (2002). Analysis of radiation-induced liver disease using the lyman ntcp model. International Journal of Radiation Oncology, Biology, Physics 53, 810–882.[CrossRef][Web of Science][Medline]

    DEASY, J. O., NIEMIERKO, A., HERBERT, D., YAN, D., JACKSON, A., TEN HAKEN, R. K., LANGER, M. AND SAPARETO, S. (2002). Methodological issues in radiation dose–volume outcome analyses: summary of a joint aapm/nih workshop. Medical Physics 29, 2109–2127.[CrossRef][Web of Science][Medline]

    EISBRUCH, A., TEN HAKEN, R. K., KIM, H. M., MARSH, L. H. AND SHIP, J. A. (1999). Dose, volume and function relationships in parotid salivary glands following conformal and intensity-modulated irradiation of head and neck cancer. International Journal of Radiation Oncology, Biology, Physics 45, 577–587.[CrossRef][Web of Science][Medline]

    EISBRUCH, A., KIM, H. M., TERRELL, J. E., MARSH, L. H., DAWSON, L. A. AND SHIP, J. A. (2001). Xerostomia and its predictors following parotid-sparing irradiation of head-and-neck cancer. International Journal of Radiation Oncology, Biology, Physics 50, 695–704.[CrossRef][Web of Science][Medline]

    GELFAND, A. E., DEY, D. K. AND CHANG, H. (1992). Model determination using predictive distributions with implementation via sampling-based methods (with Discussion). In Bernardo, J. M. and DeGroot, M. H. (eds), Bayesian Statistics 4. Oxford: Oxford University Press.

    GELFAND, A. E. AND GHOSH, S. K. (1998). Model choice: a minimum posterior predictive loss approach. Biometrika 85, 1–11.[Abstract/Free Full Text]

    HEFT, M. W. AND BAUM, B. J. (1984). Unstimulated and stimulated parotid salivary flow rate in individuals of different ages. Journal of Dental Research 63, 1182–1185.[Abstract/Free Full Text]

    JACKSON, A., KUTCHER, G. J. AND YORKE, E. D. (1993). Probability of radiation induced complications for normal tissues with parallel architecture subject to non-uniform radiation. Medical Physics 20, 613–625.[CrossRef][Web of Science][Medline]

    JACKSON, A., KUTCHER, G. J. AND YORKE, E. D. (1995). Analysis of clinical complication data for radiation hepatic using a parallel architecture model. International Journal of Radiation Oncology, Biology, Physics 31, 883–891.[CrossRef][Web of Science][Medline]

    JOHNSON, V. E. (2004). A Bayesian chi-squared test for goodness of fit. Annals of Statistics 32, 2361–2384.[CrossRef]

    KWA, S. L. S., LEBESQUE, J. V., THEUWS, J. C. M., MARKS, L. B., MUNLEY, M. T., BENTEL, G. B., OETZEL, D., SPAHN, U., GRAHAM, M. V., DRZYMALA, R. E. et al. (1998). Radiation pneumonitis as a function of mean dose lung dose: an analysis of pooled data of 540 patients. International Journal of Radiation Oncology, Biology, Physics 42, 1–9.[Web of Science][Medline]

    LAMBERT, D. (1992). Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34, 1–14.

    LICHTER, A. S. (1991). Three-dimensional conformal radiation therapy: a testable hypothesis. International Journal of Radiation Oncology, Biology, Physics 21, 853–855.[Web of Science][Medline]

    LITTLE, R. J. A. AND RUBIN, D. B. (2002). Bayesian Data Analysis. New York: Wiley-Interscience.

    LYMAN, J. T. (1985). Complication probability as assessed from dose–volume-histograms. Radiation Research 104, s13–s19.[CrossRef]

    NIEMIERKO, A. AND GOITEIN, M. (1992). Modeling of normal tissue response to radiation: the critical volume model. International Journal of Radiation Oncology, Biology, Physics 25, 135–145.

    R DEVELOPMENT CORE TEAM (2004). R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.

    ROBERTS, S. AND HENDRY, J. (1993). A direct maximum-likelihood analysis of a collection of worldwide tumour-control data. Radiotherapy and Oncology 29, 69–74.[CrossRef][Web of Science][Medline]

    ROESINK, J., MOERLAND, M., BATTERMANN, J., JAN HORDIJK, G. AND TERHAARD, C. (2001). Quantitative dose–volume response analysis of changes in parotid gland function after radiotherapy in the head-and-neck region. International Journal of Radiation Oncology, Biology, Physics 51, 938–946.[CrossRef][Web of Science][Medline]

    SCHILSTRA, C. AND MEERTENS, H. (2001). Calculation of the uncertainty in complication probability for various does-response models. International Journal of Radiation Oncology, Biology, Physics 50, 147–158.[CrossRef][Web of Science][Medline]

    SCRIMGER, R. A., STAVREV, P., PARLIAMENT, M. B., FIELD, C., THOMPSON, H., STAVREVA, N. AND FALLONE, B. G. (2004). Phenomenologic model describing flow reduction for parotid gland irradiation with intensity-modulated radiotherapy: evidence of significant recovery effect. International Journal of Radiation Oncology, Biology, Physics, 178–185.

    SEPPENWOOLDE, Y., LEBESQUE, J. V., DE JAEGER, K., BELDERBOS, J. S. A., BOERSMA, L. J., SCHILSTRA, C., HENNING, G. T., HAYMAN, J. A., MARTEL, M. K. AND TEN HAKEN, R. K. (2003). Comparing different ntcp models that predict the incidence of radiation pneumonitis. International Journal of Radiation Oncology, Biology, Physics 55, 724–735.[CrossRef][Web of Science][Medline]

    SHULTHEISS, T. E. (2001). The controversies and pitfalls in modeling normal tissue radiation injury/damage. Seminars in Radiation Oncology 11, 210–214.[CrossRef][Web of Science][Medline]

    SHULTHEISS, T. E., ORTON, C. G. AND PECK, R. A. (1983). Models in radiotherapy: volume effects. Medical Physics 10, 410–415.[CrossRef][Web of Science][Medline]

    SPIEGELHALTER, D. J., BEST, N. G., CARLIN, B. P. AND VAN DER LINDE, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, Series B 64, 583–639.[CrossRef]

    STAVREV, P. AND STAVREV, N. (2000). Fraction size and dose parameters related to the incidence of pericardial effusions: regarding Martel et al. International Journal of Radiation Oncology, Biology, Physics 48, 609–617.

    THEUWS, J. C., KWA, S. L., WAGENAAR, A. C., SEPPENWOOLDE, Y., BOERSMA, L. J., DAMEN, E. M., MULLER, S. H., BAAS, P. AND LEBESQUE, J. V. (1998). Prediction of overall pulmonary function loss in relation to the 3-d dose distribution for patients with breast cancer and malignant lymphoma. Radiotherapy and Oncology 49, 233–243.[CrossRef][Web of Science][Medline]

    VENABLES, W. N. AND RIPLEY, B. D. (2002). Modern Applied Statistics with S, 4th edition. New York: Springer.

    WAHBA, G. (1983). Bayesian "confidence intervals" for the cross-validated smoothing spline. Journal of the Royal Statistical Society, Series B, Methodological 45, 133–150.

    YORK, E. D., KUTCHER, G. J., JACKSON, A. AND LING, C. C. (1993). Probability of radiation-induced complications in normal tissue with parallel architecture under conditions of uniform whole or partial organ irradiation. Radiotherapy and Oncology 26, 226–237.[CrossRef][Web of Science][Medline]

    Received November 17, 2004; revised May 9, 2005; accepted for publication May 19, 2005.


    Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



    This Article
    Right arrow Abstract Freely available
    Right arrow FREE Full Text (PDF) Freely available
    Right arrow All Versions of this Article:
    6/4/615    most recent
    kxi032v1
    Right arrow Alert me when this article is cited
    Right arrow Alert me if a correction is posted
    Services
    Right arrow Email this article to a friend
    Right arrow Similar articles in this journal
    Right arrow Similar articles in PubMed
    Right arrow Alert me to new issues of the journal
    Right arrow Add to My Personal Archive
    Right arrow Download to citation manager
    Right arrowRequest Permissions
    Right arrow Disclaimer
    Google Scholar
    Right arrow Articles by Johnson, T. D.
    Right arrow Articles by Eisbruch, A.
    Right arrow Search for Related Content
    PubMed
    Right arrow PubMed Citation
    Right arrow Articles by Johnson, T. D.
    Right arrow Articles by Eisbruch, A.
    Social Bookmarking
     Add to CiteULike   Add to Connotea   Add to Del.icio.us  
    What's this?