Volume 28, Issue 5 pp. 1289-1308

Methods for Assessing Uncertainty in Fundamental Assumptions and Associated Models for Cancer Risk Assessment

Mitchell J. Small

*Address correspondence to Mitchell J. Small, Civil & Environmental Engineering and Engineering & Public Policy, Carnegie Mellon University, Pittsburgh, PA, 15213, USA; tel: 412-268-8782; fax: 412-268-8782; [email protected].
First published: 20 September 2008

Abstract

The distributional approach for uncertainty analysis in cancer risk assessment is reviewed and extended. The method considers a combination of bioassay study results, targeted experiments, and expert judgment regarding biological mechanisms to predict a probability distribution for uncertain cancer risks. Probabilities are assigned to alternative model components, including the determination of human carcinogenicity, mode of action, the dosimetry measure for exposure, the mathematical form of the dose-response relationship, the experimental data set(s) used to fit the relationship, and the formula used for interspecies extrapolation. Alternative software platforms for implementing the method are considered, including Bayesian belief networks (BBNs) that facilitate assignment of prior probabilities, specification of relationships among model components, and identification of all output nodes on the probability tree. The method is demonstrated using the application of Evans, Sielken, and co-workers for predicting cancer risk from formaldehyde inhalation exposure. Uncertainty distributions are derived for maximum likelihood estimate (MLE) and 95th percentile upper confidence limit (UCL) unit cancer risk estimates, and the effects of resolving selected model uncertainties on these distributions are demonstrated, considering both perfect and partial information for these model components. A method for synthesizing the results of multiple mechanistic studies is introduced, considering the assessed sensitivities and selectivities of the studies for their targeted effects. A highly simplified example is presented illustrating assessment of genotoxicity based on studies of DNA damage response caused by naphthalene and its metabolites. The approach can provide a formal mechanism for synthesizing multiple sources of information using a transparent and replicable weight-of-evidence procedure.

1. INTRODUCTION

In a recent report, Cohen(1) outlines the key steps in cancer risk assessment, including: (1) a determination of whether the agent is carcinogenic in humans, (2) estimation of the agent potency within the range of dosage used in an animal study, (3) quantitative extrapolation of risk from the test species to humans, and (4) high- to low-dose extrapolation to estimate risks for the dose range experienced in the exposed human population. In this article, these and other factors are considered as part of a general methodology for addressing fundamental uncertainties in the underlying biological mechanisms and model assumptions used in a cancer risk assessment. The methodology is based on a “distributional approach” to uncertainty analysis. In the distributional approach the risk analyst builds probability trees that allow calculation of risk estimates and associated probabilities for different plausible combinations of model assumptions. Software platforms for influence diagrams and “Bayesian belief networks” (BBNs) are shown to greatly facilitate model construction and evaluation. The method enables evaluations of the effect of complete or partial reductions in the uncertainty of different elements of the risk model, providing a first characterization of the value of information for specific studies that target these elements.

The distributional approach for characterizing uncertainty in cancer risk assessment was developed by Evans, Sielken, and co-workers beginning in the 1990s(2–10) and has also been referred to as information analysis, weight-of-evidence analysis, the comprehensive methodology, and comprehensive realism.(8–10) The method has since been acknowledged in a number of reviews of cancer risk assessment practice and research needs,(11–13) and applied in various forms for risk assessment of different chemical compounds.(14–19)

The motivation for the distributional approach is the recognition that the use of a single set of assumptions for the components of a cancer risk assessment, whether default, conservative, or otherwise, fails to capture the full range of plausible or likely relationships, how these relationships depend upon our current state of knowledge, the implications for computed values of potency or unit risk, and the opportunities for improved estimates. The distributional approach thereby enables consideration of a “portfolio-of-mechanisms” that may contribute to carcinogenesis.(20)

1.1. Methodological Context

Uncertainty may be considered and characterized at various stages of a cancer risk assessment. Traditionally, this has occurred first in the fitting of models to the experimental bioassay data. In the absence of toxicodynamic models (discussed further below), a functional relationship is fit between the experimental dose and the observed incidence of tumor occurrence or a related precursor endpoint, by statistically estimating the parameters of the dose-response equation. Different empirical functional forms may be fit to the bioassay results, often with only minor differences in their statistical goodness of fit to the experimental data. However, the predicted risks from these alternative models are often very different when extrapolated from high experimental doses, where risks of 10−2–1 must be achieved in order to detect effects and fit a dose-response relationship with a limited number of laboratory animals tested at each dose level (typically ∼50 animals per group), to the much lower dosage rates typically received by the human population, where expected risks of 10−10–10−3 are generally determined. The uncertainty involved in choosing a dose-response model, its associated low-dose extrapolation, and the dependence of these on the mode of action are key elements of the distributional method described in this article.

In addition to the “between-model” uncertainty inherent in the consideration of alternative dose-response equations, “within-model” uncertainty is reflected in the statistical uncertainty of the fitted parameters of any given equation. This parameter uncertainty is translated into uncertainty in a benchmark dose (BMD), in particular, the point of departure (POD) dose. The PODX is chosen as the effective dose that results in an X percentage increase in the observed cancer endpoint relative to the control group. A PODX estimated for cancer risk assessment is sometimes denoted as a BMDX (both of these terms also apply to dose metrics used for noncancer effects), a tumorigenic dose (TDX, especially in Europe, with the TD25 often referred to simply as the T25), or an effective dose (EDX). The latter is the most common terminology used for cancer risk assessment in the United States, with the ED10 the most commonly applied POD.

Statistical methods allow for the determination of a central, or “best,” point estimate for the EDX. This estimate is usually determined from the parameter values that make the observed experimental results most likely to have occurred (relative to other possible parameter values) and is referred to as the maximum likelihood estimate (MLE). In addition, statistical bounds can be determined for the estimated EDX. These bounds can be calculated using classical statistical methods—the interval for the EDX estimate is then referred to as a confidence interval, and the upper and lower values of the interval are referred to as the upper confidence limit (UCL) and the lower confidence limit (LCL), respectively. The latter, commonly referred to as the LEDX (or BMDLX), is of particular interest in health risk assessment, since it is the lower statistical bound of the effective dose that determines the upper statistical estimate of the cancer potency or risk for a given exposure. UCL and LCL estimates are now typically provided as part of the output from software packages used for fitting dose-response relationships and estimating BMDs (see, for example, the EPA Benchmark Dose Software at: http://cfpub.epa.gov/ncea/cfm/recordisplay.cfm?deid=164443). Additional guidance for calculating confidence intervals for dose-response relationships is found in References 21–23.
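The MLE calculation described above can be illustrated with a minimal sketch. The bioassay counts, the grid-search optimizer, and the one-hit (one-stage) model used here are all illustrative assumptions, not the fitting procedure of any particular assessment; a real analysis would use dedicated software such as EPA's BMDS.

```python
import math

# Hypothetical bioassay: dose (ppm) -> (tumor-bearing animals, animals tested).
bioassay = {0.0: (0, 50), 2.0: (2, 50), 6.0: (8, 50), 15.0: (25, 50)}

def one_hit(q, d):
    """One-hit (one-stage) model: P(tumor) = 1 - exp(-q*d)."""
    return 1.0 - math.exp(-q * d)

def log_likelihood(q):
    """Binomial log-likelihood of the observed counts given potency q."""
    ll = 0.0
    for d, (k, n) in bioassay.items():
        p = min(max(one_hit(q, d), 1e-12), 1.0 - 1e-12)
        ll += k * math.log(p) + (n - k) * math.log(1.0 - p)
    return ll

# Crude grid search for the MLE (a real fit would use a proper optimizer).
grid = [i * 1e-4 for i in range(1, 2001)]
q_mle = max(grid, key=log_likelihood)

# ED10: the dose giving 10% extra risk under the fitted model,
# i.e. 1 - exp(-q*d) = 0.10, so d = -ln(0.9)/q.
ed10 = -math.log(0.9) / q_mle
print(f"MLE q = {q_mle:.4f} per ppm, ED10 = {ed10:.2f} ppm")
```

The parameter value that maximizes the likelihood is the MLE, and the ED10 follows directly by inverting the fitted dose-response equation.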

Bayesian statistical methods have been used increasingly in recent years to fit dose-response models and to estimate the uncertainty in model parameters and derived ED and risk values (e.g., References 24–27). Bayesian methods begin with a prior probability distribution for the model parameters and associated risk estimates and update these estimates using the likelihood function for the experimental results (the same function that is maximized to determine the MLE) to derive the posterior distribution of the model parameters and associated risk metrics. The prior distribution may be chosen to reflect expert knowledge regarding the model, or diffuse or informationless priors may be assumed. Statistical bounds determined using Bayesian methods are referred to as credible intervals.
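The Bayesian updating cycle (prior × likelihood → posterior, with credible bounds read off the posterior) can be sketched on a discrete parameter grid. The single-dose data, the uniform prior, and the one-hit model below are assumptions chosen only to keep the example short.

```python
import math

# Hypothetical single dose group: 8 tumors in 50 animals at 6 ppm,
# one-hit model P = 1 - exp(-q*d), uniform (diffuse) prior on potency q.
k, n, d = 8, 50, 6.0
grid = [i * 1e-4 for i in range(1, 3001)]      # q from 0.0001 to 0.3
prior = [1.0 / len(grid)] * len(grid)

def likelihood(q):
    p = 1.0 - math.exp(-q * d)
    return p ** k * (1.0 - p) ** (n - k)

# Posterior proportional to prior times likelihood, normalized on the grid.
unnorm = [pr * likelihood(q) for q, pr in zip(grid, prior)]
total = sum(unnorm)
post = [u / total for u in unnorm]

def percentile(target):
    """q value at a given cumulative posterior probability."""
    cum = 0.0
    for q, p in zip(grid, post):
        cum += p
        if cum >= target:
            return q
    return grid[-1]

# 90% credible interval: 5th to 95th percentile of the posterior.
lo, hi = percentile(0.05), percentile(0.95)
print(f"90% credible interval for q: ({lo:.4f}, {hi:.4f})")
```

With a diffuse prior the credible interval is driven almost entirely by the likelihood, paralleling the classical confidence interval; an informative expert-based prior would pull the bounds toward the prior.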

When reporting statistical bounds, typically a 90% confidence (or credible) interval is reported, ranging from the 5th percentile LCL to the 95th percentile UCL. As noted above, the conservative, health-protective estimate of risk is associated with the LCL for the POD (e.g., the 5th% LED10), which yields the corresponding UCL for the risk estimate, in this case, the 95th% UCL risk. Drawing on the advice of the EPA Science Advisory Board,(28) the current EPA guidance notes that “risk assessors should calculate, to the extent practicable, and present the central estimate and the corresponding upper and lower statistical bounds (such as confidence limits) to inform decisionmakers” (Reference 29, pp. 1–14).
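The correspondence between the LCL on the POD and the UCL on risk can be made concrete with a linear-from-POD construction, sketched here with hypothetical ED10 and LED10 values (the numbers are illustrative, not from any assessment).

```python
# Linear extrapolation from the point of departure: the slope (unit risk)
# is the benchmark response divided by the POD, so a LOWER bound on the
# effective dose yields an UPPER bound on risk.
bmr = 0.10    # benchmark response for an ED10
ed10 = 4.2    # ppm, central (MLE) estimate of the ED10 (hypothetical)
led10 = 2.8   # ppm, 5th-percentile LCL on the ED10 (hypothetical)

central_slope = bmr / ed10    # central-estimate unit risk per ppm
ucl_slope = bmr / led10       # 95th% UCL unit risk per ppm
print(f"central = {central_slope:.4f} /ppm, UCL = {ucl_slope:.4f} /ppm")

# Risk at a low environmental exposure, e.g., 0.001 ppm:
exposure = 0.001
print(f"UCL risk at {exposure} ppm = {ucl_slope * exposure:.2e}")
```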

Consistent with this recommendation, the distributional approach presented here (and in References 2–10) considers multiple risk metrics, including a central estimate (the MLE) and statistical bounds (here the 95th% UCL). Consideration of both is assumed to capture and reflect the within-model uncertainty associated with fitting a dose-response curve. The method then goes on to consider the fundamental conceptual (e.g., mode of action) assumptions that lead to the selection of alternative dose-response functions and animal-to-human extrapolation models. This between-model uncertainty is used to derive a probability distribution function for each metric, which can then be analyzed to inform decisions that will be based on either or both of the metrics. From this perspective, it is fully legitimate to compute a probability distribution function for each metric, representing the uncertainty in an estimated MLE risk or 95th UCL risk, given that it can be computed in many different ways, depending on the particular combination of uncertain assumptions that is adopted. Similarly, moments of these computed probability distributions (means, standard deviations, etc.) can be determined, and these are especially useful for indicating how the distributions shift as particular assumptions are invoked, as part of a model sensitivity analysis, or in response to new knowledge regarding the scientific support for alternative assumptions.

The methodology relies on the assignment of probabilities to various assumptions and submodels in the cancer risk assessment. In general, these assignments are informed by the weight of evidence from scientific studies, arguments put forth in the scientific literature, National Toxicology Program (NTP) reports, IRIS Toxicological Reviews, and/or previous risk assessments where the relative scientific support for alternative assumptions has been evaluated. In addition, formal expert elicitation may be utilized to determine the probabilities for the component models and their dependencies. Expert elicitation has been used in a wide range of scientific studies (e.g., References 30–34), including environmental health risk assessment.(35–39) In the studies presented here, panels of experts were brought together to derive or formally elicit model probabilities. Experts often differ both in their assessments of current scientific knowledge and their beliefs regarding how proposed studies might change these assessments (e.g., with differing estimates of the sensitivity and selectivity of particular experiments with respect to particular hypotheses). In the context of the methods presented herein, we refer to assessments of current scientific knowledge as prior beliefs, and the new beliefs that result from new or proposed studies as posterior beliefs. The Bayesian belief networks presented herein consider both component assumptions for the model elements and scientific studies that can be used to test (and lend differential support to) these assumptions. Expert views regarding both can thus be included in the models.

1.2. Applications Considered

Two of the initial applications of the distributional method were for detailed assessments of low-dose cancer risk for formaldehyde(2–4) and chloroform.(5, 6) These evaluations were based primarily on the use of animal bioassay studies to fit dose-response relationships. A distributional assessment of cancer risk due to occupational exposure to formaldehyde was also conducted by Fayerweather et al.(43) using available epidemiological data. These applications are briefly reviewed and the former(2,3) is used as the basis for demonstrating implementation and extension of the probability tree method for estimating cancer risk from exposure to formaldehyde. As noted above, the probability inputs to the distributional model are based primarily on expert judgment elicited using a combination of formal and informal methods.(3–6) However, the presentation is intended solely as a demonstration of methodology, not as a formal cancer risk assessment for formaldehyde, in part because not all of the processes believed to be important in determining the mode of action of formaldehyde are considered (e.g., References 44–47), and in part because the probability weights used in the tree are taken directly from Reference 2, reflecting available data and expert judgment circa 1990.

2. CRITICAL MODEL ELEMENTS AND ASSUMPTIONS

The probability tree models for formaldehyde(2–4) and chloroform(5,6) address uncertainty in the following elements of a dose-response characterization that is derived primarily from animal bioassay results.

  • 1

    Whether the agent is indeed a human carcinogen and, if so, for which target organ. As noted in Reference 1, estimates of the probability that a chemical is a human carcinogen can be based on screening tests for genotoxicity or mutagenicity, evidence for compounds with similar animal bioassay results, and confirmed (or at least better) information regarding human carcinogenicity, or estimates from SAR models. The assessment may also consider available epidemiological evidence and mechanistic biological information regarding compound delivery to, and behavior in, target tissues.

  • 2

    Mode of action. The determination of the mode of action is a critical step in selecting among alternative models for the dose-response relationship and interspecies extrapolation. Principal options generally include cytotoxicity to target cells, cell proliferation, and genotoxicity.

  • 3

    Dose scale for dose-response modeling. The way in which dose is characterized reflects our understanding of the mechanisms of agent absorption, distribution, metabolism, and excretion (ADME) in the test species and humans. In recent years, physiologically based pharmacokinetic (PBPK) models have been used to inform dosimetry assessments and dose scale selections (e.g., References 48–54). Furthermore, as we have learned more, these assessments have become more complex, considering multiple target tissues, metabolite formation, and dose-dependent mechanisms of toxicity.(60)

  • 4

    Dose-response model. The shape of the dose-response relationship is closely linked to the assumed carcinogenic mode of action in step 2, and is also affected by the dose scale in step 3. Four options are considered in both the formaldehyde and chloroform assessments: (a) a probit model representing highly nonlinear, threshold-like behavior at low dose; (b) a five-stage (multistage) model that results in sublinear relationships at low dose; (c) a five-stage model above 1 ppm formaldehyde, with linear interpolation to zero below 1 ppm; and (d) a one-stage (or one-hit) model that is essentially linear throughout the range of doses considered.

  • 5

    Experimental data set. In some cases more than one bioassay study is available for fitting the dose-response relationship. For example, the chloroform assessment of Reference 5 considered nine different bioassay studies conducted by different groups using rat and mouse, female and male, and liver and kidney endpoints.

  • 6

    Interspecies extrapolation. As noted in Reference 1, extrapolation across species is usually made assuming dosage rates scaled to the three-fourth power of bodyweight. Lacking detailed PBPK models to inform the estimate, other options typically considered include assuming a direct equivalence between the test species and human dosage rates, and the use of bodyweight scaled to the two-third power (corresponding to equivalence based on relative surface area).
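The three interspecies extrapolation options in item 6 can be compared with a short sketch. Assuming equal carcinogenic effect at equal dose rates (mg/day) scaled to BW raised to a power p, the per-kg human equivalent dose (HED) works out to the animal dose multiplied by (BW_animal/BW_human)^(1−p); the bodyweights and dose below are illustrative placeholders.

```python
def human_equivalent_dose(animal_dose_mg_per_kg, bw_animal_kg, bw_human_kg, power):
    """Scale a per-kg dose rate across species, assuming equal effect at
    equal mg/day dose rates scaled to BW**power (power = 1, 3/4, or 2/3)."""
    # mg/day equivalence at BW**power implies the mg/kg/day dose scales
    # by (BW_animal / BW_human)**(1 - power).
    return animal_dose_mg_per_kg * (bw_animal_kg / bw_human_kg) ** (1.0 - power)

rat_bw, human_bw = 0.35, 70.0   # representative bodyweights, kg
dose = 10.0                     # hypothetical rat dose, mg/kg/day
for power, label in [(1.0, "direct equivalence"),
                     (0.75, "BW to 3/4ths"),
                     (2.0 / 3.0, "BW to 2/3rds (surface area)")]:
    hed = human_equivalent_dose(dose, rat_bw, human_bw, power)
    print(f"{label:>28}: HED = {hed:.3f} mg/kg/day")
```

Because the bodyweight powers yield a smaller human equivalent dose for the same animal dose, they imply a higher human potency per unit dose than direct equivalence, consistent with the ordering of risk estimates noted in Section 3.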

3. IMPLEMENTATION OF THE DISTRIBUTIONAL APPROACH FOR LOW-DOSE FORMALDEHYDE CANCER RISK ASSESSMENT

The initial applications of the distributional approach for formaldehyde cancer risk assessment were based on animal bioassay studies and appeared in References 2 and 3. Fig. 1 presents the network model (described further below) developed here to consider the alternative assumptions in this assessment. The left portion of Fig. 1 presents the six components of the model and the alternatives available for each. Prior probabilities are assigned (for components 1, 2, 3, and 5) or derived (for components 4 and 6) for each of the component alternatives. The right side of Fig. 1 shows the two risk metrics that are computed conditional on the component assumptions: (1) the MLE risk and (2) the 95th UCL risk. The particular application shown here is from Reference 2, where the output metric is the added cancer risk resulting from lifetime inhalation of air containing 1 ppb (0.001 ppm) of formaldehyde. The lifetime exposure is assumed to begin at age 4, with exposure occurring 30 hours per week. The risk outputs are summarized by the computed probabilities (shown as percentages summing to 100) for each of 13 discretized, order-of-magnitude bins, with any risk value below 10−12 assumed to be zero and assigned to the first bin, denoted by 1e-25 to 1e-12.

Fig. 1. Compiled BBN model for formaldehyde low-dose cancer risk assessment with prior probabilities. (Probabilities shown as percentages summing to 100 in this and all subsequent network displays.)

The prior probabilities for the six components in Fig. 1 were assigned by Reference 2 as follows.

  • 1

    Human Carcinogen/Target Tissue. In the case of formaldehyde, human carcinogenicity was treated in a general manner, without specification of a target organ(s). The probability that formaldehyde is a human carcinogen was assigned a value of 0.8 by Holland and Sielken,(2) consistent with its EPA (B1) and IARC (2A) classifications as a probable human carcinogen.

  • 2

    Mode of Action. Three alternatives were considered for formaldehyde: (a) cell proliferation (inducing cell growth) only, (b) genotoxicity only, and (c) both cell proliferation and genotoxicity. Based on previous studies, Holland and Sielken(2) assigned the predominant probability (0.8) to cell proliferation only, with 0.195 probability assigned to both cell proliferation and genotoxicity, and only a probability of 0.005 assigned to genotoxicity only.

  • 3

    Dose Scale. Holland and Sielken(2) included the following dosimetry options for formaldehyde: (a) the concentration in the inhaled air, (b) total daily intake, and (c) a biomarker-based measurement of the covalent bonding of formaldehyde to DNA in respiratory tissues, as measured by the amount of formaldehyde-induced DNA protein cross-links (DPX). The different dose scales were assigned weights of 0.1 (ppm inhaled), 0.3 (total daily intake), and 0.6 (DPX), based on the then-current understanding of ADME mechanisms in rats and humans.(2)

  • 4

    Dose-Response Model. The dose-response model for formaldehyde is conditioned on the assumed mode of action and the dose scale. The probabilities for each dose-response model, conditioned on each combination of mode of action and dose scale, are shown in Table I, Panel A. These conditional probabilities imply that:

    • The probit nonlinear-threshold model is used solely when cell proliferation is assumed as the only mode of action, indicating a more likely role for dosimetry based on the inhaled ppm and total daily intake measures;

    • The five-stage sublinear model is applicable across all three modes of action and all three dose scales with relatively little differential preference between them;

    • The five-stage model with linear interpolation to zero is relatively more likely when both cell proliferation and genotoxicity are assumed to be operative, and implies a somewhat greater weighting on the use of covalent bonding and DPX for the dose scale; and

    • The one-stage linear model is associated solely with the genotoxic mode of action, and again implies a modest shift toward DPX as the preferred dose scale. The combination of a genotoxic mode of action and a linear dose-response relationship yields (together with the particular interspecies extrapolation methods described below) the highest estimates for low-dose cancer risk.

  • 5

    Experimental Data Set. Estimates for the formaldehyde study were based on a CIIT (Chemical Industry Institute of Technology) rat inhalation study that identified malignant squamous cell carcinoma (MSCC) and benign polypoid adenoma in the exposed rat nasal cavities.(61) The two options for the data sets included are: (a) MSCC only (assigned by Holland and Sielken(2) an initial weight of 0.8), and (b) both MSCC and benign polypoid adenoma (assigned the remaining weight of 0.2). When the benign tumors are included, somewhat higher risk estimates result.

  • 6

    Interspecies Extrapolation. Holland and Sielken(2) link the interspecies extrapolation method for formaldehyde to the dose scale, with the use of the DPX dose scale allowing for preferential use of human-rat equivalence when making the interspecies extrapolation (that is, the critical cross-species differences in formaldehyde ADME are assumed to occur prior to the DPX-related covalent bonding of formaldehyde to DNA in respiratory tissues). The bodyweight-power extrapolations generally yield higher risk estimates than those obtained using the human-rat equivalence assumption. The probabilities for each interspecies extrapolation model, conditioned on the dose scale, are shown in Table I, Panel B.

Table I. Conditional Probabilities (%) for Dependent Model Components (from Reference 2)

A. Dose-Response Model Dependency on the Mode of Action and Dose Scale:
Prob[Dose-Response Model | Mode of Action ∩ Dose Scale]

Mode of Action       Dose Scale                Probit Nonlinear Threshold   5-Stage Sublinear   5-Stage Linear   1-Stage Linear
Cell proliferation   ppm inhaled                          50                      45                  5                0
Genotoxicity         ppm inhaled                           0                      15                 50               35
Both                 ppm inhaled                           0                      50                 50                0
Cell proliferation   Total daily intake                   50                      40                 10                0
Genotoxicity         Total daily intake                    0                      10                 50               40
Both                 Total daily intake                    0                      40                 60                0
Cell proliferation   DNA protein cross-links              20                      40                 40                0
Genotoxicity         DNA protein cross-links               0                       2.5               22.5             75
Both                 DNA protein cross-links               0                      18.75              81.25             0

B. Interspecies Extrapolation Dependency on the Dose Scale:
Prob[Interspecies Extrapolation | Dose Scale]

Dose Scale                Human Same as Rat   BW to 3/4ths   BW to 2/3rds
ppm inhaled                      20                40              40
Total daily intake               50                25              25
DNA protein cross-links          80                10              10

3.1. Available Tools for Implementing Distributional Method

The distributional approach is based on the use of event (or probability) trees to elucidate all combinations of assumptions or values for each of the model components, with each final branch representing a unique combination of these assumptions. So, for example, for the formaldehyde model shown in Fig. 1, with six components involving 2, 3, 3, 4, 2, and 3 options for each, there are (2 × 3 × 3 × 4 × 2 × 3) = 432 branches on the tree. Each branch has associated with it a particular value of the risk metrics that are computed from the bioassay results using the set of assumptions associated with the branch.
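The branch count above follows directly from enumerating the Cartesian product of the component options. A minimal sketch (option labels paraphrased from the components listed in Section 2):

```python
from itertools import product

# Options for the six model components of the formaldehyde probability tree.
components = {
    "human_carcinogen": ["yes", "no"],
    "mode_of_action":   ["cell proliferation", "genotoxicity", "both"],
    "dose_scale":       ["ppm inhaled", "total daily intake", "DPX"],
    "dose_response":    ["probit", "5-stage sublinear", "5-stage linear", "1-stage"],
    "data_set":         ["MSCC only", "MSCC + benign"],
    "extrapolation":    ["human same as rat", "BW to 3/4ths", "BW to 2/3rds"],
}

# Each element of the product is one unique branch of the tree.
branches = list(product(*components.values()))
print(len(branches))   # 2 x 3 x 3 x 4 x 2 x 3 = 432
```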

Previous applications of the distributional method have used dedicated software to implement the probability tree calculations.(2–10) More recent advances in influence, decision, and event tree software (e.g., References 62–64) have provided additional options for implementing the method. A particular type of model for representing a set of events related by conditional probabilities is referred to as an influence diagram,(65–67) or BBN.(68, 69) BBNs can be used to structure a model for interrelated events, allowing probabilistic inference for elements of the model that are either data-rich or data-poor.(70) Software packages for BBNs are especially well suited for probability tree models, facilitating automated generation of the tree branches and calculation of the resulting probability for each, including incorporation of dependencies (in the form of conditional probabilities) between different model components and assumptions. The compiled BBN network allows computation of the effect of modifying any of the node probabilities by propagating updated probabilities throughout the network. This software does not reduce the burden of identifying the risk estimate at the end of each branch, as these must still be determined by fitting the appropriate dose-response model to the appropriate data set, with the appropriate low-dose calculation and animal-to-human extrapolation.

The distributional approach for formaldehyde is demonstrated here using the Netica BBN software package (http://www.norsys.com/). This platform was chosen because of its flexibility and effective user interface, providing for easy development of the network, straightforward input of prior and conditional probabilities, and visually appealing and informative options for displaying the results of compiled and updated networks. A number of similar software packages are available.

For the prior network shown in Fig. 1, probabilities for each downstream node are computed by multiplying the probabilities along each branch of the network, then summing the probabilities of each branch associated with the corresponding value of the output node. This calculation is done automatically in Netica when the network is compiled. As indicated in Fig. 1, 83.1% of the prior probability of the compiled tree is associated with an MLE risk of zero, with the remaining 16.9% distributed among risk estimates ranging from 10−12 to 10−3 lifetime cancer risk. As expected, the 95th percentile UCL risk estimates are shifted upward, with now only 40.5% of the probability associated with zero risk and the remainder distributed among lifetime risks ranging from 10−8 to 10−3.
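The multiply-along-the-branch, sum-over-branches calculation can be sketched with a miniature two-component tree. The carcinogenicity and mode-of-action priors below are taken from Section 3; the mapping from branches to risk bins is purely hypothetical, since the actual bin assignments come from fitting the dose-response models.

```python
from itertools import product

# Priors from Section 3 (carcinogenicity and mode of action).
p_carcinogen = {"yes": 0.8, "no": 0.2}
p_moa = {"cell prolif": 0.8, "genotoxic": 0.005, "both": 0.195}

# Hypothetical mapping from each branch to a discretized risk bin.
risk_bin = {
    ("no", "cell prolif"): "zero", ("no", "genotoxic"): "zero",
    ("no", "both"): "zero",        ("yes", "cell prolif"): "1e-7",
    ("yes", "genotoxic"): "1e-4",  ("yes", "both"): "1e-5",
}

marginal = {}
for carc, moa in product(p_carcinogen, p_moa):
    p_branch = p_carcinogen[carc] * p_moa[moa]          # multiply along branch
    bin_label = risk_bin[(carc, moa)]
    marginal[bin_label] = marginal.get(bin_label, 0.0) + p_branch  # sum per bin

print(marginal)
```

A BBN package such as Netica performs exactly this enumeration and summation, for all six components, when the network is compiled.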

At the bottom of the output nodes for the MLE risk and the 95th UCL risk, summary numbers are provided indicating the mean ± standard deviation. The moments are dominated in this case by the probabilities computed for the few highest intervals of risk, from 10−3 down to 10−7. As indicated, the mean of the 95th UCL risk is nearly double the mean of the MLE risk, while their standard deviations are nearly equal. While the mean values of the risk metrics are clearly at the upper tails of their respective distributions (as is common for highly skewed distributions that extend over logarithmic scales), changes in the mean provide a good first measure of how the uncertainty distribution for each of these risk metrics, each of regulatory interest, might shift in response to potential new information and associated reductions in uncertainty.

3.2. Model Sensitivity and the Potential Value of New Information

Continuing with the formaldehyde model presented in Fig. 1, the implications of resolving one or more of the uncertainties embodied in each of the model components are now explored. By way of illustration, Fig. 2 shows the results of fixing the values for two component nodes in the probability tree: for node 1, assuming that formaldehyde is a human carcinogen; and for node 6, a determination that interspecies extrapolation is properly made using BW to two-thirds.

Fig. 2. The effect of determining that formaldehyde is a human carcinogen and that BW to the two-thirds power is the appropriate basis for interspecies extrapolation.

When comparing Fig. 2 to Fig. 1, eliminating the branches with no human carcinogenicity and confining the animal-to-human extrapolation to BW to two-thirds is shown to reduce the probability of zero risk from 83% to 76% for the MLE risk metric, and from 41% to 32% for the 95th UCL risk metric. The probabilities that the risk metrics exceed 10−6 increase from 6.6% to 18% for the MLE risk, and from 21% to 45% for the 95th UCL risk. The mean MLE risk and the mean 95th UCL risk both increase as well (by factors of between 4 and 5), from 2.1 × 10−6 to 1.0 × 10−5, and from 3.9 × 10−6 to 1.6 × 10−5, respectively.

Probabilistic inferences derived from resolving the uncertainty in node 6, interspecies extrapolation, also propagate backward through the network to upstream nodes. Since interspecies extrapolation is a child of the dose scale node, this (hypothetical) conclusion results in a back-propagating inference that shifts the dose-scale probability weight away from the DPX biomarker and toward the other two dose metrics. Since the dose scale node is also a parent of the dose-response model, small changes in the latter's probability weights also result.
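This backward propagation is Bayes' rule applied at the dose scale node, and can be checked by hand using the Section 3 priors and the Table I, Panel B conditionals:

```python
# Priors for the dose scale (Section 3) and the Table I, Panel B
# conditionals P(extrapolation = BW to 2/3rds | dose scale).
prior = {"ppm inhaled": 0.1, "total daily intake": 0.3, "DPX": 0.6}
p_bw23_given_scale = {"ppm inhaled": 0.40, "total daily intake": 0.25, "DPX": 0.10}

# Bayes' rule: posterior proportional to prior times likelihood.
unnorm = {s: prior[s] * p_bw23_given_scale[s] for s in prior}
total = sum(unnorm.values())
posterior = {s: v / total for s, v in unnorm.items()}

for scale, p in posterior.items():
    print(f"P({scale} | extrapolation = BW to 2/3rds) = {p:.3f}")
```

The DPX weight falls from 0.60 to roughly 0.34, precisely the shift away from the biomarker dose scale described above; the compiled BBN performs this update automatically.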

The results in Fig. 2 illustrate the effects of a complete elimination of uncertainty in two of the model components (nodes 1 and 6), resulting from perfect information. In most cases, however, only partial resolution is possible, with studies acknowledged to provide only imperfect information. To illustrate the implications of imperfect information, consider studies aimed at resolving the mode of action in node 2. The modified BBN model that includes these studies is shown in Figs. 3A and 3B, with an added node, mode of action study. Both of these figures show the prior state for the respective models. For the case shown in Fig. 3A, the study is assumed to achieve 100% "resolution," yielding perfect information. For the case shown in Fig. 3B, the study is imperfect, with a resolution of 80%. The conditional probability table for the mode of action study given the true mode of action is shown in Table II for this case of 80% resolution.

Fig. 3. (A) Modified BBN model for formaldehyde low-dose cancer risk assessment with added node for mode of action study. Shown here with prior probabilities for the case of perfect information (resolution = 100%). (B) Modified BBN model for formaldehyde low-dose cancer risk assessment with added node for mode of action study. Shown here with prior probabilities for a case of imperfect information (resolution = 80%).

Table II. Conditional Probabilities (%) for Mode of Action Study Results Given True Mode of Action; Assumed Resolution = 80%

                                              Prob[Mode of Action Study Result | True Mode of Action]
True Mode of Action                               CPO      GO      BCP and G
Cell proliferation only                            80      10         10
Genotoxicity only                                  10      80         10
Both cell proliferation and genotoxicity           10      10         80

As indicated, the study is assumed to yield the correct result 80% of the time, with the remaining 20% divided equally between the other two possible study outcomes.
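
The updating implied by Table II can be sketched as a direct Bayes-rule computation. In the following minimal Python sketch, the priors of 0.8 (cell proliferation only) and 0.005 (genotoxicity only) are taken from the text; the 0.195 assigned to the combined mode is an assumption made here so that the prior sums to one, and the sketch is illustrative rather than a reproduction of the Netica model.

```python
# Posterior mode-of-action probabilities given a study outcome, using the
# Table II likelihoods. Priors 0.8 (CPO) and 0.005 (GO) are stated in the
# text; 0.195 (BCPandG) is assumed here so the prior sums to one.

priors = {"CPO": 0.80, "GO": 0.005, "BCPandG": 0.195}

def likelihood(result, true_mode, resolution=0.80):
    """P[study result | true mode]; incorrect results split 1 - resolution equally."""
    return resolution if result == true_mode else (1.0 - resolution) / 2.0

def posterior(result, resolution=0.80):
    joint = {m: priors[m] * likelihood(result, m, resolution) for m in priors}
    total = sum(joint.values())
    return {m: p / total for m, p in joint.items()}

print({m: round(p, 3) for m, p in posterior("GO").items()})
# → {'CPO': 0.773, 'GO': 0.039, 'BCPandG': 0.188}
```

Note that even a "correct" GO finding at 80% resolution leaves most of the posterior weight on CPO, because the CPO prior dominates.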

The effects of different results from the mode of action study are computed by clicking on the appropriate outcome in the mode of action study node (setting the particular outcome probability to 100%). This was done using networks with assumed values of the resolution for the mode of action study ranging from 50% to 100%. The resulting effects on the predicted mean of the 95th UCL risk and the probability that the 95th UCL risk is greater than 10−6 are presented in Figs. 4A and 4B, respectively.

Fig. 4. Effect of mode of action study outcomes on predicted risk model results for studies of varying resolution.

As shown in Fig. 4, when the study resolution is only 50%, the posterior output metrics for all study outcomes are very close to their prior values: 3.9 × 10−6 (prior mean, Fig. 4A), and 20.5% (prior probability of exceeding 10−6). As the study resolution increases, the study results become more effective in shifting the prior values: downward in the case of a CPO (cell proliferation only) finding, and upward in the case of a GO (genotoxicity only) or BCP and G (both cell proliferation and genotoxicity) finding. The CPO finding results in only modest decreases in the risk metrics, with little change over the range of study resolution. This is because the prior probability assigned to the CPO mode of action is already 0.8, so the CPO study result does not dramatically shift the prior probabilities in the network. The GO finding results in only small upward shifts in the risk metrics when the study resolution is low (between 50% and 80%—especially for the probability that the 95th UCL risk exceeds 10−6). However, as the study resolution approaches 100%, much larger increases in the risk metrics result. This is because the GO mode of action has a very low prior probability (0.005), so only a very highly resolved study can shift significant probability to this state. The BCP and G study result imparts a moderately high, and steadily increasing, upward effect on the risk metrics throughout the range of study resolution considered.
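
The strong dependence on study resolution for the low-prior GO state can be illustrated with a one-line Bayes computation. The sketch below assumes, as in Table II, that the two incorrect study outcomes share the remaining probability (1 − r)/2 each; the prior of 0.005 is from the text.

```python
# Posterior probability that the mode of action is "genotoxicity only" (GO)
# after a GO study finding, as a function of study resolution r.
# Prior P(GO) = 0.005 is stated in the text; the two incorrect study
# outcomes are assumed to share likelihood (1 - r)/2 each.

def p_go_given_go_finding(r, prior_go=0.005):
    num = prior_go * r
    den = num + (1.0 - prior_go) * (1.0 - r) / 2.0
    return num / den

for r in (0.5, 0.8, 0.9, 0.99, 1.0):
    print(f"resolution {r:4.2f}: P(GO | GO finding) = {p_go_given_go_finding(r):.4f}")
```

The posterior stays below about 0.04 through 80% resolution, only approaching large values as the resolution nears 100%, consistent with the pattern described for Fig. 4.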

The results shown in Fig. 4 are suggestive of general patterns of information value associated with studies of differing quality. The assumed study outcomes and their probabilities are highly idealized. Furthermore, only the expected shifts in risk model outputs associated with each study outcome are shown. Information value has further dimensions that have been the subject of significant study in the decision analysis literature.(71–73) In particular, by partitioning the prior uncertainty into multiple (in this case, three) preposterior distributions, the variances associated with each, corresponding to the poststudy state of knowledge, are (in most, but not all, cases(74,75)) reduced. This variance reduction may then enable decisions (e.g., whether to regulate a chemical as a carcinogen) with lower expected loss—when the losses are monetized, an expected monetary value for the information can then be computed. Information can also have value for conflict resolution,(76) increasing the probability that stakeholders with different valuations for the outcomes, and possibly different prior probabilities regarding the science, will exhibit greater concordance in their posterior beliefs and subsequent decision preferences. In the following section, the analysis presented thus far is extended to consider mechanistic studies aimed at determining the cancer mode of action, and the demonstration of information value is limited to shifts in posterior beliefs regarding the mode of action. A more complete synthesis of cancer studies, risk assessment, cost-benefit analysis, and information value can build on this approach, but awaits further research and methods development.
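
The preposterior variance reduction mentioned above can be sketched numerically. All numbers in the sketch below, in particular the per-mode risk values, are hypothetical placeholders chosen here for illustration; they are not outputs of the formaldehyde model in the text. The law of total variance guarantees that the expected posterior variance never exceeds the prior variance, even though an individual posterior variance sometimes can.

```python
# Preposterior sketch: expected variance reduction in a hypothetical risk
# estimate from an 80%-resolution mode-of-action study. The priors
# (0.8 / 0.005 / 0.195) and the per-mode risk values are illustrative
# assumptions, not values from the formaldehyde assessment.

priors = {"CPO": 0.80, "GO": 0.005, "BCPandG": 0.195}
risk = {"CPO": 1e-7, "GO": 1e-5, "BCPandG": 5e-6}   # hypothetical
RES = 0.80

def lik(result, mode):
    return RES if result == mode else (1 - RES) / 2

def moments(weights):
    mean = sum(weights[m] * risk[m] for m in weights)
    var = sum(weights[m] * (risk[m] - mean) ** 2 for m in weights)
    return mean, var

prior_mean, prior_var = moments(priors)

# For each possible study result: posterior weights and marginal probability;
# the expected posterior variance averages over the three results.
exp_post_var = 0.0
for result in priors:
    joint = {m: priors[m] * lik(result, m) for m in priors}
    p_result = sum(joint.values())
    post = {m: j / p_result for m, j in joint.items()}
    exp_post_var += p_result * moments(post)[1]

print(f"prior variance: {prior_var:.3e}")
print(f"expected posterior variance: {exp_post_var:.3e}")
```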

4. PLANNING AND INTEGRATING RESULTS FROM MECHANISTIC STUDIES

In the previous section a generic study was posited to lend support for one of three possible modes of action for formaldehyde carcinogenesis. Here, we consider multiple possible studies for addressing the genotoxicity of naphthalene, a similarly important chemical that has been the subject of numerous studies to assess its carcinogenicity. By enumerating possible studies and their outcomes, a framework is provided for considering the information value of specific projects in research planning and interpretation.

4.1. Genotoxicity Studies of Naphthalene

A symposium was held recently to address key scientific issues and uncertainties in cancer risk assessment for naphthalene, a compound undergoing current regulatory review. The naphthalene state-of-the-science symposium (NS3) was held to consider fundamental science issues related to carcinogenic mode of action and quantification of human cancer risk from exposure to naphthalene at environmentally relevant levels. Researchers conducting the most important primary scientific studies on metabolism, biochemistry, animal-to-human extrapolation, and mechanism of action presented their work. A set of reports was developed on the state-of-the-science for each issue, identifying significant scientific uncertainties and proposing specific scientific research that could resolve these uncertainties. Summary information on the symposium is available at: http://www.naphthalenesymposium.org/.

One of the key scientific issues addressed at the symposium concerned the question of whether naphthalene is genotoxic.(77) As in the case of formaldehyde, answering this question is critical to the choice of dose-response functions, low-dose extrapolation, and resulting risk estimates at environmental concentrations. In the report of the symposium subgroup charged with addressing naphthalene genotoxicity,(78) scientific statements that can be made with a high degree of confidence are presented first.

  • 1

    Under conditions employed in the NTP bioassays,(79,80) naphthalene induces respiratory tumors in rodents—though only at sites showing signs of significant cytotoxicity.

  • 2

    Naphthalene metabolites are known to produce oxidative damage in cells, and have been shown to produce both stable and rapidly depurinating adducts under synthetic reaction conditions, but the adducts have not been identified in intact organisms.

Next, the workgroup identified critical questions that cannot now be answered with a high level of confidence, requiring further research and studies for their resolution:

  • 1

    Does naphthalene have characteristics of a genotoxic carcinogen?

  • 2

    Will biomarkers of naphthalene interactions with DNA be generated in animals or humans following environmental and/or occupational exposures?

  • 3

    Do specific molecular lesions in target cells determine the progression from preneoplastic tissue to tumors in rodents exposed to carcinogenic levels of naphthalene?

To address these questions the workgroup formulated a set of six scientific studies:

  • Study I. A short-term rat inhalation bioassay of molecular dosimetry of DNA adducts and cell proliferation studies in olfactory mucosa;

  • Study II. A mechanistic tumor initiation study in mouse skin measuring possible enzyme(s) for metabolic activation of naphthalene, stable and depurinating adducts, abasic sites, and analysis of H-ras mutations in preneoplastic skin and tumors (if they develop);

  • Study III. Measurement of DNA adducts of naphthalene in urine of exposed human populations for use as a possible biomarker;

  • Study IV. Examination of pathways of DNA damage response caused by naphthalene and its metabolites to identify the mode of action for naphthalene carcinogenicity;

  • Study V. An evaluation of the enzymology of naphthalene metabolism, with implications for the likelihood that specific metabolic pathways will occur in different species; and

  • Study VI. Bioassays with knockout and “humanized” mice to determine key enzymes in the initial metabolic activation of naphthalene in mice and study mediation processes for carcinogenicity in the liver or lung.

For each study a summary of the proposed experimental procedure was presented and a list of major inferences that could be drawn from different outcomes of the study was offered.

To illustrate the description and possible inferences that could result from a study, the following prospectus for Study IV is considered.

4.2. Study IV Pathways of DNA Repair

This study addresses the pathways of DNA damage response caused by naphthalene and its metabolites to identify whether naphthalene is a mutagenic or nonmutagenic carcinogen in mice. A series of isogenic cell lines deficient in various DNA metabolism pathways will be utilized to characterize the DNA damage responses caused by test compounds. Based on the results from the cultured cells, mice deficient in specific DNA damage responses (e.g., nucleotide excision repair, NER, responsible for stable bulky adducts) will be exposed to naphthalene.

Major inferences that could be drawn from different outcomes of the study:

  • 1

    DNA damage responses in a series of isogenic cells deficient in various DNA metabolism pathways: Identification of DNA damage responses caused by naphthalene metabolites would support the hypothesis that naphthalene has a genotoxic mechanism. Also, it would show which kind of DNA lesions (stable adducts, depurinating adducts, or abasic sites) significantly impact cells exposed to naphthalene or its metabolites. Failure to identify DNA damage responses caused by naphthalene metabolites would provide some evidence that naphthalene does not have a genotoxic mechanism.

  • 2

    Stable DNA adducts in target tissue: Identification of increased numbers of stable DNA adducts from naphthalene in the exposed DNA repair (NER)-deficient mouse lung versus wild-type mouse lung would support the hypothesis that naphthalene has a genotoxic mechanism and support the hypothesis that naphthalene could cause cancer through a genotoxic mechanism. Failure to identify increased numbers of DNA adducts from naphthalene in exposed DNA repair-deficient mouse lung, compared to its effect on wild-type mouse lung, would provide some evidence that naphthalene is not a genotoxic carcinogen.

  • 3

    Clara cell toxicity in mouse lung: Identification of more extensive Clara cell toxicity in the naphthalene-exposed DNA repair (NER)-deficient mouse lung versus wild-type mouse lung would support the hypothesis that Clara cell toxicity is associated with the existence of DNA damage. Failure to identify acceleration of Clara cell toxicity in the naphthalene-exposed DNA repair (NER)-deficient mouse lung compared to wild-type mouse lung would provide strong evidence that cell toxicity is independent of DNA damage.

  • 4

    Mouse lung tumor (long-term project): Identification of an increased incidence of lung tumors in the naphthalene-exposed DNA repair (NER)-deficient mouse compared to that in the wild-type mouse would support the hypothesis that naphthalene is a genotoxic mouse lung carcinogen. Failure to identify an increased incidence of lung tumors in the naphthalene-exposed DNA repair (NER)-deficient mouse compared to the wild-type mouse would support the hypothesis that naphthalene is a nonmutagenic mouse lung carcinogen.

As indicated, the possible inferences from Study IV results consider both positive and negative study outcomes for the critical questions regarding mode of action for naphthalene, using one of three terms: “supports the hypothesis,” “provides some evidence,” or “provides strong evidence.” Study results such as these can be considered in a probability tree/influence diagram model by translating the strength-of-evidence beliefs into study sensitivities and selectivities for the hypotheses in question.

As in the case of prior probabilities and other conditional probabilities in the network model, different scientific experts may have different beliefs about the appropriate values for a study's sensitivity and selectivity (see, for example, References 31, 81, and 82). That is, the experts may differ in their beliefs regarding the importance of different findings for the questions at hand (a number of disagreements among experts were in fact apparent in the deliberations of the NS3 genotoxicity subgroup). The cancer risk assessment should thus include an examination of the implications of these differences as part of an overall sensitivity analysis of the probability model. In the example that follows, a representative set of study sensitivities and selectivities is used to illustrate the methodology. As with all examples in this article, this is in no way intended to represent an actual risk assessment for naphthalene, but rather to demonstrate a methodology for integrating mechanistic study results into a probability model.

Fig. 5 provides a simple network representation for two elements of the Study IV prospectus on pathways of DNA repair. The following study results are considered and the indicated conditional probabilities are assumed:

Fig. 5. Illustrative consideration of selected experimental results for naphthalene Study IV (outcomes assumed independent, prior probability of genotoxicity set to 0.5, and sensitivities and selectivities chosen by the author solely for illustration of the methodology): (A) prior probability before the study; (B) positive outcomes for both study results; and (C) positive results for cell DNA damage, but negative results for increased tumors in the DNA repair-deficient mouse.

  • 1

    DNA damage responses in a series of isogenic cells deficient in various DNA metabolism pathways:

    • a

      FNR = (1 – Sensitivity) = Prob[No DNA damage response | naphthalene genotoxic] = 0.35

    • b

      FPR = (1 – Selectivity) = Prob[DNA damage response | naphthalene not genotoxic] = 0.20;

  • 2

    Increased incidence of lung tumors in naphthalene-exposed DNA repair-deficient mouse compared to wild-type mouse:

    • a

      FNR = (1 – Sensitivity) = Prob[No increase in lung tumors | naphthalene genotoxic] = 0.15

    • b

      FPR = (1 – Selectivity) = Prob[Increase in lung tumors | naphthalene not genotoxic] = 0.15.

As indicated, the study results based on an increase in tumors in DNA repair-deficient mice are assumed to be both more sensitive and more selective for genotoxicity (with lower false negative and false positive rates) than the results from the study of DNA damage responses in isogenic cells deficient in DNA metabolism pathways. Fig. 5A shows the assumed prior conditions, with the prior probability that naphthalene is genotoxic assumed equal to 0.5. Fig. 5B shows the computed posterior probability that naphthalene is genotoxic given that both study elements yield positive results for genotoxicity. With the assumed prior and the assumed sensitivities and selectivities, the double positive result in Fig. 5B yields a nearly 95% posterior probability that naphthalene is genotoxic. In Fig. 5C mixed evidence (positive results for cell DNA damage, but no increase in tumors in DNA repair-deficient mice) results in a decrease in the probability that naphthalene is genotoxic from 50% to 36%. The net decrease is a consequence of the higher sensitivity and selectivity assumed for the mouse tumor results compared to the cell DNA damage results. The BBN thus provides a formal mechanism for synthesizing multiple sources of (sometimes contradictory) information in a transparent and replicable weight-of-evidence procedure.
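
The Fig. 5 posteriors can be reproduced directly from the stated prior (0.5) and the two studies' false negative and false positive rates, assuming, as in the text, that the two study outcomes are conditionally independent given the true genotoxicity state. The study labels below are shorthand introduced here.

```python
# Two-test Bayes update for naphthalene genotoxicity, using the prior and
# FNR/FPR values stated in the text; outcomes assumed conditionally
# independent given the genotoxicity state.

PRIOR = 0.5
FNR = {"cell_damage": 0.35, "mouse_tumor": 0.15}   # 1 - sensitivity
FPR = {"cell_damage": 0.20, "mouse_tumor": 0.15}   # 1 - selectivity

def posterior(outcomes):
    """outcomes: dict mapping study -> True (positive) / False (negative)."""
    like_g = like_not_g = 1.0
    for study, positive in outcomes.items():
        like_g *= (1 - FNR[study]) if positive else FNR[study]
        like_not_g *= FPR[study] if positive else (1 - FPR[study])
    num = PRIOR * like_g
    return num / (num + (1 - PRIOR) * like_not_g)

# Fig. 5B: both study elements positive for genotoxicity
print(round(posterior({"cell_damage": True, "mouse_tumor": True}), 3))   # → 0.948
# Fig. 5C: positive cell DNA damage, but no tumor increase
print(round(posterior({"cell_damage": True, "mouse_tumor": False}), 3))  # → 0.364
```

The double-positive case gives the "nearly 95%" posterior, and the mixed-evidence case drops the probability to about 36%, matching the figure.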

5. DISCUSSION

The probability model/comprehensive approach provides a rational framework for enumerating the alternative assumptions in a cancer risk assessment, determining the implications of each for low-dose potency and cancer risk estimates, and considering the weight of evidence for the alternative assumptions to assign probabilities to each and compute the resulting probability distribution of risk. The method relies upon expert judgment for the selection of prior probabilities for top nodes in the model as well as for the selection of conditional probabilities that relate the states in downstream and upstream nodes. Given the diversity of scientific issues and disciplines embodied in the cancer risk model, it may be appropriate in many cases to elicit probabilities from different experts for different parts of the model. In addition, since experts often differ considerably in their scientific assessments, methods for exploring and demonstrating the implications of these differences are important, as is consideration of alternative methods for combining inferences from multiple experts.(82–85)

Having calculated a probability distribution for uncertain cancer risk and shown how this distribution can change through consideration of recent or proposed scientific studies, it is worthwhile to step back and consider just what these distributions (and their changes) represent. Is the key finding the change in support for different scientific models and assumptions that occur with different study outcomes, or the overall epistemic uncertainty of the resulting cancer risk estimate? While the probability tree method encourages a holistic consideration of the interaction among the risk assessment component models, experts are asked for their assessments regarding individual model probabilities. These are then combined and aggregated by the model to derive the uncertainty in the cancer risk. The strategy is thus one of “decomposition and recomposition.”(3) However, the set of component models and options considered is limited and does not recognize new formulations or component interactions that might be discovered with ongoing research. As such, the derived uncertainty distribution should be recognized as a conditional distribution. In particular, the distribution is conditional on: (1) the assumption that the suite of options and submodels considered for each component constitutes (or at least reasonably approximates) the full set of possibilities, (2) the assumption that all interactions and dependencies among the different components are captured by the conditional probabilities between linked nodes, and (3) the assumption that the combination of assigned probabilities for individual components (which may result from the elicitation of one or more experts) leads to a meaningful result that can itself be attributed to the individual assessments and their combination through the rules of probability. This surely argues for careful exploration of this method, recognizing that predicted risk results should be viewed as providing order-of-magnitude insights. 
Similarly, the shifts in support for alternative submodels and assumptions associated with different scientific studies and their findings should be considered primarily in terms of their direction (+ or −) and general magnitude (small, medium, or large).

The probability tree method has now been available for well over a decade in the scientific literature and in selected applications. It is thus fair to ask: Has the approach had an impact on the characterization of uncertainty in recent IRIS database entries for cancer risk assessment?

In a 2000 review conducted by the EPA and Versar, Inc., an evaluation of the treatment of variability and uncertainty in IRIS risk assessments was made.(86) Benzene, a very intensely studied carcinogen, was one of the compounds considered in the Versar-EPA report. The study reviewed the IRIS entry for benzene posted in 2000, with the result that two of the three expert reviewers reported that “EPA did not adequately characterize uncertainty,” failing especially to consider alternative models. The benzene entry has since been revised, with the latest update posted in April 2003 (http://www.epa.gov/iris/subst/0276.htm, accessed June 18, 2007). The Carcinogenicity Assessment for Lifetime Exposure section of the IRIS assessment provides an extensive review of the human, animal, and supporting data that lead to benzene's classification as a known human carcinogen. Rather than presenting single-point estimates for potencies and unit risk factors, a range of values is presented (albeit relatively narrow ranges considering the full set of uncertainties that typically affect even well-studied compounds such as benzene). These ranges are:

  • 1

    Oral Ingestion (extrapolation method—linear extrapolation of human occupational data): Oral Slope Factor — 1.5 × 10−2 to 5.5 × 10−2 per (mg/kg)/day; Drinking Water Unit Risk — 4.4 × 10−7 to 1.6 × 10−6 per (μg/L).

  • 2

    Inhalation (extrapolation method—low-dose linearity utilizing maximum likelihood estimates of Crump, 1994, Reference 19 in this article): Unit Inhalation Risk—2.2 × 10−6 to 7.8 × 10−6 is the increase in the lifetime risk of an individual who is exposed for a lifetime to 1 μg/m3 benzene in air.

The ranges in each case extend over a factor of ∼3–4.
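
The quoted factor can be verified with a quick arithmetic check on the three IRIS ranges listed above; each spans a factor of roughly 3.5 to 3.7.

```python
# Ratio of upper to lower bound for each IRIS benzene range quoted above.
ranges = {
    "oral slope factor": (1.5e-2, 5.5e-2),
    "drinking water unit risk": (4.4e-7, 1.6e-6),
    "inhalation unit risk": (2.2e-6, 7.8e-6),
}
for name, (lo, hi) in ranges.items():
    print(f"{name}: factor of {hi / lo:.2f}")
```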

In Section II. C.3 of the IRIS assessment for benzene: Additional Comments (Carcinogenicity, Inhalation Exposure), the Agency makes note of an uncertainty analysis conducted by Crump(19) as part of the study used to derive the unit inhalation risk:

The two most important determinants of the magnitude of the unit risk number are the choice of extrapolation model to be used to estimate risk at environmental levels of exposure and the choice of the exposure estimates to which the Pliofilm workers (Rinsky et al., 1981, 1987) were subjected. Crump (1992, 1994) presented 96 unit risk calculation analyses by considering different combinations of the following factors: (1) different disease endpoints, (2) additive or multiplicative models, (3) linear/nonlinear exposure-response relationships, (4) two different sets of exposure measurements (Crump and Allen [1984] versus exposure estimates by Paustenbach et al. [1993]), and (5) cumulative or weighted exposure measurements. The unit risk estimates range from 8.6 × 10−5 to 2.5 × 10−2 at 1 ppm (3,200 μg/m3) of benzene air concentration (Crump, 1992, 1994).

The method used by Crump(19) is clearly a variation of the probability tree approach presented in this article, with options available for five component models, yielding 96 unique combinations that define the branches of the model tree. The IRIS report does go on to justify its selection of a particular exposure study(86,87) and the use of a linear model for low-dose extrapolation, yielding the estimated range for the unit risk from lifetime inhalation of benzene as 2.2 × 10−6 to 7.8 × 10−6 at 1 μg/m3. The corresponding range from the Crump probability tree assessment is from 2.7 × 10−8 to 7.8 × 10−6 at 1 μg/m3. As such, the Crump probability tree model has the same upper bound as that determined by the Agency and reported in the IRIS assessment, but a lower bound that is a factor of 80 below that reported by the Agency. Neither study attempts to determine probabilities for the different values that occur within the ranges calculated.
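
The structure of the Crump enumeration can be sketched as a Cartesian product over the component options, together with the bound comparison quoted above. The option counts below are a hypothetical factorization chosen here only so that the product equals 96; the source does not state how many options each factor had.

```python
# Branch enumeration in the style of the Crump probability tree. Option
# counts are a hypothetical factorization (6 x 2 x 2 x 2 x 2 = 96); the
# source does not list the count for each factor.
from itertools import product

options = {
    "endpoint": [f"endpoint_{i}" for i in range(6)],      # assumed 6
    "model": ["additive", "multiplicative"],
    "dose_response": ["linear", "nonlinear"],
    "exposure_set": ["Crump-Allen", "Paustenbach"],
    "exposure_metric": ["cumulative", "weighted"],
}

branches = list(product(*options.values()))
print(len(branches))  # → 96

# The lower-bound comparison quoted in the text:
print(round(2.2e-6 / 2.7e-8))  # → 81, the "factor of 80"
```

Each tuple in `branches` corresponds to one unit risk calculation; assigning a probability to each option on each factor would turn this enumeration into the probability tree discussed in the article.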

The example provided for benzene indicates that the Agency is willing to present and discuss key uncertainties and resulting ranges for potency and unit risk values as part of an IRIS cancer risk assessment. While the need for bright lines in many regulatory applications is likely to continue to constrain the consideration of the full range of uncertainty in reported estimates, the door is open for broader explorations of the fundamental uncertainties that affect these estimates, both within the IRIS summaries and as part of the more detailed toxicological reviews.

In recent years, the emphasis in risk assessment has shifted (even) more strongly toward the need for improved fundamental understanding of the mechanistic physiological and cellular processes that affect animal and human dosimetry and ADME, as well as the modes of action for cancer, including cytotoxicity, cell proliferation, genotoxicity, and mutagenicity. Methods for uncertainty analysis that are able to interface with these studies can help to provide a stronger linkage between basic research and risk assessment activities that inform societal choices for chemical use, treatment, or cleanup. The distributional/probability tree method reviewed in this article is one approach that can begin to meet this need. By focusing attention on the fundamental assumptions in each of the components of a cancer risk assessment, and by determining how together they affect the resulting model predictions, the method provides the basis for a sounder determination of the full range of uncertainty in risk estimates. It also provides a tool for estimating the potential value of information of studies targeting different components of the risk model with differing sensitivity and selectivity. Readily available software packages for influence diagrams, decision trees, and Bayesian belief networks can help to facilitate the construction and evaluation of these models. Close interaction between risk assessors and health scientists who understand the current state of the art for key model components, and the potential insights and limitations of alternative study designs, can help to ensure that the models and uncertainty analyses that result are well informed and well informing.

Footnotes

  • 1 Unlike classical confidence intervals, which strictly speaking do not represent the probability distribution of the estimated parameters, Bayesian intervals do correspond to a probability distribution for the model parameters and associated risk estimates. However, when informationless priors are chosen, the inferences from Bayesian methods are similar or identical to those obtained with classical methods, including central estimates (with, for example, the mode of the posterior distribution equivalent to the MLE) and equivalence between estimated confidence and credible limits.
  • 2 The Bayesian framework includes a prior probability for events, a likelihood function that relates possible outcomes of an experiment to these events, and a posterior probability distribution for the events that is computed from the prior and the likelihood function using Bayes rule. To specify the Bayesian belief network model, experts are elicited for their priors and their likelihood functions. The model can then be used to compute the posterior distributions that result from different experimental outcomes. In many applications of elicitation using advanced Bayesian statistics, experts are elicited for their priors and their posteriors, given an assumed set of experimental outcomes (the latter is referred to as a predictive distribution). Statistical methods then allow for derivation of the implicit likelihood function associated with the experts' elicited probabilities.(40–42)
  • 3 Advanced statistical methods are now often used to characterize the uncertainty in the form, parameters, and predictions of PBPK (or, more generally, toxicokinetic-toxicodynamic) models, including Bayesian methods such as Markov Chain Monte Carlo (MCMC) simulation.(55–59)
  • 4 For a brief description of the software, developed by Sielken and Associates and referred to as UNRAVEL, see: http://www.sielkenassociates.com/DesktopDefault.aspx?tabid=1310.
  • 5 For example, in addition to the Bayesian belief network application that follows, we have implemented the distributional method using the decision tree/influence diagram software program, PrecisionTree, part of the Palisade DecisionTools suite that works within Excel (and also includes @Risk, see: http://www.palisade.com/decisiontools_suite/). A summary of this implementation is found at: http:\\cee.andrew.ad.cmu.edu\User_Files\MS35\Public\WWW\Info-rmationAnalysis--KanShao.mht.
  • 6 Examples of other commercial or university packages for implementing BBNs include:
    • GeNIe and SMILE: http://genie.sis.pitt.edu/about.html#genie
    • Analytica: http://www.lumina.com/
    • JavaBayes: http://www.cs.cmu.edu/~javabayes/Home/
    • Hugin Knowledge Base: http://www.hugin.com/Products_Services/
    • MSBNx, the Microsoft Bayesian Network Editor and Tool Kit: http://research.microsoft.com/adapt/MSBNx/
    • AgenaRisk: http://www.agenarisk.com/, and
    • Bayes Net Toolbox for Matlab: http://bnt.sourceforge.net/.
  • 7 This formulation of study “resolution” is just one approach for representing imperfect information, among the simplest for the case of three or more possible states of nature and their corresponding study results. For example, an imperfect study need not distribute the incorrect probability equally among the incorrect study results. In the next section, binary states are considered for the genotoxicity of naphthalene (yes or no). A study's accuracy is then characterized by its sensitivity (one minus the false negative rate) and selectivity (one minus the false positive rate). If the sensitivity equals the selectivity, then both are equal to the resolution as defined here.
  • 8 The sensitivity of a study is its true positive rate, that is, the probability that the test will support a hypothesis given that the hypothesis is true. The sensitivity is 1 – the false negative rate (= 1 – FNR), and is also referred to as the power of the study. The selectivity is the true negative rate, that is, the probability that the test will refute a hypothesis given that the hypothesis is false. The selectivity is 1 – the false positive rate (= 1 – FPR).
  • 9 This discussion is motivated by comments provided by an anonymous reviewer.
ACKNOWLEDGMENTS

The initial plan for this article developed from discussions with John Evans, Joshua Cohen, and Lorenz Rhomberg. Special thanks to Robert Sielken, who provided access to a comprehensive set of reports, papers, and software related to the distributional method for cancer risk assessment. Kan Shao and Royce Francis provided assistance in developing the Netica model. The author is indebted to the members of the genotoxicity workgroup of the Naphthalene State-of-the-Science Symposium (NS3) for their inputs and patience in identifying key factors and studies needed to better understand naphthalene genotoxicity: David Brusick, Ercole Cavalieri, Dhrubajyoti Chakravarti, Xinxin Ding, David Longfellow, Jun Nakamura, Eleanor Rogan, and James Swenberg. David Brusick also provided suggestions for this portion of the manuscript. Helpful suggestions at the NS3 were also provided by Kenneth Bogen and Richard Belzer. This article was greatly improved as a result of comments on an earlier draft by John Evans and three anonymous reviewers. Financial support for this article was provided by the U.S. Environmental Protection Agency (EPA), Contract #5-15238 through the Oak Ridge Associated Universities (ORAU), U.S. Department of Energy (DOE). This article has not been subject to formal peer review by the EPA, ORAU, or DOE, and no official endorsement should be inferred.
