Volume 18, Issue 11 pp. 2611-2621
Hazard/Risk Assessment
Full Access

Prediction of ecotoxicity of hydrocarbon-contaminated soils using physicochemical parameters

Diana C. L. Wong

Corresponding Author

Diana C. L. Wong

Equilon Enterprises LLC, Westhollow Technology Center, P.O. Box 1380, Houston, Texas 77251-1380, USA

Equilon Enterprises LLC, Westhollow Technology Center, P.O. Box 1380, Houston, Texas 77251-1380, USASearch for more papers by this author
Eric Y. Chai

Eric Y. Chai

Equilon Enterprises LLC, Westhollow Technology Center, P.O. Box 1380, Houston, Texas 77251-1380, USA

Search for more papers by this author
Karin K. Chu

Karin K. Chu

Equilon Enterprises LLC, Westhollow Technology Center, P.O. Box 1380, Houston, Texas 77251-1380, USA

Search for more papers by this author
Philip B. Dorn

Philip B. Dorn

Equilon Enterprises LLC, Westhollow Technology Center, P.O. Box 1380, Houston, Texas 77251-1380, USA

Search for more papers by this author
First published: 04 February 2010
Citations: 24

Presented at the 18th Annual Meeting of the Society of Environmental Toxicology and Chemistry, November 16–20, 1997, San Francisco, California, USA.

Abstract

The physicochemical properties of eight hydrocarbon-contaminated soils were used to predict toxicity to earthworms (Eisenia fetida) and plants. The toxicity of these preremediated soils was assessed using earthworm avoidance, survival, and reproduction and seed germination and root growth in four plant species. No-observed-effect and 25% inhibitory concentrations were determined from the earthworm and plant assays. Physical property measurements and metals analyses of the soils were conducted. Hydrocarbon contamination was characterized by total petroleum hydrocarbons, oil and grease, and GC boiling-point distribution. Univariate and multivariate statistical methods were used to examine relationships between physical and chemical properties and biological endpoints. Soil groupings based on physicochemical properties and toxicity from cluster and principal component analyses were generally similar. Correlation analysis identified a number of significant relationships between soil parameters and toxicity that were used in univariate model development. Total petroleum hydrocarbons by gas chromatography and polars were identified as predictors of earthworm avoidance and survival and seed germination, explaining 65 to 75% of the variation in the data. Asphaltenes also explained 83% of the variation in seed germination. Gravimetric total petroleum hydrocarbons explained 40% of the variation in earthworm reproduction, whereas 43% of the variation in plant root growth was explained by asphaltenes. Multivariate one-component partial least squares models, which identified predictors similar to those identified by the univariate models, were also developed for worm avoidance and survival and seed germination and had predictive powers of 42 and 29%, respectively.

INTRODUCTION

The characterization of environmental contamination at hazardous waste sites usually involves determining the concentrations and toxicity of the chemical substances present. In prior waste-site assessments, chemical measurements were taken and compared with human health risk factors. Only recently have plant and animal communities been considered as potentially more susceptible receptors to contaminants.

Petroleum contamination in the environment is frequently evaluated by measuring mixtures of organic chemicals, such as total petroleum hydrocarbons (TPH). The extent of site cleanup based on TPH values has been established by many states in the United States, and cleanup levels vary substantially across jurisdictions. The TPH Criteria Working Group convened in 1993 to address the disparity in cleanup requirements used by states at sites contaminated with petroleum products and to develop a consistent strategy for assessing human health risk at these sites [1].

The development of biological criteria to evaluate soil quality may be preferable to arbitrary chemical criteria cleanup levels. Athey et al. [2] reported that chemical analysis alone has been found to be insufficient to assess potential hazards associated with creosote contamination in a stream. Loehr and Webster [3] pointed out that a limitation of using concentration- based criteria is choosing the right standard, citing the study of Trudell et al. [4], in which two bioremediated soils having similar oil and grease values could be assumed to pose similar risk if oil and grease was used as a criterion but in fact exhibited different Microtox® (Azur, Carlsbad, CA, USA) toxicities due to a difference in PCP concentrations. Bioassays can help define the integrated ecotoxicological effects of environmental contaminants. However, conducting chemical and biological measurements to assess contamination at each waste site or to guide remedial action can become costly. Hence, it would be desirable to be able to relate the concentration of chemicals found at a waste site to some biological response. Loehr and Webster [3] recently summarized several previous reports on remediation studies of soils and wastes related to wood treating and petroleum products that attempted to correlate chemical concentrations with toxicity test results.

Our laboratory has been involved in a research program to understand the effects of hydrocarbon in soils, particularly for applications to a risk-based framework for ecological assessments. Results from studies in which we tested different oil and soil combinations on earthworms, plants, and Microtox have been reported [5-8]. The most recent studies involved determining the ecotoxicity and characterizing the physical and chemical parameters of eight hydrocarbon-contaminated soils. The toxicity of these soils was assessed using earthworm avoidance, survival, and reproduction and seed germination and root growth for four plant species. Physical property measurements and metals analyses of the soils were conducted. Hydrocarbon contamination was characterized using TPH, oil and grease, and GC boiling-point distribution. The toxicity results, along with a discussion of test protocol evaluation and refinement and analytical assessment of the contaminated soils, have been reported by Saterbak et al. [9]. Readers should refer to Saterbak et al. [9] for a more thorough presentation of the experiments from which the data used in this study were derived. The objectives of this study were to examine the relationships between physical parameters, chemical parameters, and biological endpoints; to determine similarities and differences, based on physicochemical parameters and toxicity endpoints, among soils; and to develop quantitative models using chemical parameters to predict toxicity of hydrocarbon-contaminated soils to earthworms and plants.

Univariate and multivariate statistical methods were used to accomplish the objectives of this study. Correlation, cluster, and principal component analyses (PCA) were used to explore interrelationships between soil physicochemical parameters and biological endpoints. Cluster and PCA were also used to group soils with similar attributes. Lastly, multivariate partial least squares (PLS) and univariate regression analyses were used to relate the toxicity endpoints with chemical parameters. These quantitative models were developed as predictive tools of ecotoxicity using soil chemical parameters.

MATERIALS AND METHODS

Soil collection and characterization

The eight hydrocarbon-contaminated soils and associated control soils were collected from different field locations across the United States. All the soils were contaminated with crude oil except one, which contained lube oil (soil 18). One soil was also contaminated with brine (soil 8), and another was also contaminated with produced water (soil 14). The metals concentrations found in the soils were within the range of naturally occurring concentrations reported by Deuel and Holliday [10] except soil 9. The spill age ranged from 3 months to more than 5 years at the time of collection.

Physical and chemical characterization of the contaminated and control soils was conducted (Table 1). The analytical methods used in these characterizations were described by Saterbak et al. [9]. Briefly, physical parameters, such as pH, electrical conductance, porosity, bulk and particle densities, total organic carbon, and soil texture (% sand, clay, and silt), were determined along with trace metal analyses. Hydrocarbon contamination was characterized using TPH, oil and grease, and GC boiling-point distribution. Total petroleum hydrocarbons were determined both gravimetrically and by GC (tphgc). Freonextractable and carbon disulfide-extractable gravimetric oil and grease were determined. The carbon disulfide extracts were also used to speciate hydrocarbons by GC into the following classes: total saturates, consisting of n- and iso-saturates and ring-saturates; total aromatics; polars; and asphaltenes. The TPH and oil and grease measurements were considered as crude measurements in this study and the speciated hydrocarbon classes represented fractionated components of the oil and grease carbon disulfide extracts. Sixteen target polyaromatic hydrocarbons and benzene, toluene, ethylbenzene, and xylenes (BTEX) were also quantified.

Toxicity test endpoints

Toxicity tests with earthworms (Eisenia fetida) and four plant species, corn (Zea mays variety “Trucker's Favorite”), lettuce (Lactuca sativa variety “Great Lakes”), mustard (Brassica rapa), and wheat (Triticum aestivum variety “Taylor”), were conducted. No-observed-effect concentrations (NOECs) were estimated for earthworm avoidance, earthworm survival (in 7-d, 14-d, and chronic tests), earthworm reproduction, seed germination, and root growth. Concentrations that caused 25% inhibition (IC25) in earthworm survival and reproduction, seed germination, and root growth were also calculated if the concentration response data allowed (Table 2).

Statistical analyses

Correlations between physicochemical parameters and toxicity. Correlation analysis, using Pearson product-moment correlation coefficients, was used to explore possible linear relationships between physicochemical parameters and toxicity endpoints. Additionally, correlations between the physical and chemical variables were examined. Highly correlated (p ≤ 0.05) physical and chemical parameters were identified along with statistically significant soil property-toxicity relationships. Screening the physicochemical parameters was necessary because the number of observations (soils tested) was small relative to the number of physicochemical measurements. The relationships between the chemical parameters were used to help select variables for univariate model development.

Univariate models for toxicity prediction. Only hydrocarbon measurements were used as predictor variables. The physical parameters and metal concentrations were generally similar between the contaminated and control soils and were not used in model development. Using only hydrocarbon parameters as predictors also helped to reduce the number of variables.

A series of steps was used to build, if possible, a common prediction model for a group of toxicity endpoints (earthworm avoidance and survival, worm reproduction, seed germination, and root growth). For each toxicity endpoint (y), candidate predictors (x's) were selected from some of the chemical parameters that were correlated and from some that were not correlated. The maximum number of x's selected was limited by the number of soils tested, which ranged from five to eight. In stepwise regression with forward selection, the significance levels for variable entry and removal from the model were p = 0.05. To avoid overfitting the model, a minimum of 3 df for the model residuals was used. Model assumptions of normality and randomness were assessed using the residuals of the fitted model. Subsequently, the common predictor variables within each group of toxicity endpoints were identified. For each predictor variable, the slopes and intercepts of the individual regression models were then tested using the SAS general linear models procedure [11] to determine whether they were significantly different (p ≤ 0.05) from each other. If the slopes and intercepts were not significantly different, a regression with common slope and intercept, y = α0 + βx, was obtained. If only intercepts were significantly different, a regression with common slope and different intercepts, y = αi + βx, where αi = different intercepts for different toxicity endpoints, was obtained. Prediction error of the models was calculated as the square root of the mean square error from analysis of variance. Correlation analysis and univariate model development were done using STATGRAPHICS Plus software [12].

Cluster analysis to group soils. Cluster analyses were used to group soils based on physicochemical parameters. The analyses were done using the average, complete, or single linkage methods in SAS clustering procedures [11]. Single linkage minimizes the distance between points, whereas complete linkage maximizes these distances. Average linkage, on the other hand, determines clustering on the basis of averaged distances between clusters [13]. The criteria used in determining the number of clusters were coefficient of determination (R2) and graphical separation using the root-mean-squared distance for the average clustering method, maximum distance for the complete clustering method, and minimum distance for the single clustering method. Analyses were performed on both the original values and those that had been standardized for mean and variance. Standardization was used to decrease variability between the parameters because the range of concentrations within each parameter differed.

Table Table 1.. Physical and chemical properties of hydrocarbon-contaminated soils
Soil
Parameter 1 2 3 4 5 6 7 8 16 17 18 19 14 23 9
Control + + + + + + +
Contaminated + + + + + + + +
Spill typea CR CR CR CR, BR CR Lube oil CR, PW CR
pH 8.0 8.3 7.5 7.5 7.1 7.0 7.8 7.0 7.3 6.4 6.8 8.8 6.7 8.4 6.3
Electrical conductance, mmho/cm 2.7 1.2 0.32 4.0 0.73 3.3 0.37 0.60 0.72 0.44 0.41 0.7 3.02 0.9 8.81
Sand, % 9 7 82 78 63 64 84 80 84 84 89 68 76 64 77
Silt, % 48 47 2 20 16 12 9 10 9 10 4 13 14 13 2
Clay, % 50 46 16 2 21 24 7 10 14 6 7 19 10 23 21
Porosity, % 39 38 38 41 42 41 35 38 33 34 34 44 45 41 48
Total organic carbon, % 0.2 0.4 0.2 2.6 1.0 2.1 0.2 0.6 0.3 0.7 1.9 0.2 1.8 0.2 5.7
As, mg/kg soil 6.5 7.8 0.9 2.0 3.4 4.7 1.2 1.2 0.9 1.0 0.7 7.5 4.6 9.0 9.0
Ba 580 680 24 40 59 84 42 12 24 13 10 64 107 142 1,788
Cr 4 37 4 4 9 9 4 5 10 3 4 15 12 20 42
Pb 18 12 10 10 13 27 24 12 13 10 12 1 15 3 15
Se 0.1 0.1 0.1 0.1 0.1 0.1 0.4 0.4 0.1 0.1 0.1 0.1 0.1 0.1 0.3
Zn 59 50 25 48 34 46 17 18 27 35 120 28 82 33 163
BTEX, mg/kg soilb 0.1 0.1 0.1 0.18 0.1 0.1 0.1 0.41 0.1 4.71 0.1 NDb 0.1 ND 0.65
ogfreonc 500 1,406 500 42,000 1,142 27,000 500 11,000 500 29,000 33,000 500 14,000 500 71,000
ogcs2d 5 2,700 2,800 49,600 4,100 26,800 5 8,000 5 24,800 18,600 ND 10,600 ND 73,500
tphgravc 500 1,943 500 18,000 1,923 7,700 500 8,200 500 12,000 12,000 500 8,300 500 30,000
tphgcc 125 569 125 9,826 125 3,591 125 3,577 125 14,572 1,487 ND 2,723 ND 34,455
n- and iso-saturates ND 541 ND 13,841 ND 3,534 ND 2,544 ND 11,175 6,113 ND 943 ND 8,496
Ring saturates ND 717 ND 9,818 ND 7,240 ND 2,712 ND 6,285 9,325 ND 3,159 ND 24,433
Aromatics ND 763 904 20,537 1,152 7,906 ND 2,032 ND 4,437 2,641 ND 2,703 ND 24,623
Polars ND 49 213 1,934 308 1,179 ND 216 ND 1,438 428 ND 169 ND 6,395
Asphaltenes ND 632 568 4,067 1,378 6,834 ND 488 ND 1,364 74 ND 3,625 ND 9,555
  • a BR = brine; CR = crude; PW = produced water.
  • b BTEX = benzene, toluene, ethylbenzene, and xylenes; ND = not determined (analysis not conducted).
  • c ogfreon = freon-extractable oil and grease; tphgrav = gravimetrically determined total petroleum hydrocarbons; 500 mg/kg represents one-half the method reporting limit of 1,000 mg/kg.
  • d ogcs2 = carbon disulfide-extractable oil and grease; = mg/kg represents one-half the method reporting limit of 10 mg/kg.
  • e tphgc = total petroleum hydrocarbons determined by GC; 125 mg/kg represents one-half the method reporting limit of 250 mg/kg.
Table Table 2.. Earthworm and plants toxicity endpoints from exposure to hydrocarbon-contaminated soils
% Soil
Toxicity test Endpointa 2 4 6 8 9 14 17 18
Earthworm
 Avoidance Survival NOEC 100 1 100 100 NDb ND 1 100
 7-d Acute Survival NOEC 100 100 100 100 ND 100 10 100
 14-d Acute Survival NOEC 100 1 10 100 ND 100 1 100
 14-d Acute Survival IC25 c 7.8 77.5 c ND c 10.0 c
 Chronic Survival NOEC 100 1 10 100 ND ND 1 100
 Reproduction Juveniles/adult/week NOEC 100 1 10 10 ND ND 1 1
 Reproduction Juveniles/adult/week IC25 27.6 10 22.5 0.9 ND ND 1.2 4.0
 Reproduction Cocoons/adult/week NOEC ND 1 10 1 ND ND 1 1
 Reproduction Cocoons/adult/week IC25 ND 7.9 24.3 4.3 ND ND 2.8 1.7
Plant
 Corn Germination NOEC 100 100 77.0 100 30.0 100 83.0 100
 Mustard Germination NOEC ND 4.8 7.7 10.0 0.7 2.0 7.7 65.0
 Mustard Germination IC25 ND 8.8 18.0 23.3 0.7 1.4 18.7 73.5
 Wheat Germination NOEC 100 53.0 30.0 100 30.0 100 77.0 100
 Corn Root growth NOEC 100 16.7 34.7 7.7 20.0 65.0 20.0 100
 Corn Root growth IC25 d 24.7 3.3 13.7 19.5 d 28.8 d
 Mustard Root growth NOEC ND 4.3 1.0 10.0 0.3 2.0 5.3 65.0
 Mustard Root growth IC25 ND 14.3 7.4 71.5 0.6 6.0 14.2 d
 Wheat Root growth NOEC 100 46.7 17.8 100 10 50.5 46.7 100
 Wheat Root growth IC25 d 55.3 30.3 d 17.0 0.9 25.0 d
  • a IC25 = 25% inhibitory concentration; NOEC 5 no-observed-effect concentration.
  • b ND = not determined (test was not conducted with this soil).
  • c Unable to calculate IC25. No treatment response means were <75% of control response means; 100% was used in data analysis.
  • d Unable to calculate IC25. No treatment response means were <75% of control response means.

The physical and chemical parameters were analyzed both separately and jointly for the contaminated soils. Certain physical parameters and metals, such as bulk and particle densities, silver, cadmium, and mercury, were not included in the analysis because the concentrations of these variables were essentially the same across all soils. All chemical parameters and subsets of these parameters were used in separate analyses. Subsets included BTEX and the crude hydrocarbon measurements (e.g., gravimetric TPH) and BTEX with the fractionated components (e.g., polars).

Principal component analysis to group soils. Principal component analysis was used to reduce the number of physicochemical parameters and toxicity endpoints (X) to a smaller set of representative variables or principal components (PCs). The PCs were linear combinations of the variables that explained the variability in the Xs, where the eigenvectors in this case were the generated loadings of the PCs, which displayed the influence of the variables. The influential Xs were those with higher coefficients in its eigenvectors and were the main contributors to the corresponding PCs. Principal component analysis was performed on the same groups of parameters that were used in cluster analyses. The variables were standardized to unit variance before analysis so that an equal weight was given to each variable. The fraction of the variability in the Xs explained by a PC (R2X) was used to determine the number of significant components. Graphical representation of the PCA parameters for scores and loadings provided information about similarities of the soils and correlation structures of the variables, respectively.

Partial least squares models for toxicity prediction. The method of partial least squares projections to latent structures was applied to the data to explore quantitative relationships between multivariate toxicity endpoints (Y) and chemical parameters (X). Partial least squares is particularly useful in analyzing multicollinear data and in cases where the number of variables exceeds the number of observations [14]. All 11 chemical parameters were used as X variables, and the toxicity endpoints were separated into groups as in the univariate model development (i.e., worm avoidance and survival, worm reproduction, seed germination, and root growth). R2X and R2Y were used to evaluate how well the models simultaneously explained the variability in the X and Y variables, respectively. The normality assumption was assessed using the Y residuals. The cross-validation statistic, Q2, provided an estimate of the predictive power of the PLS models. The statistical significance of a component was based on Q2 exceeding the cross-validation threshold for that component. X variables with the greatest influence in explaining variability in Y, or variable importance in projection, were determined by the criterion of a value greater than 1. The PLS scores plots portrayed the relationships between X and Y, expressed by the latent variables t and u, for each soil. The loadings plots, on the other hand, showed the interrelationships between these variables within each component. Principal component analysis and PLS analyses were conducted using SIMCA® software [15].

RESULTS

Soil groupings

Using cluster and PCA analyses, soils were separated into groups on the basis of physicochemical parameters and toxicity endpoints. In general, soils were grouped similarly by cluster and PCA. Based on visual inspection, the soils tended to separate into more groups in the PCA scores plots than in cluster analysis. Only the results from PCA are presented (Table 3). Using physical parameters, soil 9 was different from the other soils on the first PC, whereas soil 2 was different on the second PC. The soil properties that distinguished soil 9 were electrical conductance, total organic carbon, porosity, arsenic, barium, chromium, and zinc (PC1), whereas the variables that distinguished soil 2 were pH and soil texture.

Table Table 3.. Soil groupings from principal component analysis score plots based on analyses of different sets of physical, chemical, and toxicity variables
Parameters
Group Physical Chemical (crude)a Chemical (fractions)b Physical chemical (crude)a Earthworm endpointsc Plants endpointsd
1 2 4 4 2 2 4, 14, 17
2 9 9 9 9 4, 17 6, 9
3 4, 6, 8, 14, 17, 18 17 17 4, 6, 8, 14 8, 18 8
4 2, 6, 8, 14, 18 2, 6, 8, 14, 18 6 18
  • a Includes benzene, toluene, ethylbenzene, and xylenes (BTEX); freon- and carbon disulfide-extractable oil and grease; and gravimetrically and GC-determined total petroleum hydrocarbons.
  • b Includes BTEX, n- and iso-saturates, ring saturates, aromatics, polars, and asphaltenes.
  • c Soils 9 and 14 not tested.
  • d Soil 2 not tested.
Details are in the caption following the image

Plots of scores (A) and loadings (B) for the first and second principal components for earthworm toxicity endpoints in the six soils studied. avoid = avoidance; ic25 = 25% inhibitory concentration; juv = juvenile; noec = no-observed-effect concentration; surv = survival.

For chemical parameters, PCA separated soils 9 and 17, and soil 4 to a lesser extent, from the other soils when either crude or fractionated hydrocarbon parameters were used. The crude and fractionated hydrocarbon parameters, except n- and iso-saturates, in the first PCs, separated soil 9. For soil 17, BTEX was the discriminating variable; this variable also dominated PC2. The remaining five soils appeared to cluster together in the PCA scores plot, with soil 2 exhibiting more difference than the other four soils. When both physical and crude chemical parameters were used, the soil groupings were more similar to groupings based on physical parameters in that soils 4 and 17 were not differentiated.

The PCA scores plot showed that soils 8 and 18 were similar on the basis of earthworm toxicity (Fig. 1A). These two soils tended to be less toxic than average (higher than average NOEC) for worm avoidance and for 14-d acute and chronic survival (PC1). Soils 2 and 17 were observed to be in opposite regions of the scores plot. Soil 2 tended to be less toxic than average, whereas soil 17 was more toxic than average for all endpoints (both PCs). Soil 4 was more similar in toxicity to soil 17, whereas soil 6 was more similar to soil 2. The worm avoidance and 14-d acute and chronic survival endpoints contributed most to PC1, and the juvenile production endpoints dominated PC2 (Fig. 1B).

Based on plant responses, soil 18, and soil 8 to a lesser extent, were separated from the other soils by PCA (Fig. 2A). Soil 18 was different from the other soils and tended to be less toxic than average for all plant endpoints. Of the five remaining soils, soil 9 was the most toxic, and soils 4 and 17 had similar toxicities. Most of the endpoints contributed approximately equally to PC1 (Fig. 2B). On the other hand, the endpoints formed two groups in PC2, and the variation between these two groups was mostly captured by PC2. Mustard germination was differentiated from corn and wheat in PC2 and was more sensitive. Mustard and corn root growth were also differentiated from wheat.

Details are in the caption following the image

Plots of scores (A) and loadings (B) for the first and second principal components for plant toxicity endpoints in the eight soils studied. germ = germination; ic25 = 25% inhibitory concentration; noec = no-observed-effect concentration; rootgr = root growth.

Interrelationships between soil properties and toxicity

Relationships between soil properties and between soil properties and toxicity were discernible from correlation analysis and PCA loadings plots. From correlation analysis, most chemical parameters were significantly correlated with each other (r ≥ 0.70, p ≤ 0.05) with the exception of BTEX and n- and iso-saturates. The BTEX compounds were not highly correlated (r < 0.50) to any other chemical parameters. The n- and iso-saturates were only significantly correlated to total saturates. A few physical parameters and metals also correlated to chemical parameters. Electrical conductance and total organic carbon were positively correlated (r ≥ 0.70) to all hydrocarbon parameters except BTEX and n- and iso-saturates. Barium and zinc were the only metals that showed significant correlations to some of the chemical parameters, such as polars and ring saturates. Principal component analysis loadings plots showed similar associations between the physical and chemical properties.

Earthworm avoidance and survival were correlated only to chemical parameters, whereas reproduction was correlated primarily to physical parameters. Significant negative correlations between worm avoidance and survival and TPH (as measured by GC), carbon disulfide-extractable oil and grease, polars, and n- and iso-saturates were found, whereas cocoon and juvenile production showed significant positive correlations to clay, arsenic, barium, chromium, and lead. Among the plant endpoints, only corn and wheat germination NOECs and wheat root growth NOEC showed significant correlations to physical and chemical parameters. Germination was negatively correlated to most hydrocarbon parameters, whereas root growth was less well correlated.

The relationships between highly correlated physicochemical parameters and biological endpoints were further examined using scatter plots. In many cases, the highly correlated relationships were due to data falling into two clusters, with endpoints reflecting either very low or very high toxicities at low and high concentrations of the physicochemical parameter, respectively. Because these relationships were based on limited data (six to eight soils) and were further limited by the use of NOEC endpoints from widely spaced test concentrations, they are not very surprising, and more data are needed to fill the gaps. Nevertheless, these relationships suggest possible threshold concentrations above which toxic effects are manifested.

Prediction models development

Univariate prediction models. Predictive models for four sets of toxicity endpoints (i.e., earthworm avoidance and survival, earthworm reproduction, seed germination, and root growth) have been constructed (Table 4). Total petroleum hydrocarbons measured by GC and polars were common predictors for earthworm avoidance and survival and seed germination. These hydrocarbon parameters appeared to have greater effect on earthworms than seed germination, as indicated by greater negative slopes in the earthworm models. The earthworm models have common slopes and intercepts and explained 65% of the variation seen in the five avoidance and survival endpoints. Common slopes and intercepts in the tphgc and polars models suggest that each of these parameters have similar effects on avoidance and acute and chronic survival of earthworms. The models for earthworm avoidance and survival using TPH-GC and polars as predictors are
equation image(1)
equation image(2)
Gravimetric TPH and n- and iso-saturates were identified as the “best” predictors of earthworm reproduction (Table 4). However, these variables accounted for only 20 to 40% of the variability observed. Consequently, the models have low predictability, with estimated prediction errors of approx. 60% for gravimetric TPH and 80% for n- and iso-saturates (relative to intercepts).
In addition to tphgc and polars, asphaltenes were also identified as predictors of seed germination (Table 4 and Fig. 3). The germination models, in contrast to the earthworm survival regressions, have common slopes but different intercepts (Fig. 3). The germination models using tphgc, polars, and asphaltenes as predictors are
equation image(3)
where α = 98.54 for corn, 27.90 for mustard germination NOEC, 34.54 for mustard germination IC25, and 86.04 for wheat.
equation image(4)
where α = 97.63 for corn, 26.91 for mustard germination NOEC, 33.56 for mustard germination IC25, and 85.13 for wheat.
equation image(5)
where α = 104.93 for corn, 34.81 for mustard germination NOEC, 41.45 for mustard germination IC25, and 92.43 for wheat.
Table Table 4.. Regression parameters for univariate prediction models for earthworms and plants using soil chemical characteristics
Dependent variable (y) Predictor variablea (x) Interceptb (α) Slope (β) R2 Prediction errorc (%) Range of predictor variable (mg/kg) No. of soils
Avoidance/survival (5 toxicity endpoints) Polars 109.79 −0.0517 0.64 27.43 49–1,934 6–7
tphgc 108.01 −0.0076 0.67 26.52 569–14,572 6–7
Reproductiond (4 toxicity endpoints) tphgrav 36.84 juv IC25 −0.0026 0.39 18.60 1,943–18,000 5–6
46.32 juv NOEC
38.19 cocoon IC25
32.78 cocoon NOEC
nisosat 24.54 −0.0020 0.19 19.79 541–13,841 5–6
Seed germinationd (4 toxicity endpoints) Asphaltenes 104.93 corn NOEC −0.0056 0.83 18.25 74–9,555 7–8
34.81 mustard NOEC
41.45 mustard IC25
92.43 wheat NOEC
Polars 97.63 corn NOEC −0.0077 0.77 20.96 49–6,395 7–8
26.91 mustard NOEC
33.56 mustard IC25
85.13 wheat NOEC
tphgc 98.54 corn NOEC −0.0014 0.76 21.34 569–34,455 7–8
27.90 mustard NOEC
34.54 mustard IC25
86.04 wheat NOEC
Root growth (6 toxicity endpoints) Asphaltenes 71.10 −0.0074 0.43 28.33 74–9,555 7–8
  • a nisosat = n- and iso-saturates; tphgc = total petroleum hydrocarbons determined by GC; tphgrav = gravimetrically determined total petroleum hydrocarbons.
  • b IC25 = 25% inhibitory concentration; juv = juvenile; NOEC = no-observed-effect concentration.
  • c Prediction error = √ mean square error.
  • d Statistically different intercepts for different endpoints were found.
Details are in the caption following the image

Prediction model for seed germination using asphaltenes. The observed toxicity endpoints as a function of asphaltene concentration are plotted. The regression line has a common slope, and intercepts varied depending on plant species. IC25 = 25% inhibitory concentration; NOEC = no-observed-effect concentration.

Table Table 5.. Partial least squares model coefficients for most influential X variables on worm avoidance and survival endpoints
Xa Avoidance NOECb 7-d Survival NOEC 14-d Survival NOEC 14-d Survival IC25c Chronic survival NOEC
tphgc −0.116 −0.060 −0.122 −0.130 −0.113
Polars −0.115 −0.060 −0.122 −0.130 −0.113
nisosat −0.107 −0.056 −0.113 −0.120 −0.105
ogcs2 −0.097 −0.050 −0.103 −0.109 −0.095
Total saturates −0.096 −0.050 −0.102 −0.108 −0.094
  • a nisosat = n- and iso-saturates; ogcs2 = carbon disulfide-extractable oil and grease; tphgc = total petroleum hydrocarbons determined by gas chromatography.
  • b NOEC = no-observed-effect concentration.
  • c IC25 = 25% inhibitory concentration.

Different intercepts within each model suggest differences in interspecies sensitivity to the predictors, with mustard germination being more sensitive than corn and wheat. Furthermore, the intercepts in the asphaltenes, polars, and tphgc models were similar for each species, suggesting that these hydrocarbon parameters were affecting germination in a similar manner. The models explained 75 to 85% of the variation observed in the four germination endpoints for the three plant species. The estimated prediction error was 20 to 25% (relative to intercepts) for corn and wheat and 45 or 75% for mustard (depending on whether it was based on the NOEC or IC25 endpoint, respectively).

Asphaltenes did not appear to predict root growth as well as germination. The model for all three plant species explained only 43% of variability and has an associated prediction error of 40%.

Multivariate prediction models. Partial least squares models that relate worm avoidance and survival and seed germination to soil chemical parameters have been developed. Multivariate models relating worm reproduction and root growth to chemical parameters were less definitive. The first component in the worm reproduction and root growth models was not statistically significant, and the prediction power of the models was very poor (≤5%).

For worm avoidance and survival, there were three statistically significant components. However, the second component did not substantially increase the predictive power of the model (42 vs 48%). The third component did increase the predictive power to 61%, but the most influential X variables in the third component were also in the second component. The additional variable present in the second component that was not present in the first component was BTEX. Therefore, on the basis of these factors, a one-component model was adopted. The one-component model accounted for 69% of the variability in Xs and 63% of the variability in Ys. The model explained 73% of the avoidance data, 78% of the 14-d survival data, 70% of the chronic survival data, and 18% of the 7-d survival data. The predictive power of the model was, likewise, poor for 7-d survival but 60 to 70% predictive of the other endpoints. The most relevant chemical parameters (variable importance in projection > 1) in explaining the worm avoidance and survival endpoints were, in descending order, tphgc, polars, n- and iso-saturates, carbon disulfide-extractable oil and grease, and total saturates (Table 5), because these parameters loaded most heavily into the first component (Fig. 4B). The negative regression coefficient for each chemical variable indicated an inverse relationship between chemical concentration and toxicity endpoint. The coefficients for each chemical parameter were similar for avoidance, 14-d survival, and chronic survival, suggesting that each chemical parameter affected these endpoints in a similar manner. Likewise, for each toxicity endpoint, the coefficients for the five most influential chemical variables were not substantially different, suggesting that any of these five parameters can be used equally well for toxicity prediction. Within the one-component PLS model, a good linear relationship (r = 0.90, p < 0.01) exists between the scores from the chemical parameters and worm avoidance and survival for the seven soils studied, as seen in the scores plot, where X and Y are expressed as the latent variables t and u (Fig. 4A).

Details are in the caption following the image

Partial least squares scores (A) and loadings (B) plots for a one-component model relating chemical parameters and worm avoidance (avoid) and survival (surv) in the seven soils studied. asphalt = asphaltenes; btex = benzene, toluene, ethylbenzene, and xylenes; ic25 = 25% inhibitory concentration; nisosat = n- and iso-saturates; noec = no-observed-effect concentration; ogcs2 = carbon disulfide-extractable oil and grease; ogfreon = freon-extractable oil and grease; ringsat = ring saturates; totalarom = total aromatics; totalsat = total saturates; tphgc = total petroleum hydrocarbons determined by GC; tphgrav = gravimetrically determined total petroleum hydrocarbons.

Table Table 6.. Partial least squares model coefficients for most influential X variables on seed germination endpoints
Xa Corn germination NOECb Wheat germination NOEC Mustard germination NOEC Mustard germination IC25c
Asphaltenes −0.117 −0.108 −0.041 −0.053
Polars −0.107 −0.099 −0.038 −0.049
tphgc −0.102 −0.094 −0.036 −0.047
ogcs2 −0.101 −0.094 −0.036 −0.046
Total aromatics −0.098 −0.090 −0.035 −0.045
Ring saturates −0.092 −0.085 −0.032 −0.042
  • a ogcs2 = carbon disulfide-extractable oil and grease; tphgc = total petroleum hydrocarbons determined by GC.
  • b NOEC = no-observed-effect concentration.
  • c IC25 = 25% inhibitory concentration.

For seed germination, with 11 X variables and four Y variables, a one-component model explained 77% of the variability in Xs and 42% of the variability in Ys. A second component, which was not statistically significant, accounted for an additional 27% of the variability in Ys but only another 7% of the variability in Xs. For each Y in the one-component model, R2 ranged from 9 to 73%, with the model explaining much of variability observed for corn and wheat germination but not for mustard germination. The overall predictive power of the one-component model was 29% for all germination endpoints, with 44 and 54% predictability for corn and wheat germination, respectively. The chemical parameters (Xs) that were most relevant in explaining the germination endpoints were, in descending order, asphaltenes, polars, tphgc, carbon disulfide-extractable oil and grease, total aromatics, and ring saturates (Table 6 and Fig. 5B). The coefficient for each of these chemical parameters indicated an inverse relationship with the germination endpoints. Because the corn and wheat endpoints contributed more to the first component than the mustard endpoints (Fig. 5B), the coefficients reflected greater effect of these chemical parameters on corn and wheat germination than mustard germination (Table 6). Corn and wheat germination was similarly affected by each chemical variable. As seen with worm avoidance and survival, the coefficients for the most influential chemical variables did not differ greatly, indicating that any of the six parameters can be used to predict toxicity to corn and wheat germination. The scores values of the chemical parameters and germination endpoints showed a good linear relationship (r = 0.83, p = 0.01) for the eight soils within the one-component PLS model (Fig. 5A).

DISCUSSION

A key issue in the management of waste sites containing hydrocarbon-contaminated soils is defining environmentally acceptable conditions of the soil such that site closure can be achieved. Environmentally acceptable endpoints for the soil can include chemical concentrations and/or biological responses to the contaminants. Recent recognition that ecological species can be susceptible receptors to waste-site contamination has made it desirable to be able to relate chemical concentrations and toxicity in soils. Loehr and Webster [3] recently evaluated several bioremediation and site-assessment studies [2, 4, 16-18] to determine the extent to which chemical concentrations could be correlated with toxicity test results. They concluded that it was not possible to relate soil toxicity data to the concentrations of specific chemicals in a soil or sludge with the available data. Only one study, that of Callahan et al. [17], reported a good correlation between chemical concentrations and earthworm response at an untreated former pesticide site. Earthworm death, but not several morbidity endpoints, was correlated to earthworm and soil chlordane and to soil DDT and metabolites levels. Furthermore, sample locations ranked on the basis of impact on earthworm responses were found to be directly correlated to locations ranked by soil chemical concentrations. Although Loehr and Webster [3] stated that there was no apparent direct relationship between residual oil and grease or PCP and Microtox 50% effective concentration values in the Trudell et al. [4] study, the data presented generally fell into two clusters, with higher oil and grease or PCP concentrations resulting in lower Microtox 50% effective concentration values.

Details are in the caption following the image

Partial least squares scores (A) and loadings (B) plots for a one-component model relating chemical parameters and seed germination (germ) in the eight soils studied. asphalt = asphaltenes; btex = benzene, toluene, ethylbenzene, and xylenes; ic25 = 25% inhibitory concentration; nisosat = n- and iso-saturates; noec = no-observedeffect concentration; ogcs2 = carbon disulfide-extractable oil and grease; ogfreon = freon-extractable oil and grease; ringsat = ring saturates; totalarom = total aromatics; totalsat = total saturates; tphgc = total petroleum hydrocarbons determined by GC; tphgrav = gravimetrically determined total petroleum hydrocarbons.

The use of multivariate methods in this study has helped elucidate and implicate possible relationships between soil properties and toxicity. Multivariate statistics have been increasingly used in recent years to relate sets of variables in environmental studies, particularly in studies to assess biological impact of contaminated sediments and studies in which the sediment quality triad approach was used [19-24]. Eriksson et al. [14] also illustrated the utility of PLS analysis in modeling aquatic toxicity data.

In our study, correlation, cluster, and PCA analyses were valuable in defining interrelationships between the soil physicochemical parameters and toxicity endpoints. A number of statistically significant correlations between soil physicochemical parameters and toxicity to earthworms and plants were identified. Significant correlations between physical parameters, chemical parameters, and biological endpoints suggest the likelihood that fewer analytical and toxicity measurements may sufficiently define contamination in field soils. Cluster analysis and PCA also were used to relate soils on the basis of soil characteristics and toxicity. The ability to differentiate a large number of soils on the basis of physicochemical parameters makes it possible to conduct toxicity tests only on representative soils and to predict the toxicity of the other soils on the basis of soil parameters.

Multivariate PLS and univariate regression analyses independently identified similar soil chemical parameters as potential predictors of toxicity. Regression analyses identified TPH (as measured by GC), polars, and asphaltenes as univariate predictors that explained much of the observed variability in earthworm survival and seed germination. The univariate models indicated that these hydrocarbon parameters affected earthworms and plants differently and also differentiated sensitivity between plant species. Multivariate one-component PLS models were also developed for worm avoidance and survival and seed germination; the chemical parameters identified by these models appeared to have similar toxicity effects and can be used equally well to predict toxicity.

Because tphgc was identified as a common predictor of worm avoidance and survival and seed germination, its measurement and use as an initial parameter in waste-site assessment to screen soil quality and toxicity are recommended. Our data indicate that effects on earthworm avoidance and survival were apparent at tphgc concentrations of ≥10,000 mg/kg soil, whereas little or no acute effects were observed at <4,000 mg/kg. Effects on seed germination were species specific, with mustard being affected at tphgc concentrations of ≥2,000 mg/kg. Toxic effects on wheat germination were observed at ≥10,000 mg/kg in most soils, whereas corn germination was substantially affected in only one soil, which had a tphgc concentration of 34,000 mg/kg. The tphgc concentrations at which effects on earthworm avoidance and survival and seed germination were observed in this study can provide initial screening levels for the various biological endpoints in a tier-1 risk assessment.

Quantitative models relating soil chemical parameters and ecotoxicity, to our knowledge, have not been previously reported. Judicious use of the models is advised because they were developed with limited data (seven to eight soils) and are valid only for the range of chemical concentrations found in this study. The predictive power of these models was reasonable in light of the small data set. The inability to develop models for worm reproduction and root growth may be due to the higher variability observed in these data when compared to the corresponding survival and germination data. The worm reproduction data were further limited to only six soils, and the PLS algorithm in SIMCA may not have been as effective. The cross-validation method used in SIMCA to determine prediction power sequentially removes each observation or soil, and its corresponding toxicity values, and considers them as test data. The prediction power is then determined by the overall ability of the model to regenerate the Y values. Because of the small number of data points, a model derived from X number of data points can be very different from a model derived from X − 1 data points, because each data point contributes significantly to the model.

Acknowledgements

We thank Ann Saterbak and Robin Toy, who directed the experimental studies from which the data used in this study were derived. Bruce McMain and M. Patty Williams provided invaluable technical support. The insightful comments provided by the two reviewers helped to improve the manuscript.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.