Volume 128, Issue 2 pp. 257-264
Article
Full Access

An Analysis of the Influence of Annual Thermal Variables on the Occurrence of Fifteen Warmwater Fishes

Robert M. Scheller

Corresponding Author

Robert M. Scheller

DynTel Corporation6201 Congdon Boulevard, Duluth, Minnesota, 55804 USA

[email protected]Search for more papers by this author
Virginia M. Snarski

Virginia M. Snarski

U.S. Environmental Protection Agency, National Health and Environmental Effects Research Laboratory, Mid-Continent Ecology Division, 6201 Congdon Boulevard, Duluth, Minnesota, 55804 USA

Search for more papers by this author
John G. Eaton

John G. Eaton

U.S. Environmental Protection Agency, National Health and Environmental Effects Research Laboratory, Mid-Continent Ecology Division, 6201 Congdon Boulevard, Duluth, Minnesota, 55804 USA

Search for more papers by this author
Gary W. Oehlert

Gary W. Oehlert

Department of Applied Statistics, University of Minnesota, 1994 Buford Ave., St. Paul, Minnesota, 55108 USA

Search for more papers by this author
First published: 09 January 2011
Citations: 13

Present address: Department of Forest Ecology and Management, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA.

Abstract

Multisource fish-sampling data and U.S. Geological Survey temperature data from streams throughout the United States were used to investigate the influence of derived thermal regime variables on the presence or absence of 15 common warmwater fish species. The 3-year average annual thermal regime was calculated for streams where presence or absence was known for these 15 species. Six variables estimated to be of biological importance to the winter and summer survival and recruitment of a species, including measures of feeding and nonfeeding periods, were calculated from these thermal regimes. Stepwise discriminant analysis and multiple regression were used to select optimal variables for creating multivariate models. Parametric and nonparametric multivariate discriminant analyses were then performed to test our ability to correctly classify presence or absence using the thermal variables. These statistical empirical models were able to correctly predict presence or absence with greater than 90% accuracy for 13 of 15 species. Nonparametric (Kth nearest neighbor) analyses had marginally more accurate predictions than parametric (linear) analyses. This technique may allow for an improved estimation of potential changes in distribution under various global warming scenarios.

A recent United Nations Intergovernmental Panel on Climate Change report (IPCC 1995) has provided new information substantiating the likelihood of global climate change. The effect of climate change on freshwater ecosystems, including effects on freshwater fish, will be profound (Magnuson et al. 1997). Temperature will be a primary factor in determining which freshwater fish species will be most affected. One potential effect of climate change is modification of the geographic distribution of fish species. To predict this modification, it is necessary to estimate temperature tolerances of fishes and then relate these tolerances to the changing environment.

Previous investigations have used field data to estimate maximum temperature tolerances of fishes (Eaton et al. 1995), which in turn were used to project the effects of global warming on fishes in lakes (Stefan et al. 1994, 1995) and streams (Eaton and Scheller 1996). In general, the prevalence of warmwater fish was predicted to increase in the United States and that of coldwater fish to decrease under conditions of increased maximum temperatures. It is likely that some warmwater fishes would benefit as a result of the moderating effect that global warming would have on stressful low temperature conditions prevalent during the winter at the boundaries of their current range. However, relatively little research has been done on the low temperature tolerances of most freshwater fish species.

There is evidence that warmwater fish do not feed or feed only sporadically and eat very little at temperatures below 6–10°C (e.g. Johnson and Charlton 1960; Keast 1968; McComish 1971; Shuter et al. 1980, 1989). For example, Keast (1968) lists the temperatures at which black bullhead Ameiurus melas, black crappie Pomixis nigromaculatus, pumpkinseed Lepomis gibbosus, rock bass Ambloplites rupestris, and largemouth bass Micropterus salmoides start feeding as about 6.5, 6.5, 8, 8.5, and 8.5°C, respectively. Death at low temperatures has been observed to be a function of the duration of winter nonfeeding temperatures, which results in depletion of energy stores (Johnson and Charlton 1960; Shuter et al. 1989; Johnson and Evans 1991) or osmoregulatory dysfunction (Johnson and Evans 1996), or both. In addition to longer-term effects resulting from energy depletion, there is evidence that mortality among some species from acute physiological stress occurs at temperatures between 0°C and 4°C. Sheehan et al. (1990) found that young-of-the-year green sunfish Lepomis cyanellus, bluegill L. macrochirus, and largemouth bass were relatively intolerant to low temperature. All green sunfish died as temperature was reduced from 5°C to 4°C, bluegills suffered mortalities of 32%, 8%, and 4% at 0, 2, and 4°C, respectively, and largemouth bass experienced 4%, 0%, and 12% mortality at 0, 2, and 4°C. in 30-d laboratory exposures. Johnson and Evans (1996) observed mortality among white perch Morone americana to increase from 11% at 4°C to 71% at 2.5°C during 150 d of exposure to low temperatures. A number of studies have documented or demonstrated that smaller fish or young of the year are more vulnerable than larger fish to overwinter mortality (Toneys and Coble 1979; Post and Evans 1989; Shuter et al. 1989), and that good growth conditions in the preceding summer reduce this vulnerability.

The present study describes a multivariate statistical procedure for selecting variables and predicting the presence or absence of 15 fish species using thermal regime criteria developed from fish collection records and U.S. Geological Survey (USGS) stream monitoring station temperature files.

Methods

Data sources

Our analysis is based on two data sources: (1) weekly average stream temperatures, which are drawn from the USGS National Water Data Storage and Retrieval (WATSTORE; Showen 1980) files of stream monitoring stations, and (2) multisource fish-sampling events in close proximity to USGS monitoring sites. These fish data are obtained from a variety of sources (i.e., state fisheries or natural resource agencies, universities) from throughout the contiguous United States. The unique fish collection and water temperature regime combinations for each species are derived from the Fish and Temperature Database Matching System (FTDMS) database (Eaton et al. 1995). Criteria for spatially matching stream monitoring sites and fish collection records are given in that report.

Thermal regime derivation

Fish-sampling records for each species were matched with weekly USGS stream temperature data for the entire sampling year in which fish were collected, starting with week of the year 38 (in late September), plus, when available, the two previous sampling years of USGS stream data. A 3-year mean was thought to better represent the thermal experience of the fish species collected because we assumed that these species were resident at that site. Additionally, a 3-year average would mitigate anomalies that might occur within a single year.

Weekly temperatures of corresponding weeks of the three years were averaged to create an annual thermal regime consisting of 52 average weekly stream temperatures. In cases when USGS temperature data were not available for the full three years, only two complete years (25% of all regimes) or one year (7% of all regimes) of data were considered acceptable. If one or two consecutive weeks of data within a year were missing (a common occurrence in USGS datasets), an average of the previous and next weeks' average temperature was used for the missing weeks. These three year averages, which are associated with one or more sampling events, are referred to as ‘annual thermal regimes' in this study.

Fish presence or absence

If a given species was captured at any time during an annual thermal regime, the species was considered to be present during all of that annual thermal regime. If not, the annual thermal regime was assigned an absence value for that species. It is recognized that false absences may occur due to inadequate or insufficient sampling. However, only streams that were adequately sampled by professional fisheries or natural resource biologists for the 15 species chosen were used as the basis for this study. Because our fish data are derived from external sources, we are dependent on their evaluation of their sampling methods as thorough and complete for all species analyzed.

The data analyzed for each species was limited to annual thermal regimes within those watersheds in which that species was known to occur (Boschung et al. 1983). The subregion watershed classification system developed by Maxwell et al. (1995) provided the watershed boundaries used for determining whether to include or exclude “absence” thermal regime values for each species. Subregion watersheds were considered the potential (i.e., unimpeded or feasible access) geographic range of each species, and thermal regimes found outside of this range were not included in the final analysis. As a consequence, the total number of sites available to each species is unique. This seemed the best way to allow for inclusion of sites which are absences due to thermal conditions and to provide for expansion of a species' range under climate change conditions. Nevertheless, some of our absence values may result from factors unrelated to temperature (e.g., river size, flow, competition, habitat, etc.).

Thermal regime variables

From each annual thermal regime classified as to presence or absence for each of the 15 species, we derived six variables which were considered potentially important components or attributes of warmwater fishes' thermal experience. These six variables, designated TOTAL, >8C, ≤8C, FACTOR, WEEK8C, and WEEK2C, are defined below and were largely based on the limited low temperature effects information described previously.

The TOTAL variable is defined as the cumulative temperature for all weeks within the annual thermal regime. We used the unit degree-weeks (DW), defined as the sum of the average weekly temperatures for the defined period, to quantify the cumulative temperature experienced during these thermal conditions. Degree-weeks integrates time and temperature over longer-term periods (months, seasons) within the annual thermal regime. The TOTAL variable quantifies the thermal regime over the entire annual cycle and was thought to take into consideration those physiological processes that occur gradually or serially over a long period of time (e.g., gonadal development and maturation) that result in important life cycle events such as reproduction. Although TOTAL is essentially equivalent to average temperature multiplied by the number of weeks in a year, the unit was retained to facilitate comparisons to the ≤8C and >8C variables.

Each annual thermal regime was subdivided based on whether the weekly average temperature is above 8°C (>8C variable) or equal to or below 8°C (≤8C variable). These variables are chosen to approximate the feeding and nonfeeding periods within the annual thermal regime. Although the temperature at which fish start feeding after overwintering has been reported to vary among species, variation in feeding activity with temperature also has been observed between different sizes of the same species (Keast 1968, 1977). At or below this temperature, warmwater fish are quiescent or dormant and either don't grow or lose weight. Within a few degrees above this temperature, they become active and are capable of growth (Keast 1968). Thus the >8C variable quantifies the period of growth. The ≤8C variable quantifies the period within a year of zero or negative fish growth, analogous to the starvation period described by Shuter and Post (1990).

The FACTOR variable (dimensionless) is calculated by dividing the >8C variable into the ≤8C variable. Shuter and Post (1990) demonstrated that this proportion or coefficient may be critical in determining the northern limits of some warmwater fishes.

The WEEK8C variable is equal to the number of weeks at or below minimum critical feeding temperature (8°C). This is a measure of the time that fish experience zero growth or starvation. The rationale for this variable is that the duration of the nonfeeding period might be more important to survival than the actual temperature.

The WEEK2C variable, which is equal to the number of weeks at or below 2°C, was included to cover the possibility that very low temperature might cause mortality through osmoregulatory dysfunction (Johnson and Evans 1996) or other physiological mechanisms.

Statistical Analysis

Annual thermal regime data sets for each fish species were analyzed individually with multivariate statistical procedures using SAS statistical software (SAS Institute 1990). First, the annual thermal regime was rejected if all the weekly temperatures exceeded the minimum feeding temperature (no value for FACTOR; ≤8C and WEEK8C equal to zero). The relative influence of these omitted warm stream data sets was tested by assigning a value of one to the ≤8C variable and performing identical analyses. The addition of these annual thermal regimes, which constituted less than 7% of data for any species, did not substantially change the results. Specificity and sensitivity values calculated with both linear discriminant analysis (LDA) and nonparametric Kth nearest neighbor (KNN) analysis were less than 2.4% different. Two variables were transformed to meet the assumptions of normality: the square root of FACTOR was substitued for FACTOR, and WEEK8C was substituted with the arcsine of the value of WEEK8C divided by the maximum value of WEEK8C (stratified by presence and absence). All variables were subsequently standardized (mean = 0, SD = 1). Next, several variable selection techniques were used to identify which of the six thermal regime variables optimized the discriminant function analysis. The selection of optimal variables generally improves the performance of any subsequent discriminant function analyses. Stepwise discriminant analysis (forward selection, backward selection, and full stepwise) and a “best subsets” method (using Mallow's Cp and the relationship between regression and two-group linear discriminant analysis (see, for example, Kleinbaum et al. 1988) were used for variable selection. If multiple combinations of variables were chosen using these different methods, then each combination of variables, for each species, was saved for subsequent submission to our discriminant function analyses. These analyses allowed the selection of the optimal variables separately for each species and to reduce multicollinearity between variables.

Following the variable selection process, LDA and KNN analysis were performed on each of the unique combinations of variables to determine how successfully the derived thermal variables classified annual thermal regimes into presence or absence. Quadratic discriminant analysis and classification tree methods were also used for 7 of the 15 species, but these methods proved to be less accurate than LDA and KNN analysis. Prior probabilities were assumed to be equal in the analysis due to the uncertainty and difficulty of accurately estimating ecological priors (Williams 1983). A nearly unbiased classification was obtained using an n-fold (or leave-one-out) cross validation (Lachenbruch and Mickey 1968). In n-fold cross validation, n separate classification models are constructed, with n different data sets, with each data set having one observation from the full n-observation data set removed. For each of these n models, we classified the single data point that was not used in that model and found the error rate. Thus no data point was ever used to fit the model that was used to classify the data point.

Results

A summary of select univariate statistics for all variables is presented in Table 1. The model (or combinations of variables) that produced the highest specificity (ability of the model to accurately predict absence) and highest sensitivity (ability to accurately predict presence) for each species are highlighted in bold type. The TOTAL and >8C variables have presence values (means) greater than absence values for all 15 species. The TOTAL and >8C variables were selected as optimal model variables for 8 and 6 species, respectively. The ≤8C variable has a presence value less than the absence value for 11 of 15 species (6 of 8 species when the variable was selected for the model). Brown bullhead, gizzard shad, rock bass and white crappie are the exceptions. The FACTOR variable (the ratio of no-growth to growth periods) has mean presence values lower than absence for 13 species (excepting brown bullhead and rock bass), reflecting longer relative growth periods for presence. The WEEK8C variable was universally chosen as an optimal model variable. The WEEK8C variable has mean presence values lower than absence values for all 15 species, and the range of presence values, notably the maximum, are reduced as compared with absence. The WEEK2C variable was selected as an optimal variable for 12 species. The mean presence values for WEEK2C are lower or equal to mean absence values for 9 of these 12 species, but the results for WEEK2C are difficult to interpret because they are highly skewed towards zero. There is considerable overlap in the ranges of values for presence and absence for all of our variables, although the range of absence values is generally greater than that of presence values. This may be an artifact of the larger sample sizes for absence. Normality was tested and confirmed for our variables, with the notable exceptions of the FACTOR and WEEK2C variables, which are positively skewed.

Table 1. Descriptive statistics for the model variables selected for 15 warmwater fish species. Optimal model variables are in bold italic type face
image
Table 1. Extended
image

Multivariate analyses of the derived thermal variables were successful in classifying presence or absence. Table 2 presents the specificity and sensitivity of these two analyses for each species. Comparing the results from KNN analysis and LDA, we can see that KNN analysis tended to have greater specificity and sensitivity than the LDA. The KNN analysis accurately predicted both presence and absence with greater than 90% success for 12 of 15 species. Predictions by LDA were slightly lower: sensitivity was greater than 80% for 13 of 15 species, and specificity was greater than 80% for 14 of 15 species. A notable failure of both statistical procedures occurred in classifying presence or absence of rock bass. Specificity and sensitivity were 60% and 87% with KNN analysis and 71% and 65% with LDA.

Table 2. Specificity (percent absence correctly predicted as absence) and sensitivity (percent presence correctly predicted as presence) of K-nearest neighbor (KNN; priors = 0.5; K = 15) and linear discriminant analysis (LDA; priors = 0.5)
image

Discussion

Thermal conditions occurring over a much longer period and influencing a wider range of physiological processes were considered in estimating low-temperature tolerances as compared with the high-temperature tolerances estimated previously (Eaton et al. 1995). The results of this study support the hypothesis that low-temperature tolerances, in addition to the upper preferred temperature range, are important in determining the occurrence of warmwater species. The multivariate analysis technique used to obtain these results is unique in combining several thermal regime factors that have previously been shown only individually to influence the thermal tolerances, including overwinter survival, of warmwater fishes. Although the Fish Temperature Database Matching System was originally used to estimate species high-temperature tolerances, these data are also suitable for estimating cold thermal tolerances as used in the present analysis.

Clearly, we have not included all the other biological factors which may determine the ability of a population of a fish species to exist at a given location. However, the variables included in this study do successfully classify most presence or absences or, alternatively, are closely correlated with other factors that do discriminate between presence and absence. The addition of other environmental variables (e.g., stream flow, maximum tolerable temperature) could increase the sensitivity and specificity of this or similar analyses and may increase the ability to predict changes in distribution under altered conditions. No unusual aspects of the life history, biology, thermal responses, or habitat requirements or anomalies in the data set of rock bass (Scott and Crossman 1973) were found to explain the poor ability of the statistical methods to accurately classify presence or absence for this species.

Shuter and Post (1990) used bioenergetic models to demonstrate the importance of body size and duration of winter on the northern geographic limits for Eurasion perch, Perca fluviatilis, yellow perch, Perca flavescens, and smallmouth bass, Micropterus dolomieu. Near their geographic limits, overwinter survival was reduced by starvation of young of year because of restrictions imposed by shortened growing period and lengthened starvation period (see Lyons 1997 for a regional application of the model). Ten of the 15 species used in our study were determined by Shuter and Post (1990) as having a distribution potentially limited by overwinter starvation. Results of the statistical model in our study support the importance of relative growth conditions and shortened nonfeeding periods on distribution. Notably, the FACTOR variable, which is the ratio of nonfeeding to feeding potential, was chosen as an important variable for predicting the presence or absence of 12 species. Presence values are lower than absence for 11 of the 12 selected species (rock bass is the exception) and 13 of the total 15 species, reflecting the importance of a relatively longer growing or feeding period or shortened nonfeeding period. The variable WEEK8C (nonfeeding or stressful cold temperatures) was chosen as an important predictive variable for all 15 species. All presence values were lower than absence values, which is consistent with our expectations that warmwater species would favor streams with shorter cold periods. Although the >8C and ≤8C variables were chosen less often (eight and six species, respectively), the presence values for these variables show a pronounced tendency towards greater feeding periods and reduced nonfeeding periods for these warmwater species. The TOTAL variable, chosen for eight species, encompasses conditions experienced throughout the entire thermal regime. Presence values for TOTAL were consistently greater than absence values.

Other studies on the effects of low temperature on fishes have focused on resident lake populations. Consequently, 4°C has frequently been the minimum temperature available to investigators in these studies (e.g., Toneys and Coble 1979; Post and Evans 1989). In our study, 39% of the streams available in our database sustain average temperatures of less than 4°C for at least 1 week. Our results highlight the important contribution that temperatures at or below 4°C can have on the presence or absence of warmwater fishes in streams. The WEEK2C variable, selected as a temperature previously shown to directly cause mortality, was selected as an important predictive variable for 12 species; 9 of these 12 species had presence values lower than absence values.

The analyses presented in this study describing the relationship between species presence or absence and thermal regime variables are empirical and correlative; therefore, deterministic interpretations are very difficult. Although we cannot infer causation or the relative importance of a variable as compared to the other variables, these results agree with known physiological or life history restrictions imposed by the affects of these different variables.

The somewhat greater success of the nonparametric KNN analysis is probably due to the nonnormality of some of our variables. Unfortunately, the further application of the KNN method requires the use of the original data set used for calibration, whereas the formulae derived from LDA can be applied independently of the original data. Other multivariate statistical analysis methods were explored but found to be inadequate or unsuited to our data. Quadratic discriminant analysis was highly sensitive to normality and extrapolation. Classification regression trees did not provide adequate separation of presence or absence groups. Based on results presented in this paper, we conclude that KNN analysis or LDA could be applied to other species in the FTDMS database with sufficient data to meet the criteria that minimum sample size exceed multivariate dimensionality by at least a factor of three (Williams and Titus 1988).

Summary

The results of this study present a statistical approach for predicting the presence and absence of 15 warmwater fish species based on various thermal variables derived from annual thermal regimes. The ability of our statistical models to discriminate presence from absence strengthens our hypothesis that these potentially biologically significant thermal variables are important to a species' ability to populate a given location. This study used existing data to analyze the effects of derived temperature variables on species throughout their present thermal and geographic range in the United States. This method allows for assessment of thermal suitability (presence or absence) for a species based on measurements of year-round stream temperature or predicted annual thermal regimes. This methodology can be applied to predict changes in warmwater species population abundances or distributions due to climate change, such as the lake temperature models developed by Stefan et al. (1995). Modeling techniques are currently being developed for predicting the effects of climate change on stream thermal regimes that will, in turn, provide the basis for estimating impacts on fisheries resources.

Acknowledgments

Naomi Detenbeck and J. Howard McCormick provided help with preliminary data analysis and important advice on fish biology.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.