Volume 23, Issue 10 pp. 1157-1168

BIODIVERSITY RESEARCH

Full Access

Can incomplete knowledge of species’ physiology facilitate ecological niche modelling? A case study with virtual species

Xiao Feng,

Corresponding Author

Xiao Feng

[email protected]

orcid.org/0000-0003-4638-3927

Department of Integrative Biology, Oklahoma State University, Stillwater, OK, USA

Correspondence

Xiao Feng, Department of Integrative Biology, Oklahoma State University, 501 LSW, Stillwater, OK 74078, USA.

Email: [email protected]

Search for more papers by this author

Monica Papeş,

Monica Papeş

Department of Integrative Biology, Oklahoma State University, Stillwater, OK, USA

Search for more papers by this author

Xiao Feng,

Corresponding Author

Xiao Feng

[email protected]

orcid.org/0000-0003-4638-3927

Department of Integrative Biology, Oklahoma State University, Stillwater, OK, USA

Correspondence

Xiao Feng, Department of Integrative Biology, Oklahoma State University, 501 LSW, Stillwater, OK 74078, USA.

Email: [email protected]

Search for more papers by this author

Monica Papeş,

Monica Papeş

Department of Integrative Biology, Oklahoma State University, Stillwater, OK, USA

Search for more papers by this author

First published: 14 August 2017

https://doi.org/10.1111/ddi.12606

Citations: 13

Share a link

Email
Wechat
Bluesky

Abstract

Aim

Ecological niche modelling (ENM) is widely used in biogeography and conservation studies. The performance of ENM is influenced by the quality of species’ presence and absence datasets. Presences may include marginal localities, and absences are usually difficult to collect. We evaluated the use of species’ physiological limits to improve selection of presences and absences for ENM in a virtual species framework with defined response functions as surrogates for physiological knowledge.

Location

The lower 48 states in USA.

Methods

We generated physiologically informed absences based on either complete or incomplete knowledge of species’ physiology. With the same physiological knowledge, we reduced noise (incorrect or marginal locations) from presence datasets, completely or incompletely. We compared (i) models based on physiologically informed absences and random background points, (ii) models based on presences with and without noise and (iii) models obtained with and without incorporating physiological knowledge in absence and presence datasets.

Results

Only absences based on complete physiological information produced better performing models than random background points. Model improvement was positively correlated with the percentage of noise being removed from the presence data, and best-performing models were obtained with true presences (all noise removed). Manipulating both absences and presences led to better models than manipulating only presences when all or majority of physiological limits were known.

Main conclusions

The benefit of incorporating physiological information into ENM datasets largely depends on completeness of physiological knowledge, but in reality incomplete understanding of species’ physiology is the norm. We found that applying incomplete physiological knowledge to absences may bias ENM, and thus, use of random background points is recommended; on the other hand, removing noise from species’ presence datasets based on incomplete physiological limits increases model performance, thus this approach could potentially improve the effectiveness of ENM applications in conservation planning and invasive species management.

1 INTRODUCTION

Ecological niche modelling (ENM) is a tool widely used in biogeography, conservation biology, and ecology to study species’ ecological niche and distribution through space and time (Elith & Leathwick, 2009; Franklin & Miller, 2009; Peterson et al., 2011). The theoretical basis of ENM is the concept of fundamental niche, defined by Hutchinson (1957) as the set of environmental conditions permitting indefinite existence of species. Generally, the environmental conditions considered in ENM are abiotic in nature (or scenopoetic; Soberón, 2007); thus, the niche being estimated is the abiotic niche (Soberón & Peterson, 2005). The niche estimated in environmental space is used to predict the species’ distribution in geographic space (Peterson et al., 2011).

Without considering species’ dispersal abilities and biotic interactions, we expect that the environmental conditions of species’ presences are part of the fundamental niche, whereas conditions of species’ absences are not (Araújo & Pearson, 2005; Soberón & Peterson, 2005), but usually in reality this is not the case. Species occurrence data, digitalized and stored in publicly accessible databases (e.g., www.gbif.org; www.bison.usgs.ornl.gov), have quality issues that warrant our attention (Beck, Böller, Erhardt, & Schwanghart, 2014; Feeley & Silman, 2010). Presence data can include occasional observations in unsuitable areas (noise), where species cannot persist, for example sink populations (Pulliam, 2000) or marginal occurrences (Feng & Papeş, 2015). Such noise in presence data can blur the modelling goal and bias the model prediction (e.g., towards broader potential distribution; Soley-Guardia, Radosavljevic, Rivera, & Anderson, 2014), but the issue has not drawn much attention, probably because of the difficulty or subjectivity in delineating noise from true presence.

Additionally, true absences are rare (Mackenzie, 2005), because gathering absences is not usually the goal of field surveys and the observed absence may be an artefact of species’ detection or limited dispersal ability rather than environmental unsuitability. In lieu of absence data, pseudo-absences are simulated for ENM algorithms (Engler, Guisan, & Rechsteiner, 2004; Hirzel, Helfer, & Metral, 2001). Frequently, random background points are used (e.g., the default method in MaxEnt; Phillips, Anderson, & Schapire, 2006), a mathematically feasible approach but difficult to interpret because it could mix true presences and true absences. Other methods simulate pseudo-absences from environmentally or/and geographically stratified areas (Barbet-Massin, Jiguet, Albert, & Thuiller, 2012; Iturbide et al., 2015; Senay, Worner, & Ikeda, 2013), but the best method depends on the modelling algorithm (Barbet-Massin et al., 2012). Phillips et al. (2009) used presences of related species as absences for the focal species, but this method requires more data and may not be broadly applicable. Without reliable data inputs (i.e., presences and absences), potential distribution maps obtained with ENM warrant our attention, particularly when used in conservation or invasive species management (Bond, Thomson, & Reich, 2014; Kulhanek, Leung, & Ricciardi, 2011).

Knowledge of species’ physiology has been fundamental in developing biophysical models to estimate potential distributions (Mathewson et al.,2016; Porter, Munger, Stewart, Budaraju, & Jaeger, 1994); however, these are data-intensive approaches requiring detailed information about the organism's energy budget and exchange with the environment and thus are unfeasible for most species (Peterson, Papeş, & Soberón, 2015). In contrast, basic knowledge of conditions that allow species’ survival (physiological limits; e.g., thermal tolerance) may be available for many species (Araújo et al., 2013; Hoffmann, Chown, & Clusella-Trullas, 2013). Here, we focus on integrating species’ physiological limits into ENM to address the issues of (1) noise in presence data and (2) relevance of pseudo-absence data. We assume that species’ physiological tolerance to abiotic conditions reasonably approximates the fundamental niche (Feng & Papeş, 2017; Martínez, Arenas, Trilla, Viejo, & Carreño, 2015). With inferences of what is physiologically tolerable by a species, we can eliminate noise from species’ presence dataset based on biologically meaningful information. Similarly, we can reasonably infer what is physiologically intolerable for a species, thus select biologically meaningful absences, independent of our ability to observe absences.

We tested integration of physiological limits into ENM through a virtual species framework, commonly used in developing and assessing ENM methodologies (Hirzel et al., 2001; Qiao et al., 2016; Zurell et al., 2010). Virtual species are simulated ecological niches, defined by abiotic variables in the environmental space. A simulated ecological niche is usually composed of multiple mathematical functions, each one describing a species’ response to one abiotic condition. The product (e.g., additive or multiplicative) of all mathematical functions determines the focal species’ response to the environment (e.g., suitability or probability of presence on the landscape), which is usually transformed into binary format (presences and absences). The advantage of using virtual species instead of real species is that the distribution of the species is defined and known; thus, the methodological evaluation is mathematically robust (Qiao, Soberón, & Peterson, 2015). We defined the fundamental niche with suitability functions in the virtual species framework and used the functions to determine the species’ physiological limits. However, in real world we seldom have complete knowledge of a species’ fundamental niche or physiological tolerance, so we simulated scenarios of incomplete physiological knowledge.

We designed three experiments to investigate techniques of integrating physiological limits into ENM. In Experiment 1, we simulated absences as (i) random background points, (ii) physiologically informed absences based on incomplete knowledge of species’ physiology and (iii) based on complete knowledge. In Experiment 2, we simulated (i) presences with noise, (ii) presences with noise reduced based on incomplete knowledge of species’ physiology and (iii) based on complete knowledge. In Experiment 3, we combined the design of Experiments 1 and 2. We hypothesized that physiologically informed absences will produce better performing models than random background points because the former are biologically relevant; the presences with reduced noise will produce better performing models than presences with noise because the noise could bias the models; and finally, the integration of physiologically informed absences and presences with reduced noise will lead to highest performing models because of the combined positive effects from both data inputs.

2 METHODS

2.1 Defining virtual species

We simulated ecological niches of virtual species using four bioclimatic variables in a real landscape, the contiguous USA (or lower 48 states). The four bioclimatic variables were annual mean temperature, temperature seasonality, annual precipitation, and precipitation seasonality, downloaded at the resolution of 2.5 arc min from WorldClim (Hijmans, Cameron, Parra, Jones, & Jarvis, 2005). We chose the four variables because they reflected broad climatic patterns (mean and variation of temperature and precipitation) and the correlation among these variables within the study area was moderate (|r| < .52), thus the influence of collinearity was reduced (Dormann et al., 2013). We chose the resolution of 2.5 arc min to guarantee the needed sample sizes for experimental simulations, while minimizing the computation time.

We simulated a pool of random virtual species using the R package virtualspecies (Leroy, Meynard, Bellard, & Courchamp, 2016). We defined each species using four different Gaussian functions, each corresponding to a bioclimatic variable (representing knowledge of one physiological limit) and used the product of the four functions as the final suitability function of each species. The parameters of the Gaussian functions varied across the virtual species and thus produced distinct final suitability maps. To simulate virtual species with realistic distributions, we enabled the realistic option in the R package virtualspecies (Leroy et al., 2016), which ensured a latter function was restricted to areas with higher suitability values defined by the former function or functions, instead of independently generating four Gaussian functions. This option prevents generating unrealistic species and suitability maps (e.g., a species present in both extremely hot and cold areas), because contradictions among suitability functions are avoided (Leroy et al., 2016). We transformed the suitability maps into distribution maps (presences and absences) using a threshold method: pixels with suitability values ≥0.5 were classified as presences and those with values <0.5 as absences. We calculated the species’ prevalence by dividing the number of presence pixels by that of total pixels. Because of the potential influence of species’ prevalence on modelling performance (Manel, Williams, & Ormerod, 2002; Santika, 2011), we randomly selected six virtual species along a gradient of prevalence values (from 0.06 to 0.57; see Fig. S1 in Supporting Information) and evaluated the consistency of our results over this gradient.

2.2 Experimental design

2.2.1 Experiment 1—simulating absences

To explore the idea of simulating absences outside the species’ physiological limits, we generated five different sets of absences (Fig. S2). For set0 (control group), we randomly selected 10,000 points (random background points) from the study area, the predominant method in current ENM studies (Barbet-Massin et al., 2012). For the remaining sets (1, 2, 3 and 4; experimental groups), we simulated equal-size absence datasets using incomplete physiological knowledge (from one to three niche dimensions) for set1 to set3 and complete knowledge (from all four niche dimensions) for set4. Thus, the absences in set4 were true absences. We used the following specific steps for set1: we classified the pixels into presences or absences (i.e., physiologically tolerant or intolerant for the species, respectively) using the suitability function of one bioclimatic variable; then, we randomly selected absences for set1 based on this classification. The product of the functions of two bioclimatic variables was used for set2, three variables for set3 and all four variables for set4. We used the same procedure and threshold (0.5 suitability) explained above to transform suitability maps to presence–absence maps. We included all possible combinations of bioclimatic variables for set1, set2 and set3. To explore the influence of the amount of training presences as well as presence/absence ratio (Barbet-Massin et al., 2012), we randomly selected 100, 300 or 1,000 presences from the distribution map. Coupled with the variation in absence simulations (set0 to set4), we trained models using 15 different combinations of training presences and absences for each virtual species and algorithm (see below, Section 2.3).

2.2.2 Experiment 2—simulating presences

To investigate the effect of reducing noise in the presence datasets with species’ physiological tolerance information, we designed a second experiment with five different sets of presences (Fig. S3). In set0 (control group), we simulated 1,000 presences composed of 80% true presences and 20% noise from less suitable areas (suitability values from 0.3 to 0.5), similar to occasional observations in species’ presence datasets in real situations. In set1 to set4 (experimental groups), we kept the same number of true presences as in control group, but we removed noise using known species’ physiological tolerance, either incompletely, from one to three niche dimensions for set1 to set3, or completely, from all niche dimensions, for set4. As all noise was removed from set4, this represented true presences of virtual species. For set1, we classified the study area pixels into species’ presences (physiologically tolerant) or absences (physiologically intolerant) based on the suitability function of one bioclimatic variable and eliminated the noise using this classification. Similarly, we used the product of the functions of two bioclimatic variables for set2, three variables for set3, and all variables for set4. We included all possible combinations of variables for set1, set2, and set3. The same procedure and threshold (0.5 suitability) as explained above were used to transform suitability maps to presence–absence maps. The number of the presences in the five sets ranged from 1,000, when no noise was removed, to 800, when all noise was removed. To explore the influence of the number of presences on model performance, we repeated the same procedure but started with 100 and 300 training presences with noise, for the five sets (set0 to set4). In all, we trained models with 15 variations of presence sets and a sample of 10,000 random background points, for each virtual species and algorithm (see below, Section 2.3).

2.2.3 Experiment 3—simulating both presences and absences

We designed our last experiment with the goal of investigating the combined effect of using absences outside the species’ physiological limits and presences with reduced noise (Fig. S4). The control set-up was similar as for Experiment 1 and Experiment 2: 1,000 presences of which 20% represented noise (locations from less suitable areas) and 10,000 random background points. For the experimental groups (set1 to set4), we removed noise from presence datasets based on one to four physiological limits; these datasets were paired with physiologically informed absences, simulated using physiological limits in one (set1) to four (set4) environmental dimensions. Thus, Experiment 3 incorporated the approach for simulating absences from Experiment 1 and for refining presences from Experiment 2. We also varied the size of presence datasets by considering 100 or 300 presences. We trained models using 15 different combinations of training presences and training absences, for each algorithm (see below, Section 2.3) and virtual species.

2.3 Modelling algorithms

We used four modelling algorithms to evaluate the consistency of our framework and results: boosted regression tree (BRT; Friedman, Hastie, & Tibshirani, 2000; Leathwick, Elith, Francis, Hastie, & Taylor, 2006), generalized linear model (GLM; McCullagh, 1984; Wintle, Elith, & Potts, 2005), generalized additive model (GAM; Hastie & Tibshirani, 1990; Lehmann, Overton, & Leathwick, 2002) and maximum entropy (MaxEnt; Phillips et al., 2006). BRT is a machine-learning algorithm that uses a boosting method and simple regression tree models (Elith, Leathwick, & Hastie, 2008; Elith et al., 2006). GLM and GAM represent the traditional regression approach, with intermediate performance among ENM algorithms (Elith et al., 2006). MaxEnt is a good performing and widely used algorithm that makes inferences of the probability of species’ presence based on the maximum entropy principle (Phillips & Dudik, 2008).

As comparing performance of algorithms was not the goal of this study, we generally followed the parameter settings from previous studies that have shown reliable performance. We generated BRT models in R package dismo (Hijmans, Phillips, Leathwick, & Elith, 2013) using Bernoulli distribution and learning rate of 0.001, tree complexity of 5, step size of 50, and maximum trees size of 10,000, with fivefold cross-validation (Elith et al., 2008; Ridgeway, 2006). We trained GLM in R (R Core Team, 2014) using binomial with a logit link function and quadratic interaction and adopted the Akaike's information criterion stepwise selection (McCullagh & Nelder, 1989). We performed the GAM experiments in R package mgcv, using logit link with outer, newton optimizers and default degree of freedom (−1) that triggers an internal generalized cross-validation to optimize the actual effective degree of freedom (Wood, 2011). We assigned equal weights to training presences and absences (total weight of presences equals that of absences) for BRT, GLM, and GAM algorithms (Barbet-Massin et al., 2012). Finally, we ran MaxEnt in R package dismo (Hijmans et al., 2013), with default regularization multiplier and autofeatures option that determines the combination and transformation of variables (i.e., features) based on the number of training presences (Phillips & Dudik, 2008).

2.4 Evaluation indices of model performance

A total of 69,120 models were generated by repeating the three experiments 20 times for each virtual species, modelling algorithm and combination of training presences and absences. We used 10% of true presences and the same number of absences of each virtual species as testing data so that the ratio of testing presences and absences was one. We did not use the same pixel (presence or absence) at the same time for training and testing of models unless it was used as a random background point. We evaluated all models using three threshold-dependent indices, calculated with lowest training presence threshold (the lowest suitability value associated with training presences; Pearson, Raxworthy, Nakamura, & Peterson, 2007). Omission rate measures the percentage of presences predicted absent. Commission rate measures the percentage of absences predicted present. True skill statistic (TSS) accounts for both omission rate and commission rate and ranges from −1 to 1, with values above 0 indicating models better than random (Allouche, Tsoar, & Kadmon, 2006).

2.5 Statistical tests

To assess the effect of number of physiological limits on model performance in each experiment, we ran one-way ANOVAs to compare model evaluation indices of the control and four experimental groups. We used Tukey's post hoc analysis for the paired comparison of any two groups (Zar, 2010). When the assumption of homogeneity of variances was violated, we used Welch test and Games–Howell post hoc analysis instead (Games & Howell, 1976; Welch, 1947). We performed the statistical tests for each possible combination of number of training presences, modelling algorithm, and virtual species, expecting to find consistency of patterns across these settings. To compare the differences of manipulating presences only (Experiment 2) versus manipulating presences and absences (Experiment 3), we used t tests on the model evaluation indices for each pair of sets (e.g., set1 vs. set1) or used Welch test when homogeneity of variances was violated (Welch, 1947; Zar, 2010). We ran analyses for each combination of number of training presences, modelling algorithm, and virtual species.

3 RESULTS

3.1 Experiment 1—simulating absences

TSS and omission rate of models increased and commission rate decreased (Figures 1 & S5) as more physiological limits were used to define the training absences (set1 to set4). Models based on absences with partial physiological information (set1 to set3) had significantly lower TSS (p < .05) than models obtained with control group (set0, random background points), in all or majority of combinations of algorithm, number of training presences, and virtual species (Table 1). In contrast, the absences based on complete physiological information (set4) produced models with highest TSS by maintaining relatively low omission and commission rates. Most of the set4 models (74%) had significantly higher TSS (p < .05) than the control group (set0). Overall, set1 models showed the lowest TSS and omission rate but highest commission rate and set4 models showed the highest TSS and omission rate but lowest commission rate. The evaluation indices of set0 models were between the two extremes, although more similar to indices of set4 models, with a few exceptions (omission rates of GAM and MaxEnt models trained with 100 presences; Table 1 & Figures 1 & S5).

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Overview of model performance obtained with four ecological niche modelling algorithms implemented in Experiment 1. The x-axis represents the number of physiological limits (filters) involved in simulating training absences; 0 represents random background points, 4 represents absences based on complete physiological information (four limits) or true absences, and 1–3 represent absences with partial physiological information, based on 1–3 physiologically limits. The y-axis represents one model evaluation index, true skill statistic (TSS). The number of training presences is represented by different colours: red represents 100, green 300, and blue 1,000 presences. Bars represent 95% confidence intervals on the means of models of all species

Table 1. Summary of post hoc analysis of performance of ecological niche models obtained with control and experimental groups. The model evaluation indices (true skill statistic or TSS, omission rate, and commission rate) of one experimental group are either significantly lower or higher (p < .05) than those of the control group or have no significant difference. The values represent the percentage of situations among all (72) combinations of algorithm, number of training presences, and virtual species. The control group in each experiment is set0, which uses random background points and true presences in Experiment 1 and random background points and presences with noise in experiments 2 and 3. The experimental groups are set1 to set4 that include 1–4 physiological limits to simulate training absences (Experiment 1), or presences (Experiment 2), or both absences and presences (Experiment 3)

Experiment	Comparison	TSS (%)			Omission rate (%)			Commission rate (%)
Experiment	Comparison	Lower	No difference	Higher	Lower	No difference	Higher	Lower	No difference	Higher
Experiment 1	set1 vs. set0	100	0	0	65	35	0	0	0	100
	set2 vs. set0	97	0	3	36	56	8	0	0	100
	set3 vs. set0	61	35	4	18	51	31	3	25	72
	set4 vs. set0	1	25	74	8	46	46	88	13	0
Experiment 2	set1 vs. set0	0	76	24	0	99	1	25	75	0
	set2 vs. set0	0	14	86	0	57	43	92	8	0
	set3 vs. set0	3	3	94	0	11	89	100	0	0
	set4 vs. set0	4	4	92	0	0	100	100	0	0
Experiment 3	set1 vs. set0	100	0	0	15	68	17	0	0	100
	set2 vs. set0	47	28	25	6	33	61	26	24	50
	set3 vs. set0	1	18	81	0	13	88	89	11	0
	set4 vs. set0	1	3	96	0	0	100	100	0	0

Percentages may not total 100% because of rounding

3.2 Experiment 2—simulating presences

Comparable to results of Experiment 1, TSS and omission rate of models increased and commission rate decreased (Figures 2 & S6) with more physiological information used to refine the training presences (set1 to set4). With noise incompletely or completely removed (set2 to set4), most models (86%–94%) had significantly higher TSS (p < .05) than that models based on set0 (presences with noise) (Table 1). The percentage of models with significantly higher TSS increased from 24% to 94% as more physiological limits were included (set1 to set4). This pattern was mainly due to changes in the commission rate (Fig. S6), which were related to the percentage of noise being removed (Figures 3 & S7). This outcome was more notable for broadly distributed species. BRT and GAM models trained with 100 presences slightly deviated from the general trend of results, especially by showing high omission rates (Figures 2 & S6).

3.3 Experiment 3—simulating both presences and absences

Overall, the results obtained with Experiment 3 were similar to those obtained with Experiments 1 and 2: as more physiological information was used to manipulate training presences and absences (set1 to set4), the models achieved higher TSS and omission rate and lower commission rate (Figures 4 & S8). Compared with models based on control group (set0, random background points and presences with noise), all models obtained with set1 (considering one physiological limit) had significantly lower TSS (p < .05). The patterns were less clear for models obtained with set2 (two physiological limits), which had a mix of significantly higher or lower relationships with set0 models, or non-significant differences (Table 1). Models based on set3 and set4 (three and four physiological limits, respectively) had significantly higher TSS (p < .05) than set0 models in most cases (81% for set3 models and 96% for set4 models). Similarities between the results of Experiment 3 and Experiment 1 included lowest TSS and omission rate and highest commission rate of models obtained with set1 and highest TSS and omission rate and lowest commission rate for models based on set4 (Figures 4 & S8); also similar to Experiment 1 results, in Experiment 3, the evaluation indices of set0 models were between the two extremes (set1 models and set4 models); but in contrast to Experiment 1, evaluation indices of set0 models approximated set2 models, instead of set4 models. BRT models trained with 100 presences slightly deviated from the general trend of results, especially showing high omission rates (Figures 4 & S8).

3.4 Comparison between Experiment 2 and Experiment 3

Using a single physiological limit to select absences and reduce noise from presences (set1 in Experiment 3), we obtained models with significantly lower TSS (p < .05) and higher commission rate than using physiological limits only to reduce noise from presence datasets (set1 in Experiment 2), in all cases (Table 2 & Figures 5 & S9). In most cases (>82%), two physiological limits applied to both presence and absence simulations (set2 of Experiment 3) produced models with significantly lower TSS and higher commission rate (p < .05) than using two physiological limits to simulate presences (set2 in Experiment 2). The pattern was reversed when using three or four physiological limits (set3 and set4): in more cases, the models of Experiment 3 had significantly higher TSS and lower commission rate (p < .05) than models of Experiment 2. This reversed pattern was more obvious for species with higher prevalence (Figures 5 & S9). The differences in the omission rate were generally less pronounced between the models of the two experiments (Table 2).

Table 2. Summary of statistical analysis of performance of ecological niche models obtained in Experiment 2 and Experiment 3. The model evaluation indices (true skill statistic or TSS, omission rate and commission rate) of one set (control or experimental) in Experiment 3 are either significantly lower or higher (p < .05) than those of the same set in Experiment 2 or are not significantly different. The values represent the percentage of each situation among all (72) combinations of algorithm, number of training presences and virtual species. The control group in each experiment is set0, which uses random background points and presences with noise. The experimental groups are set1 to set4 that include 1–4 physiological limits in simulating training presences (Experiment 2), or both absences and presences (Experiment 3)

Experiment 2 (exp2) vs. Experiment 3 (exp3)	TSS (%)			Omission rate (%)			Commission rate (%)
Experiment 2 (exp2) vs. Experiment 3 (exp3)	Higher in exp2	No difference	Higher in exp3	Higher in exp2	No difference	Higher in exp3	Higher in exp2	No difference	Higher in exp3
set0 vs. set0	0	100	0	0	100	0	0	100	0
set1 vs. set1	100	0	0	35	44	21	0	0	100
set2 vs. set2	82	11	7	24	25	51	7	10	83
set3 vs. set3	29	31	40	13	31	57	39	31	31
set4 vs. set4	3	14	83	13	39	49	93	7	0

Percentages may not total 100% because of rounding.

4 DISCUSSION

We studied three methods of integrating physiological information into ENM by optimizing selection of training presences and absences. The first method, selecting physiologically informed absences, did not produce better performing models than random background points, unless complete knowledge of the species physiology was used (in our study, intolerance to four abiotic conditions). The second method, selecting presences that met physiological limits, produced better performing models and model improvement (measured with commission rate) was positively correlated with percentage of noise being removed from presence dataset. Our findings with these two methods generally held across all combinations of virtual species, algorithm and number of training presences. Our third method, integrating physiologically informed absences and presences with reduced noise, did not produce better performing models than adjusting presence datasets only, unless all or majority (three of four) of physiological limits were known; the improvement in model performance when adjusting both presences and absences was more obvious for species with broader distributions. The patterns we found were rather consistent for all algorithms, although with a few exceptions when fewer presences were used.

4.1 Physiologically informed absences

Although physiologically informed absences could be true absences because the environmental conditions are unsuitable for the species, the approach of simulating physiologically informed absences did not improve overall performance of models (i.e., TSS) unless most or all physiological limits were known. Our interpretation of this outcome is that incomplete information of species’ physiological limits provides a biased understanding of conditions associated with species’ absence, thus misguiding ENM towards a broader estimate of the fundamental niche. Martínez et al. (2015) obtained a broader species’ potential distribution based on one type of physiological tolerance (i.e., thermal) than based on an ecological niche model. The broader estimation should be signalled by higher commission rate, similar to what we observed with absences with partial physiological information. The disagreement between physiological prediction and ENM prediction may be attributed to the effect of overlooking biotic interactions (Martínez et al., 2015); another plausible explanation is that only one type of physiological tolerance amounts to insufficient understanding of the species’ fundamental niche.

In theory, incorporating more dimensions of physiological tolerance should improve identification of true absences, thus reduce overestimation of the fundamental niche; our results supported this expectation. We also found that models of broadly distributed species benefited most from use of physiologically informed absences incorporating all or majority of physiological limits. This outcome is probably due to the fact that such physiologically informed absences approximated the complete information of absence, the positive effect of which outweighed the negative effect of noise in presences. However, optimistic interpretations of this result should be avoided because in reality we rarely have a complete understanding of a species’ physiology. Therefore, the use of physiologically informed absences should be avoided when knowledge of a species’ physiology is scarce. Instead, we recommend the use of random background points because our experiments show a minimal effect of true absences on model optimization. However, we note that our method of selecting random background points may not be ideal in other, diverse situations (Barbet-Massin et al., 2012; Iturbide et al., 2015; Lobo & Tognelli, 2011; Phillips et al., 2009).

4.2 Removing noise from presence datasets

In contrast with the patterns observed for simulated physiologically informed absences, the technique of removing physiologically intolerant presences increased the overall model accuracy. Our interpretation is that removing noise from presence datasets reduces the commission rate, thus avoiding overestimation of species’ fundamental niche or potential distribution. Other studies have used different methods to eliminate noise from occurrence datasets in the context of ENM. For example, a thresholded model based on omission rate (e.g., 10%) may deliberately omit areas of lower suitability probability from the predicted potential distribution (Peterson et al., 2011), corresponding to a narrower estimate of the niche; however, threshold selection depends on researcher’ judgement on the quality of the occurrences, which can be subjective. Soley-Guardia et al. (2014) implemented a two-step approach to detect marginally suitable occurrences, by first building a preliminary niche model and locating occurrences with lower predicted suitability, then using habitat information from collectors’ field notes and literature to determine marginal occurrences; this approach can be objective although may be limited by the role of habitat or availability of such information (Soley-Guardia et al., 2014). Another approach is monitoring population dynamics (Pulliam, 2000; Soberón, 2007) to understand status of species (temporal vs. indefinite existence or sink vs. source population), but logistics and time constraints do not make this approach widely applicable.

Our proposed approach provides a biologically meaningful avenue for handling noise in species’ presence datasets. Physiological limits are species’ responses to abiotic conditions and should provide a reasonable approximation of the fundamental niche (Feng & Papeş, 2017; Martínez et al., 2015). A focal species’ distribution can be better understood through application of physiological limits to the presence dataset (Feng & Papeş, 2017), and the noise-reduced presence dataset can lead to better performing ENM, in our study evidenced by higher TSS and lower commission rate. A lower commission rate generally means a narrower but more precise prediction in geographic space. Such model improvement can increase the effectiveness of ENM conservation applications, for example in rare species surveys or invasive species management (Bond et al., 2014; Kulhanek et al., 2011), because limited effort and resources can be focused on a smaller, targeted area. However, the effect of physiological filters is dependent on their importance on a species’ distribution; best case scenarios are when the known physiological limits are dominant factors shaping species’ distribution and the noise is well expressed along the environmental axes being considered. We also note that models obtained with BRT and GAM trained with 100 presences showed relatively poor performance, characterized by high omission rates; therefore, we recommend avoiding these situations.

4.3 Completeness and effectiveness of physiological limits in ENM

Our study showed that complete knowledge of physiological limits applied to selecting absences will lead to better models than random background points. Better performing models are also obtained if at least half of the physiological limits are known and are applied to selecting both absences and presences, or if any physiological limit is known and is applied to reducing noise in presence data. Therefore, the effectiveness of the three techniques tested here largely depends on our knowledge of species’ physiology. For studies of real species, scholars commonly gain knowledge of physiological limits through reviewing literature (Araújo et al., 2013; Feng & Papeş, 2017; Hoffmann et al., 2013), but this knowledge could also be gained through field surveys (Churchill, 2013; White, Hamilton, & Sarnelle, 2015) and laboratory experiments (Claudi et al., 2013; McMahon, 1996). We expect that species’ physiological information is substantial in the literature, because physiology is one of earliest scientific disciplines in biology (Scheer, 1963). Thermal tolerance is one of the major foci of physiological research, a key determinant of species’ geographic distribution (Gaston, 2003; Kearney & Porter, 2009; Wiens & Donoghue, 2004) and frequently studied in relation to climate change and extinction risk (Sinervo et al., 2010) and invasive species management (Morse, 2009). Precipitation (Byrne, McMahon, & Dietz, 1988), pH (Claudi et al., 2013), salinity (Morton & Tong, 1985), and calcium concentration (Cairns & Yan, 2009) have also been studied for various purposes. Information on physiological limits has been accumulating in the literature, but taxonomic or trait biases could be common (Feng & Papeş, 2017). A complete understanding of species’ physiology seems unattainable for most species, as well as assessing the completeness of physiological knowledge. However, we argue that completeness does not necessarily mean effectiveness. Species’ ecological niche can be defined with a large number of variables (Hutchinson, 1957), but in a real landscape not all variables are playing a crucial role and those that do may not have equal roles. For example, air oxygen concentration must be a crucial factor in defining the niche of nine-banded armadillo, and possibly of many other terrestrial species, but is not expected to limit the species’ geographic range; on other hand, temperature and precipitation are important for shaping the distribution of nine-banded armadillo in North America (Feng & Papeş, 2015). As we have shown in Experiment 2, model improvement was positively associated with the percentage of noise being removed from presence datasets (Figure 3); from this perspective, we argue that, although complete physiological knowledge is rare, identifying one effective variable is more beneficial than considering several noninformative variables in the context of improving ENM.

4.4 Looking forward

Current ENM algorithms (typically correlative) are useful for identifying relationships between species’ presence or absence and the environment, which is an estimate in between the fundamental niche and realized niche (Soberón & Nakamura, 2009), but are generally weak in extrapolations to novel conditions (Peterson et al., 2011). Physiological information mechanistically relates species’ presence and absence with the environment, and these relationships are transferrable to novel conditions (e.g., climate change; Martínez et al., 2015). The drawback is lack of complete understanding of every biological aspect of a species. Future studies may consider different ways of defining physiologically intolerant locations or develop new ENM algorithms that can directly handle information from the perspective of species’ physiology. For example, new algorithms could use biologically meaningful information [e.g., physiological tolerance in Feng and Papeş (2017); expert range map in Domisch, Wilson, and Jetz (2016)] as prior understanding of species’ response to environmental conditions, and update the response functions based on occurrence data in a Bayesian framework (e.g., Brewer, O'Hara, Anderson, & Ohlemüller, 2016).

ACKNOWLEDGEMENTS

The computing for this project was performed at the High Performance Computing Center at Oklahoma State University supported in part through the NSF-OCI 1126330. We thank three anonymous referees for constructive comments on an earlier draft of this manuscript.

BIOSKETCHES

Xiao Feng is a Ph.D. candidate at Oklahoma State University. His research focuses on applications of ecological niche modelling (ENM) on biogeography questions and methodological and conceptual issues in ENM, with special interest in the integration of ENM and physiological information.

Monica Papeş is an assistant professor at University of Tennessee. Her research focuses on species’ geographic distributions and in particular on remote sensing applications to mapping biodiversity, seasonal variation of species’ distributions and predicting areas at risk of invasive species.

Supporting Information

REFERENCES

Allouche, O., Tsoar, A., & Kadmon, R. (2006). Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology, 43, 1223–1232.
10.1111/j.1365-2664.2006.01214.x
Web of Science® Google Scholar
Araújo, M. B., Ferri-Yáñez, F., Bozinovic, F., Marquet, P. A., Valladares, F., & Chown, S. L. (2013). Heat freezes niche evolution. Ecology Letters, 16, 1206–1219.
10.1111/ele.12155
PubMed Web of Science® Google Scholar
Araújo, M. B., & Pearson, R. G. (2005). Equilibrium of species’ distributions with climate. Ecography, 28, 693–695.
10.1111/j.2005.0906-7590.04253.x
Web of Science® Google Scholar
Barbet-Massin, M., Jiguet, F., Albert, C. H., & Thuiller, W. (2012). Selecting pseudo-absences for species distribution models: How, where and how many? Methods in Ecology and Evolution, 3, 327–338.
10.1111/j.2041-210X.2011.00172.x
Web of Science® Google Scholar
Beck, J., Böller, M., Erhardt, A., & Schwanghart, W. (2014). Spatial bias in the GBIF database and its effect on modeling species’ geographic distributions. Ecological Informatics, 19, 10–15.
10.1016/j.ecoinf.2013.11.002
Web of Science® Google Scholar
Bond, N. R., Thomson, J. R., & Reich, P. (2014). Incorporating climate change in conservation planning for freshwater fishes. Diversity and Distributions, 20, 931–942.
10.1111/ddi.12213
Web of Science® Google Scholar
Brewer, M. J., O'Hara, R. B., Anderson, B. J., & Ohlemüller, R. (2016). Plateau: A new method for ecologically plausible climate envelopes for species distribution modelling. Methods in Ecology and Evolution, 7, 1489–1502.
10.1111/2041-210X.12609
Web of Science® Google Scholar
Byrne, R. A., McMahon, R. F., & Dietz, T. H. (1988). Temperature and relative humidity effects on aerial exposure tolerance in the freshwater bivalve Corbicula fluminea. Biological Bulletin, 175, 253–260.
10.2307/1541566
Web of Science® Google Scholar
Cairns, A., & Yan, N. (2009). A review of the influence of low ambient calcium concentrations on freshwater daphniids, gammarids, and crayfish. Environmental Reviews, 17, 67–79.
10.1139/A09-005
CAS Web of Science® Google Scholar
Churchill, J. C. (2013). Spatio-temporal spawning and larval dynamics of a zebra mussel (Dreissena polymorpha) population in a North Texas Reservoir: Implications for invasions in the southern United States. Aquatic Invasions, 8, 389–406.
10.3391/ai.2013.8.4.03
Web of Science® Google Scholar
Claudi, R., Prescott, T. H., Prescott, K. L., Mastitsky, S. E., Evans, D., & Taraborelli, A. C. (2013). Evaluating high pH for control of dreissenid mussels. Management of Biological Invasions, 4, 101–111.
10.3391/mbi.2013.4.2.02
Google Scholar
Domisch, S., Wilson, A. M., & Jetz, W. (2016). Model-based integration of observed and expert-based information for assessing the geographic and environmental distribution of freshwater species. Ecography, 39, 1–11.
10.1111/ecog.01925
Web of Science® Google Scholar
Dormann, C. F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carré, G., & Lautenbach, S. (2013). Collinearity: A review of methods to deal with it and a simulation study evaluating their performance. Ecography, 36, 27–46.
10.1111/j.1600-0587.2012.07348.x
Web of Science® Google Scholar
Elith, J., Graham, C. H., Anderson, R. P., Dudik, M., Ferrier, S., Guisan, A., & Zimmermann, N. E. (2006). Novel methods improve prediction of species’ distributions from occurrence data. Ecography, 29, 129–151.
10.1111/j.2006.0906-7590.04596.x
Web of Science® Google Scholar
Elith, J., & Leathwick, J. R. (2009). Species distribution models: Ecological explanation and prediction across space and time. Annual Review of Ecology, Evolution, and Systematics, 40, 677–697.
10.1146/annurev.ecolsys.110308.120159
Web of Science® Google Scholar
Elith, J., Leathwick, J. R., & Hastie, T. (2008). A working guide to boosted regression trees. Journal of Animal Ecology, 77, 802–813.
10.1111/j.1365-2656.2008.01390.x
CAS PubMed Web of Science® Google Scholar
Engler, R., Guisan, A., & Rechsteiner, L. (2004). An improved approach for predicting the distribution of rare and endangered species from occurrence and pseudo-absence data. Journal of Applied Ecology, 41, 263–274.
10.1111/j.0021-8901.2004.00881.x
Web of Science® Google Scholar
Feeley, K. J., & Silman, M. R. (2010). Modelling the responses of Andean and Amazonian plant species to climate change: The effects of georeferencing errors and the importance of data filtering. Journal of Biogeography, 37, 733–740.
10.1111/j.1365-2699.2009.02240.x
Web of Science® Google Scholar
Feng, X., & Papeş, M. (2015). Ecological niche modelling confirms potential north-east range expansion of the nine-banded armadillo (Dasypus novemcinctus) in the USA. Journal of Biogeography, 42, 803–807.
10.1111/jbi.12427
Web of Science® Google Scholar
Feng, X., & Papeş, M. (2017). Physiological limits in an ecological niche modeling framework: A case study of water temperature and salinity constraints of freshwater bivalves invasive in USA. Ecological Modelling, 346, 48–57.
10.1016/j.ecolmodel.2016.11.008
Web of Science® Google Scholar
Franklin, J., & Miller, J. A. (2009). Mapping species distributions: Spatial inference and prediction. Cambridge, UK: Cambridge University Press.
10.1017/CBO9780511810602
Google Scholar
Friedman, J., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: A statistical view of boosting. Annals of Statistics, 28, 337–407.
10.1214/aos/1016218223
Web of Science® Google Scholar
Games, P. A., & Howell, J. F. (1976). Pairwise multiple comparison procedures with unequal n's and/or variances: A Monte Carlo study. Journal of Educational Statistics, 1, 113–125.
Google Scholar
Gaston, K. J. (2003). The structure and dynamics of geographic ranges. Oxford, UK: Oxford University Press.
10.1093/oso/9780198526407.001.0001
Web of Science® Google Scholar
Hastie, T., & Tibshirani, R. (1990). Generalized additive models, 1st edn. London, UK: Chapman and Hall.
Web of Science® Google Scholar
Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G., & Jarvis, A. (2005). Very high resolution interpolated climate surfaces for global land areas. International Journal of Climatology, 25, 1965–1978.
10.1002/joc.1276
PubMed Web of Science® Google Scholar
Hijmans, R. J., Phillips, S., Leathwick, J., & Elith, J. (2013). dismo: species distribution modeling. R package version 0.8-5. Retrieved from http://CRAN.R-project.org/package=dismo
Google Scholar
Hirzel, A. H., Helfer, V., & Metral, F. (2001). Assessing habitat-suitability models with a virtual species. Ecological Modelling, 145, 111–121.
10.1016/S0304-3800(01)00396-9
Web of Science® Google Scholar
Hoffmann, A. A., Chown, S. L., & Clusella-Trullas, S. (2013). Upper thermal limits in terrestrial ectotherms: How constrained are they? Functional Ecology, 27, 934–949.
10.1111/j.1365-2435.2012.02036.x
Web of Science® Google Scholar
Hutchinson, G. E. (1957). Concluding remarks. Cold Spring Harbor Symposia on Quantitative Biology, 22, 415–427.
10.1101/SQB.1957.022.01.039
Web of Science® Google Scholar
Iturbide, M., Bedia, J., Herrera, S., del Hierro, O., Pinto, M., & Gutiérrez, J. M. (2015). A framework for species distribution modelling with improved pseudo-absence generation. Ecological Modelling, 312, 166–174.
10.1016/j.ecolmodel.2015.05.018
Web of Science® Google Scholar
Kearney, M., & Porter, W. (2009). Mechanistic niche modelling: Combining physiological and spatial data to predict species’ ranges. Ecology Letters, 12, 334–350.
10.1111/j.1461-0248.2008.01277.x
CAS PubMed Web of Science® Google Scholar
Kulhanek, S. A., Leung, B., & Ricciardi, A. (2011). Using ecological niche models to predict the abundance and impact of invasive species: Application to the common carp. Ecological Applications, 21, 203–213.
10.1890/09-1639.1
PubMed Web of Science® Google Scholar
Leathwick, J. R., Elith, J., Francis, M. P., Hastie, T., & Taylor, P. (2006). Variation in demersal fish species richness in the oceans surrounding New Zealand: An analysis using boosted regression trees. Marine Ecology Progress Series, 321, 267–281.
10.3354/meps321267
Web of Science® Google Scholar
Lehmann, A., Overton, J. M., & Leathwick, J. R. (2002). GRASP: Generalized regression analysis and spatial prediction. Ecological Modelling, 157, 189–207.
10.1016/S0304-3800(02)00195-3
Web of Science® Google Scholar
Leroy, B., Meynard, C. N., Bellard, C., & Courchamp, F. (2016). virtualspecies, an R package to generate virtual species distributions. Ecography, 39, 599–607.
10.1111/ecog.01388
Web of Science® Google Scholar
Lobo, J. M., & Tognelli, M. F. (2011). Exploring the effects of quantity and location of pseudo-absences and sampling biases on the performance of distribution models with limited point occurrence data. Journal for Nature Conservation, 19, 1–7.
10.1016/j.jnc.2010.03.002
Web of Science® Google Scholar
Mackenzie, D. I. (2005). Was it there? Dealing with imperfect detection for species presence/absence data. Australian & New Zealand Journal of Statistics, 47, 65–74.
10.1111/j.1467-842X.2005.00372.x
PubMed Web of Science® Google Scholar
Manel, S., Williams, H. C., & Ormerod, S. J. (2002). Evaluating presence-absence models in ecology: The need to account for prevalence. Journal of Applied Ecology, 38, 921–931.
10.1046/j.1365-2664.2001.00647.x
Web of Science® Google Scholar
Martínez, B., Arenas, F., Trilla, A., Viejo, R. M., & Carreño, F. (2015). Combining physiological threshold knowledge to species distribution models is key to improving forecasts of the future niche for macroalgae. Global Change Biology, 21, 1422–1433.
10.1111/gcb.12655
PubMed Web of Science® Google Scholar
Mathewson, P. D., Moyer-Horner, L., Beever, E. A., Briscoe, N. J., Kearney, M., Yahn, J. M., & Porter, W. P. (2016). Mechanistic variables can enhance predictive models of endotherm distributions: The American pika under current, past, and future climates. Global Change Biology, 23, 1048–1064.
10.1111/gcb.13454
PubMed Web of Science® Google Scholar
McCullagh, P. (1984). Generalized linear models. European Journal of Operational Research, 16, 285–292.
10.1016/0377-2217(84)90282-0
Web of Science® Google Scholar
McCullagh, P., & Nelder, J. A. (1989). Generalized linear models. London, UK: Chapman and Hall.
10.1007/978-1-4899-3242-6
CAS Google Scholar
McMahon, R. F. (1996). The physiological ecology of the zebra mussel, Dreissena polymorpha, in North America and Europe. American Zoologist, 36, 339–363.
Google Scholar
Morse, J. T. (2009). Assessing the effects of application time and temperature on the efficacy of hot-water sprays to mitigate fouling by Dreissena polymorpha (zebra mussels Pallas). Biofouling, 25, 605–610.
10.1080/08927010902989245
PubMed Web of Science® Google Scholar
Morton, B., & Tong, K. Y. (1985). The salinity tolerance of Corbicula fluminea (Bivalvia: Corbiculoidea) from Hong Kong. Malacological Review, 18, 91–95.
Google Scholar
Pearson, R. G., Raxworthy, C. J., Nakamura, M., & Peterson, A. T. (2007). Predicting species distributions from small numbers of occurrence records: A test case using cryptic geckos in Madagascar. Journal of Biogeography, 34, 102–117.
10.1111/j.1365-2699.2006.01594.x
Web of Science® Google Scholar
Peterson, A. T., Papeş, M., & Soberón, J. (2015). Mechanistic and correlative models of ecological niches. European Journal of Ecology, 1, 28–38.
10.1515/eje-2015-0014
Google Scholar
Peterson, A. T., Soberón, J., Pearson, R. G., Anderson, R. P., Martínez-Meyer, E., Nakamura, M., & Araújo, M. B. (2011). Ecological niches and geographic distributions. Princeton, NJ: Princeton University Press.
10.23943/princeton/9780691136868.001.0001
Google Scholar
Phillips, S. J., Anderson, R. P., & Schapire, R. E. (2006). Maximum entropy modeling of species geographic distributions. Ecological Modelling, 190, 231–259.
10.1016/j.ecolmodel.2005.03.026
Web of Science® Google Scholar
Phillips, S. J., & Dudik, M. (2008). Modeling of species distributions with Maxent: New extensions and a comprehensive evaluation. Ecography, 31, 161–175.
10.1111/j.0906-7590.2008.5203.x
Web of Science® Google Scholar
Phillips, S. J., Dudík, M., Elith, J., Graham, C. H., Lehmann, A., Leathwick, J., & Ferrier, S. (2009). Sample selection bias and presence-only distribution models: Implications for background and pseudo-absence data. Ecological Applications, 19, 181–197.
10.1890/07-2153.1
PubMed Web of Science® Google Scholar
Porter, W., Munger, J., Stewart, W., Budaraju, S., & Jaeger, J. (1994). Endotherm energetics – from a scalable individual-based model to ecological applications. Australian Journal of Zoology, 42, 125–162.
10.1071/ZO9940125
Web of Science® Google Scholar
Pulliam, H. R. (2000). On the relationship between niche and distribution. Ecology Letters, 3, 349–361.
10.1046/j.1461-0248.2000.00143.x
Web of Science® Google Scholar
Qiao, H., Peterson, A. T., Campbell, L. P., Soberón, J., Ji, L., & Escobar, L. E. (2016). NicheA: Creating virtual species and ecological niches in multivariate environmental scenarios. Ecography, 39, 805–813.
10.1111/ecog.01961
Web of Science® Google Scholar
Qiao, H., Soberón, J., & Peterson, A. T. (2015). No silver bullets in correlative ecological niche modelling: Insights from testing among many potential algorithms for niche estimation. Methods in Ecology and Evolution, 6, 1126–1136.
10.1111/2041-210X.12397
Web of Science® Google Scholar
R Core Team (2014). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. http://www.R-project.org/
Google Scholar
Ridgeway, G. (2006). Generalized boosted regression models. Documentation on the R Package ‘gbm’, version 1.5–7. Retrieved from http://www.i-pensieri.com/gregr/gbm.shtml
Google Scholar
Santika, T. (2011). Assessing the effect of prevalence on the predictive performance of species distribution models using simulated data. Global Ecology and Biogeography, 20, 181–192.
10.1111/j.1466-8238.2010.00581.x
Web of Science® Google Scholar
Scheer, B. T. (1963). Animal physiology. New York, NY: Wiley.
Google Scholar
Senay, S. D., Worner, S. P., & Ikeda, T. (2013). Novel three-step pseudo-absence selection technique for improved species distribution modelling. PLoS ONE, 8, e71218.
10.1371/journal.pone.0071218
CAS PubMed Web of Science® Google Scholar
Sinervo, B., Mendez-de-la-Cruz, F., Miles, D. B., Heulin, B., Bastiaans, E., Villagran-Santa Cruz, M., & Sites, J. W. (2010). Erosion of lizard diversity by climate change and altered thermal niches. Science, 328, 894–899.
10.1126/science.1184695
CAS PubMed Web of Science® Google Scholar
Soberón, J. M. (2007). Grinnellian and Eltonian niches and geographic distributions of species. Ecology Letters, 10, 1115–1123.
10.1111/j.1461-0248.2007.01107.x
PubMed Web of Science® Google Scholar
Soberón, J. M., & Nakamura, M. (2009). Niches and distributional areas: Concepts, methods, and assumptions. Proceedings of the National Academy of Sciences, USA, 106, 19644–19650.
10.1073/pnas.0901637106
CAS PubMed Web of Science® Google Scholar
Soberón, J., & Peterson, A. T. (2005). Interpretation of models of fundamental ecological niches and species’ distributional areas. Biodiversity Informatics, 2, 1–10.
10.17161/bi.v2i0.4
Google Scholar
Soley-Guardia, M., Radosavljevic, A., Rivera, J. L., & Anderson, R. P. (2014). The effect of spatially marginal localities in modelling species niches and distributions. Journal of Biogeography, 41, 1390–1401.
10.1111/jbi.12297
Web of Science® Google Scholar
Welch, B. L. (1947). The generalization of `Student's’ problem when several different population variances are involved. Biometrika, 34, 28–35.
10.1093/biomet/34.1-2.28
CAS PubMed Web of Science® Google Scholar
White, J. D., Hamilton, S. K., & Sarnelle, O. (2015). Heat-induced mass mortality of invasive zebra mussels (Dreissena polymorpha) at sublethal water temperatures. Canadian Journal of Fisheries and Aquatic Sciences, 72, 1221–1229.
10.1139/cjfas-2015-0064
Web of Science® Google Scholar
Wiens, J. J., & Donoghue, M. J. (2004). Historical biogeography, ecology and species richness. Trends in Ecology and Evolution, 19, 639–644.
10.1016/j.tree.2004.09.011
PubMed Web of Science® Google Scholar
Wintle, B. A., Elith, J., & Potts, J. M. (2005). Fauna habitat modelling and mapping: A review and case study in the Lower Hunter Central Coast region of NSW. Austral Ecology, 30, 719–738.
10.1111/j.1442-9993.2005.01514.x
Web of Science® Google Scholar
Wood, S. N. (2011). Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73, 3–36.
10.1111/j.1467-9868.2010.00749.x
Web of Science® Google Scholar
Zar, J. H. (2010). Biostatistical analysis, 5th edn. Upper Saddle River, NJ: Prentice Hall.
Google Scholar
Zurell, D., Berger, U., Cabral, J. S., Jeltsch, F., Meynard, C. N., Münkemüller, T., & Grimm, V. (2010). The virtual ecologist approach: Simulating data and observers. Oikos, 119, 622–635.
10.1111/j.1600-0706.2009.18284.x
Web of Science® Google Scholar

Citing Literature

Volume23, Issue10

October 2017

Pages 1157-1168

Can incomplete knowledge of species’ physiology facilitate ecological niche modelling? A case study with virtual species

Abstract