Volume 13, Issue 4 e10014
RESEARCH ARTICLE
Open Access

Detection of vertebrates from natural and artificial inland water bodies in a semi-arid habitat using eDNA from filtered, swept, and sediment samples

Rupert McDonald

Rupert McDonald

Trace and Environmental DNA Laboratory, School of Life and Molecular Sciences, Curtin University, Perth, Australia

Contribution: Conceptualization (equal), Data curation (equal), Formal analysis (equal), ​Investigation (equal), Methodology (equal), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
Philip W. Bateman

Philip W. Bateman

Trace and Environmental DNA Laboratory, School of Life and Molecular Sciences, Curtin University, Perth, Australia

Behavioural Ecology Lab, School of Molecular and Life Sciences, Curtin University, Perth, Australia

Contribution: Conceptualization (equal), ​Investigation (equal), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
Christine Cooper

Christine Cooper

School of Molecular and Life Sciences, Curtin University, Perth, Australia

Contribution: Conceptualization (equal), Formal analysis (equal), ​Investigation (equal), Visualization (equal), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
Mieke van der Heyde

Mieke van der Heyde

Trace and Environmental DNA Laboratory, School of Life and Molecular Sciences, Curtin University, Perth, Australia

Contribution: Conceptualization (equal), Data curation (equal), Formal analysis (equal), ​Investigation (equal), Methodology (equal), Supervision (equal), Visualization (equal), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
Mahsa Mousavi-Derazmahalleh

Mahsa Mousavi-Derazmahalleh

Trace and Environmental DNA Laboratory, School of Life and Molecular Sciences, Curtin University, Perth, Australia

Contribution: Data curation (supporting), Formal analysis (supporting)

Search for more papers by this author
Brock A. Hedges

Brock A. Hedges

School of Biological Sciences, The University of Adelaide, Adelaide, Australia

Contribution: ​Investigation (supporting), Writing - original draft (supporting), Writing - review & editing (supporting)

Search for more papers by this author
Michelle T. Guzik

Michelle T. Guzik

School of Biological Sciences, The University of Adelaide, Adelaide, Australia

Contribution: ​Investigation (supporting), Writing - original draft (supporting), Writing - review & editing (supporting)

Search for more papers by this author
Paul Nevill

Corresponding Author

Paul Nevill

Trace and Environmental DNA Laboratory, School of Life and Molecular Sciences, Curtin University, Perth, Australia

Correspondence

Paul Nevill, Trace and Environmental DNA Laboratory, School of Life and Molecular Sciences, Curtin University, Perth, Western Australia 6845, Australia.

Email: [email protected]

Contribution: Conceptualization (equal), Funding acquisition (lead), ​Investigation (equal), Methodology (supporting), Project administration (lead), Supervision (lead), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
First published: 24 April 2023
Citations: 9

Abstract

Biomonitoring is vital for establishing baseline data that is needed to identify and quantify ecological change and to inform management and conservation activities. However, biomonitoring and biodiversity assessment in arid environments, which are predicted to cover 56% of the Earth's land surface by 2100, can be prohibitively time consuming, expensive, and logistically challenging due to their often remote and inhospitable nature. Sampling of environmental DNA (eDNA) coupled with high-throughput sequencing is an emerging biodiversity assessment method. Here we explore the application of eDNA metabarcoding and various sampling approaches to estimate vertebrate richness and assemblage at human-constructed and natural water sources in a semi-arid region of Western Australia. Three sampling methods: sediment samples, filtering through a membrane with a pump, and membrane sweeping in the water body, were compared using two eDNA metabarcoding assays, 12S-V5 and 16smam, for 120 eDNA samples collected from four gnammas (gnamma: Australian Indigenous Noongar language term–granite rock pools) and four cattle troughs in the Great Western Woodlands, Western Australia. We detected higher vertebrate richness in samples from cattle troughs and found differences between assemblages detected in gnammas (more birds and amphibians) and cattle troughs (more mammals, including feral taxa). Total vertebrate richness was not different between swept and filtered samples, but all sampling methods yielded different assemblages. Our findings indicate that eDNA surveys in arid lands will benefit from collecting multiple samples at multiple water sources to avoid underestimating vertebrate richness. The high concentration of eDNA in small, isolated water bodies permits the use of sweep sampling that simplifies sample collection, processing, and storage, particularly when assessing vertebrate biodiversity across large spatial scales.

1 INTRODUCTION

Semi-arid and arid lands comprise more than one third of the Earth's land surface and include ecosystems such as forests, woodlands, savannas, shrublands, grasslands, and deserts (Lemons & Victor, 2003). Despite limited water availability due to a combination of low rainfall, high temperatures, and low humidity, these environments harbor a remarkably diverse biota, high levels of endemism and some of the world's most endangered species (Durant et al., 2012). Semi-arid and arid ecosystems have received less attention than other ecosystems, irrespective of their extent, ecological and social value, and the threats they face (Durant et al., 2012; Razgour et al., 2018). The extent of arid habitats is increasing globally due to climate change, and it is predicted they will cover 56% of the Earth's land surface by 2100 (Lemons & Victor, 2003; O'Farrell et al., 2010). Habitat degradation is a major issue for existing semi-arid and arid lands (Lemons & Victor, 2003; O'Farrell et al., 2010) and poses a significant threat to ecosystem function and services, upon which the livelihoods of more than 250 million people rely (James et al., 2013).

Biomonitoring is vital for establishing baseline data that is needed to identify and quantify ecological change and to inform management and conservation activities in semi-arid, arid, and other habitats (Campbell et al., 2002; Epanchin-Niell et al., 2012; Herrick et al., 2006). Conventional techniques such as observational surveys (direct observation, camera traps, audio recordings), recording field signs (hair/feathers, scats, tracks, feeding signs) and live trapping can be costly, labor intensive and may have ethical considerations (e.g., Cross et al., 2020; Furlan et al., 2020; Waudby et al., 2019). Some species may also have lower detectability than others, leading to biases in survey data (e.g., Einoder et al., 2018; Fernandes et al., 2018; Ji et al., 2013; Thompson & Newmaster, 2014). As a result, conventional monitoring techniques have some limitations, particularly detecting species of high priority for conservation or management, which are often rare, endangered, invasive, or elusive (Barnes & Turner, 2016; Harper, Handley, et al., 2019; Kelly et al., 2014; Rodgers & Mock, 2015). Therefore, there is an urgent need for biomonitoring practices that can generate detailed and current environmental information on arid ecosystems (Kelly et al., 2014).

Environmental DNA (eDNA) metabarcoding is a relatively new biomonitoring tool that has potential to overcome some of the disadvantages of more conventional approaches (Barnes & Turner, 2016). Metabarcoding involves sequencing small regions of DNA isolated from substrate samples and comparing the sequences against a library of taxonomically identified sequences to determine the species present in the sample (Taberlet et al., 2012). High-throughput sequencing has revolutionized the application of eDNA, enabling ecological communities to be characterized at a relatively low cost due to the simultaneous analysis of many samples (Furlan et al., 2020; Kelly et al., 2014). Metabarcoding of eDNA has proved highly successful in freshwater and marine systems (Egeter et al., 2018; Furlan et al., 2020; Harper, Handley, et al., 2019; Palacios et al., 2020; Thomsen et al., 2012; Ushio et al., 2017, Wang et al., 2021). However, despite rapid advancements in the technology, it is currently difficult to apply metabarcoding to terrestrial biomonitoring where eDNA may not be well preserved (van der Heyde et al., 2020), appropriate sampling substrates may be limited (Fahner et al., 2016; van der Heyde et al., 2020, 2021), consistent protocols to sample substrates are not well established (Harrison et al., 2019; van der Heyde et al., 2022), and reference databases for arid habitat taxa are incomplete (Bradford et al., 2010; Carrasco-Puga et al., 2021; Egeter et al., 2018; Palacios et al., 2020; van der Heyde et al., 2020). As a result, the application of eDNA for biomonitoring in arid lands is rare.

In water-limited environments, both natural and human-constructed (artificial) water bodies are critical water sources for animals (Davies, 1972; Hedges et al., 2021; James et al., 1999; Redfern et al., 2003). Consequently, these water bodies may be particularly rich sources of eDNA and important for biodiversity monitoring as they can provide a broad snapshot of biodiversity over a wide area given that animals may migrate over vast distances to water(. Studies have shown the applicability of eDNA metabarcoding in identifying fauna using water holes (Farrell et al., 2022; Furlan et al., 2020; Harper, Handley, et al., 2019; Mas-Carrió et al., 2021) but, to date, no studies have explored the use of different sampling methods (e.g., filtering vs sweeping vs sediment). Sample volume protocols designed for marine environments (i.e., 1–5 L) are typically applied to freshwater habitats (Egeter et al., 2018; Palacios et al., 2020; Takahashi et al., 2023) despite distinct differences in water characteristics. Water bodies in semi-arid and arid lands are often of a relatively small volume and a greater turbidity (due to high concentrations of algae, sediment, and organic debris) than marine or other freshwater environments (Harper, Buxton, et al., 2019). Attempts to apply sampling protocols for marine environments to studies of water bodies in semi-arid and arid habitats have been hindered by pump blockages during filtering of water for analysis, reducing the number of samples processed (Egeter et al., 2018; Klymus et al., 2017). Techniques to reduce sample turbidity (e.g., centrifugation, increased filter membrane pore size, pre-filtering) may reduce eDNA capture, especially when eDNA is bound to suspended particles (Turner et al., 2014), and increase costs and contamination risk (Klymus et al., 2017; Takahara et al., 2012). Turbidity issues can be avoided by collecting sediments rather than water, with the additional benefits of reduced eDNA decay in sediment compared to water (Buxton et al., 2018; Turner et al., 2015) and a high yield of eDNA due to binding of DNA to particulate matter (Turner et al., 2015). However, there is evidence that humic substances in sediment inhibit PCR amplification (e.g., Stoeckle et al., 2017) and that differences in eDNA viability between substrates can lead to a failure to detect all the components of the faunal community (Palacios et al., 2020).

Biomonitoring using eDNA metabarcoding has been used previously to detect terrestrial vertebrates in artificial water sources (Rodgers & Mock, 2015), but no studies to date have compared detection of eDNA from natural and artificial water sources in the same ecosystem. Here, we sampled water from both natural and artificial water bodies. While both water bodies are accessed by numerous fauna, there are likely to be differences in the community composition between them due to the location, structure, size, and immediate surroundings (e.g., Korine et al., 2016; Letnic et al., 2014; Schneider & Griesser, 2009). Here we tested the hypothesis that vertebrate eDNA can be detected in water and sediment samples collected from both gnammas (natural water sources) and cattle troughs (artificial water sources) in the semi-arid Great Western Woodlands, Western Australia. We investigated whether the vertebrate taxonomic richness and community assemblages detected with eDNA metabarcoding techniques differed for natural and artificial water sources and examined the effect of sampling methodology on these variables. We compared results from (1) sediment samples, (2) water samples filtered through membranes (filtered samples) and (3) unfiltered samples collected on membranes dipped in the water (swept samples; Bessey et al., 2021) and evaluated the effect of sample replication on detected taxonomic richness. Our overall aim was to expand the scope of eDNA metabarcoding to biomonitoring in environments previously considered suboptimal for the application of eDNA approaches, thereby enhancing the potential of eDNA techniques for conservation and management.

2 MATERIALS AND METHODS

2.1 Study sites

Fifteen samples were collected from each of four gnammas and four cattle troughs (n = 120; see below), located in a 96 km2 area of the Fraser Range, 380 km NNE of Esperance, Western Australia. The study site has a semi-arid climate with 268 mm of annual rainfall occurring year-round. Vegetation communities are dominated by Eucalyptus woodlands and Acacia shrublands and herb lands. The Fraser Range is within the Great Western Woodlands (GWW), the world's largest semi-arid to arid woodland (Newbey et al., 1984), which covers 16 million hectares and supports an exceptional native flora (>3300 plant species) and fauna (49 native mammal, 11 feral mammal, 138 reptile, 14 frog, and 215 bird species; Department of Parks and Wildlife, 2013; Fox et al., 2016).

2.2 Sample collection

Water samples, sediment samples, and samples from membranes swept through the water body were collected in May 2021 (late Autumn) from the four gnammas and four cattle troughs. Five replicate samples of each type were taken from each water source. Gnammas ranged from 30 cm to 300 cm in width, and 20 cm to 100 cm in depth. Cattle troughs were 130 cm wide and 60 cm deep. Five 50 mL water samples were collected into falcon tubes at five random locations in each water source, at a depth of 5–15 cm from the surface. Five Supor 47 mm 0.45 μm pore-size filter membranes (Pall Corporation) were submerged at a depth of 5–15 cm at five random locations from each water source, and “swished” around for 15 s before being placed immediately into individual zip-lock bags. Finally, five randomly chosen sediment samples were scooped from the bottom of each water source into a 50 mL falcon tube. Disposable gloves were worn during sampling and were changed between every sample at each location. Samples were placed on ice immediately after collection. Water samples (50 mL) were filtered within 24 h through a Supor 47 mm 0.45 μm pore size filter membrane (Pall Corporation) using a Sentino Microbiology Pump (Pall Corporation). Equipment was decontaminated between samples in a 10% bleach solution for 10 min and rinsed thoroughly before subsequent use. Two filtering control samples were taken by pumping 500 mL of local tap water through filter membranes. All samples were kept on ice from the time of collection or processing and were frozen at −20°C at Curtin University within 72 h of collection.

2.3 Sample processing and DNA extraction

Prior to DNA extraction, samples were defrosted in a refrigerator (4°C) overnight. DNA was extracted from filter membranes using the Qiagen DNeasy Blood and Tissue Kit (Qiagen). Filter membranes were cut up into 2 mm wide strips and incubated in 540 mL buffer ATL and 60 μL proteinase K at 56°C for 3 h. Sediment samples were homogenized using a TissueLyser (Qiagen) and DNA was extracted from 250 mg of sediment using the DNeasy PowerLyser Powersoil Kit (Qiagen), which contains steps to remove inhibitors from the extracts. Samples were then extracted using the QIAcube automated platform (Qiagen) and eluted to 100 μL using the manufacturer's protocol. DNA extracts were immediately frozen at −20°C. DNA extraction blanks (negative controls) were processed with each batch of 30 samples (n = 4) using the extraction reagents only.

2.4 Assessment of DNA extracts

Due to the degraded nature of eDNA, metabarcoding primers typically target short barcode regions to improve amplification success (Yu et al., 2012). The primers used were the 12Sv5-F/R targeting the mitochondrial 12S gene (F: 5'-TAGAACAGGCTCCTCTAG-3'; R: 5'-TTAGATACCCCACTATGC-3', ~98 bp (Riaz et al., 2011) and the mammal specific primers 16Smam1/2 targeted the mitochondrial 16S ribosomal gene (F: 5'-CGGTTGGGGTGACCTCGGA-3'; R: 5'-GCTGTTATCCCTAGGGTAACT-3', ~135 bp (Taylor, 1996). Both assays target regions conserved across vertebrates (12S-V5) or mammals (16Smamm) and can be recovered from degraded DNA (Kitano et al., 2007; Sarri et al., 2014; Staats et al., 2016). Quantitative polymerase chain reaction (qPCR) was used to detect the quality and quantity of DNA in each extract, and extract and verify the optimum DNA input for metabarcoding (Murray et al., 2015). Here, qPCR assays were run on all samples using the 12S-V5 F/R primers on the neat extract, 1/5 and 1/10 dilution to screen for PCR inhibitors in the reaction. The extraction of sediment samples included inhibitor removal steps because those tend to be more abundant in sediments, but all sample types were screened. The majority of water/passive samples were found to have an optimum dilution in the realm of neat, 1/5 or 1/10, whereas the majority of sediment samples had to be diluted to 1/100. Using the optimum dilutions determined by qPCR with 12S-V5 primers, a supplementary qPCR assay was run on a subset of 20 samples to determine whether the 16smam forward/reverse primers were as effective at amplifying DNA at these dilutions. In some instances, primer bias can differentially amplify eDNA at sites with different community composition (Aird et al., 2011). To mitigate these effects the combination of a vertebrate primer and a specific mammal primer were used. In addition, 12S and 16S are broadly used for mammal detection in eDNA metabarcoding studies and there is a greater availability of reference sequences available online for taxonomic assignment (Deagle et al., 2014; Valentini et al., 2016).

The polymerase chain reaction (PCR) mix for quantification contained: 2.5 mM MgCl2 (Applied Biosystems), 10 × PCR Gold buffer (Applied Biosystems), 0.25 mM dNTPs (Astral Scientific), 0.4 mg/mL bovine serum albumin (Fisher Biotec), 0.4 μmol/L forward and reverse primer, 1 U AmpliTaq Gold DNA polymerase (Applied Biosystems), and 0.6 μL of a 1:10,000 solution of SYBR Green dye (Life Technologies). All PCR amplification was conducted on a StepOne Plus (Applied BioSystems) real-time qPCR instrument with the following conditions: 5 min at 95°C, 50 cycles of 95°C for 30 s, 30 s at the annealing temperature (58°C) and 45 s at 72°C, followed by 15 s at 95°C, 1 min at 60°C, and 15 s at 95°C during the melt curve stage, ending with 10 min of elongation at 72°C. Contamination was minimized by preparing the PCR mixes in a dedicated clean laboratory, and then adding DNA extract in a separate laboratory, inside specialized ultraviolet hoods.

2.5 DNA amplification and sequencing

Based on qPCR results, fusion tagging was performed on samples that contained adequate amplifiable DNA by assigning each sample a unique combination of fusion tag primers. Each fusion tag primer combination contained a unique multiplex identifier between 6 and 8 bp in length, Illumina's sequencing adaptors (i.e., P5 and P7) and the gene-specific primer (described above). A single-step fusion protocol was carried out with unique index combinations before qPCR was used to generate amplicons of each fusion-tagged sample using the same reagents and cycling conditions as described above. The fusion-tagged amplicons were generated in duplicates for each biological replicate to maximize amplicon numbers for sequencing and reduce the chances of non-detections and the effects of PCR stochasticity (Murray et al., 2015). PCR replicates were then pooled, amplicons cleaned using the QIAquick PCR Purification Kit (Qiagen) and then quantified using the QIAxcel Advanced System (Qiagen). Based on this quantification, the DNA library for sequencing was made from pools, combined in approximate equal concentrations. A Pippin Prep (Sage Science) was used to size-select the amplicons in this library, and the library was then cleaned using the QIAquick PCR Purification Kit (Qiagen). A Qubit fluorometer (Thermo Fisher Scientific) was used to quantify the final DNA library, before sequencing as per Illumina's sequencing protocols for single-end sequencing, using Illumina's single direction MiSeq 300 V2 Reagent Kit on the Illumina MiSeq platform (Illumina).

2.6 Sequence analysis and taxonomic assignment

Using a high-performance computing cluster (Pawsey Supercomputing Centre), sequences were analyzed with the eDNAFlow automated pipeline (Mousavi-Derazmahalleh et al., 2021), which performed the following tasks: sequence quality was checked with FASTQC (Andrews, 2010) and filtered with AdapterRemoval v2 (Schubert et al., 2016) for Phred quality score lower than 20 and trimming sequences with Ns as enforced in eDNAFlow by—trimns and—trimqualities parameters. Remaining trimmed sequences were demultiplexed and sequences smaller than expected minimum amplicon length were trimmed (12S-V5 minimum length 50 bp; 16smam minimum length 25 bp) using OBITools' ngsfilter and obigrep tools, respectively (Boyer et al., 2016). Unique sequences, zero-radius operational taxonomic units (ZOTUs-denoised sequences) and an abundance table were generated using the USEARCH (Edgar, 2010) commands fastx-uniques, unoise3, (minsize 8) and otutab, respectively. The ZOTUs generated from both assays were queried against the nucleotide database Genbank (https://www.ncbi.nlm.nih.gov/genbank/) in October 2021 using the following parameters in Basic Local Alignment Search Tool (BLASTN) (Altschul et al., 1990): perc_identity ≥90, evalue ≤1e−3, best_hit_score_edge 0.05, best_hit_overhang 0.25, qcov_hsp_perc 100, max_target_seqs = 10. Taxonomic identification was then assigned with more strict parameters using the eDNAFlow Lowest Common Ancestor (LCA) script (Mousavi-Derazmahalleh et al., 2021) with a minimum percentage identity (%identity) of ≥95, and if a ZOTU had multiple blast assignments where the difference between their %identity was equal to or smaller than 1, then that ZOTU was assigned to the nearest common taxonomic level otherwise a species level assignment was returned.

The results of the LCA script were compared against existing taxonomic data for the sampling area (Department of Parks and Wildlife, 2013), and if necessary, manually curated to ensure that the taxa assigned are known to occur in the sampling area. While the majority of ZOTUs were assigned to species level, the LCA script dropped a few ZOTUs to the nearest common taxonomic level (e.g., Meliphagidae sp.). Where ZOTUs were assigned to taxa that are not local to the sampling area, the ZOTUs were either reassigned to sister species, for example, black-flanked rock-wallaby (Petrogale lateralis) and Australian shelduck (Tadorna tadornoides) known to occur in the area, or dropped to the nearest common taxonomic level that currently exists in the sampling area, for example, vesper bats, (Vespertilionidae sp.), that is, ZOTUs were manually assigned to locally known species if all other species for that ZOTU were exotic to the sampling area: Australian magpie (Gymnorhina tibicen), yellow-throated miner (Manorina flavigula), Australian magpie-lark (Grallina cyanoleuca), euro (Osphranter robustus), and sheep (Ovis aries). In cases where ZOTUs were assigned to family level and multiple exotic species were attributed to a ZOTU with 100% identity, the ZOTUs were reassigned to generic rank of species known to be native to the sampling area (e.g., Corvus sp.). All ZOTUs assigned to “chordata environmental samples” were removed from the data set, and all ZOTUs assigned to exotic canid spp. (i.e., Nyctereutes viverrinus, Canis lupus rufus, Canis lupus) were reassigned to Canis lupus familiaris. ZOTUs assigned to exotic “Artiodactyla spp.” (i.e., Bison bonasus, Bos javanicus, Bos mutus, Cephalophus dorsalis, Muntiacus sp., Pudu puda, Syncerus caffer, Tragelaphus eurycerus) were reassigned to domestic cow (Bos taurus).

2.7 Statistical analysis

We generated a presence–absence matrix for the two assays (12S-V5 and 16smam) for both sources (gnammas and cattle troughs) and all sampling methods (swept, filtered, and sediments). This matrix was used to calculate taxon richness for each sample. The effect of source and sampling method was evaluated using a two-way analysis of variance (ANOVA) achieved using StatistiXL v 2.0 (www.statistiXL.com). The presence–absence matrix was also used to examine differences in biological assemblages identified from the various sources and sampling methods, using the PERMANOVA+ software add-on in PRIMER7 (Clarke & Gorley, 2015). To avoid unassigned resemblance values, two gnamma-sediment samples (GS1_3 and GS2_4) and one gnamma-filter sample (GW2_2) for which no taxa were detected were removed from the matrix, before a Jaccard distance matrix was generated between samples. Using these data, a PERMANOVA was run using Type III sums of squares, unrestricted permutation of raw data and significance determined by 9999 permutations of the pseudo-F statistic (Clarke & Gorley, 2015). PRIMER7 was used to visualize and estimate pair-wise tests to determine how the sources and sampling methods compared using non-metric multidimensional scaling (nMDS).

We used the “BiodiversityR” (Kindt & Coe, 2005) and “drc” packages (Knezevic et al., 2007) with R 3.5.1 (R Core Team, 2018) to generate accumulation curves for taxa at each site and with each sampling method. Asymptotic regression rarefaction curves were generated for each sampling method and models were visualized using the package “ggplot2” (Chiarucci et al., 2008; Wickham & Sievert, 2016). Curves represent the order-free accumulation of mean taxa detections calculated from random permutations of all possible orderings of taxa detections. We then used an EcoTest (Cayuela et al., 2015) to statistically compare rarefaction curves representing each sampling method for each source respectively, testing the null hypothesis that three samples were drawn from a single assemblage and any differences in their rarefaction curves reflect only sampling effects. Using the package “rareNMtests” (Cayuela et al., 2015) the abundances of all taxa detections were summed to generate a pooled composite curve. The test statistic (Z) was calculated from the cumulative summed areas of difference between the three individual curves representing each sampling method and the composite curve. The observed value of this Z-statistic was compared to a null model distribution constructed from rarefaction curves generated from 200 random permutations of all possible orderings of taxa detections in each set of replicates across the three sampling methods and significance assessed at α = 0.05. A “leave one out” analysis was used to subsequently determine how z-values and significance changed when sampling methods were individually omitted from the analysis, after Cayuela et al. (2015).

3 RESULTS

There was successful eDNA amplification from 110 and 111 of 120 samples using the 12S-V5 and 16smam assays, respectively. Sequencing yielded 9,740,203 (mean per sample = 74,924 ± 4395 standard error; SE) sequences for the 12S-V5 assay and 12,530,593 (mean per sample = 96,389 ± 5762 SE) for the 16smam assay sequences in total. A small proportion of sequence reads were present in the field and extraction controls, highest in the 16smam assay (3.09% of total reads) and lowest in the 12S-V5 (2% of total reads). A proportion of sequence reads were assigned to humans, which were highest in the 16smam assay (27.05%) and lowest in the 12S-V5 assay (13.84%); ZOTUs representing humans were not included in the statistical analysis. One ZOTU assigned to Gallus amplified with the 12S-V5 assay accounted for 0.7% of total reads for that assay and was removed as a likely contaminant as G. gallus DNA is used in the laboratory as a positive control.

3.1 Vertebrate taxon richness detected across sources and sampling methods

12S-V5 and 16smam assays produced a total of 9190 and 3203 ZOTUs respectively, from which 6584 and 1144 were assigned. A total of 26 unique taxa were detected from both assays combined, representing 12 orders and 21 families including 11 mammal, 15 bird and 1 amphibian taxa (Table 1). The 12S-V5 assay detected 25 unique taxa and the 16smam assay nine unique taxa. Eight common taxa were detected by both assays (Table 1). There were significant differences for total vertebrate richness between gnammas (3.55 ± 0.23 taxa per sample, 20 taxa total) and cattle troughs (4.65 ± 0.34 taxa per sample, 22 taxa total; F5, 114 = 7.9, p = .006) and between sampling methods (F5, 114 = 7.4, p = .001; Figure 1). The highest total richness was detected in swept (4.3 ± 0.36 taxa per sample; 23 taxa total) and filtered samples (4.9 ± 0.39 taxa per sample; 22 taxa total) while significantly fewer taxa were detected from sediment samples (3.1 ± 0.29 taxa per sample; 13 taxa total; p ≤ .016; Figure 1 and Figure 2). No unique taxa were detected in sediment samples compared to water samples (both swept and filtered), while 13 taxa were only detected in water samples. Thirteen bird taxa were detected overall and nine were detected only in water samples and not in sediment. Similarly, 11 mammal taxa were detected and only three (Ovis aries, Macropus fuliginosus and Osphranter robustus) were detected in water and not in sediment (Figure 2).

TABLE 1. Vertebrate taxa detected from five samples each from four gnamma holes (granite rock pools) and four cattle troughs in the Great Western Woodlands, Western Australia, indicating sampling method (sediment, sweeping, and filtering) and assay (12S-V5 and 16smam). Site indicates which of the gammas (G1–G4) and troughs (T1–T4) the taxa were detected in.
Gnamma Trough Site
Sediment Swept Filtered Sediment Swept Filtered Sediment Swept Filtered
Amphibia Neobatrachus pelobatoides 16S 16S 16S G1/G3 G3 G1/G2/G3
Mammalia Macropus fuliginosus 12S T4
Petrogale lateralis 12S 12S 12S T1 T1/T4 T1/T4
Osphranter robustus 12S 12/16S G2/G4
T1/T4 T1/T4
Oryctolagus cuniculus 16S 16S T4 T4
Vespertilionidae sp. 12S 12S 12S 12S 12S G3/G4 G2/G3
T1 T1/T4 T3
Felis catus 12S T1/T4
Canis lupus familiaris 12/16S 12/16S 12/16S 12/16S 12/16S 12/16S G1/G2/G3/G4 G1/G2/G4 G1/G3/G4
T1/T2/T3 T1/T2/T3/T4 T1/T2/T3/T4
Sus scrofa 16S 12/16S 16S 12/16S 12S G1 G1/G2/G3 G1/G2/G3/G4
T1/T3/T4 T1
Camelus dromedarius 12/16S 16S 12/16S 16S 16S G1/G2/G4 G1/G4 G1/G2/G3/G4
T1/T2 T1/T4
Bos taurus 12/16S 12/16S 12/16S 12/16S 12/16S 12/16S G1/G2/G3/G4 G1/G2/G3/G4 G1/G2/G3/G4
T1/T2/T3/T4 T1/T2/T3/T4 T1/T2/T3/T4
Ovis aries 12S 12/16S T4 T4
Aves Dromaius novaehollandiae 12S 12S 12S 12S G2/G4 G4
T4 T2/T4
Falco peregrinus 12S 12S G2 G2
Tadorna tadornoides 12S G1
Eolophus roseicapilla 12S 12S 12S 12S 12S G1/G3/G4 G3/G4
T1/T2/T4 T1/T2/T3/T4 T1/T2/T3/T4
Northiella haematogaster 12S 12S 12S G1
T1/T2/T4 T1/T2
Ocyphaps lophotes 12S 12S 12S 12S 12S 12S G1 G1 G1/G2/G4
T1/T2/T3 T1/T2 T1/T2/T4
Phaps chalcoptera 12S 12S 12S 12S 12S 12S G1 G1/G2/G4 G1/G2/G3
T1/T2/T3 T1/T2/T3/T4 T1/T2/T3/T4
Gymnorhina tibicen 12S 12S 12S G4 G4
T2
Manorina flavigula 12S 12S 12S G2/G3
T2 T2
Pardalotus striatus 12S G2
Meliphagidae sp. 12S 12S 12S 12S G3/G4 G1/G2/G3/G4
T1/T2/T3
Corvus sp. 12/16S 16S 12S 12S G1/G3 G4
T2 T2/T4
Grallina cyanoleuca 12S 12S T2 T2
Cinclosoma clarum 12S 12S G4
T3
Details are in the caption following the image
Vertebrate taxon richness detected in (a) gnammas (rock holes) and cattle troughs, and (b) using sediment, sweeping, and filtering sampling methods from the two sources in the Great Western Woodland, Western Australia. Letters indicate significant differences (α = 0.05) between factors. Values are mean ± SE for n = 5 from each of four gnammas and four troughs.
Details are in the caption following the image
Tallies of feral mammals, birds, native mammals, and amphibians detected from sediments, swept water samples, and filtered water samples collected from gnamma holes (granite rock pools) and cattle troughs. Venn diagrams display the relationship of these tallies in (a) filtered (n = 22), swept (n = 23), and sediment (n = 13) samples from gnamma holes and cattle troughs, (b) gnamma holes (n = 20) and cattle troughs (n = 22) across all sampling methods (c) filtered (n = 18), swept (n = 19), and sediment (n = 10) samples collected from cattle troughs, (d) filtered (n = 15), swept (n = 15), and sediment samples (n = 11) collected from gnamma holes. Four gnammas and four troughs were sampled.

3.2 Source and sampling effects on vertebrate taxon assemblages

There was a significant effect of source (pseudo-F = 9.44; p < .001) and sampling method (pseudo-F = 3.86; p < .001) on the vertebrate assemblages detected and a significant interaction between these factors (pseudo-F = 2.32; p = .002;). All sampling techniques resulted in identification of different species assemblages for gnammas (p ≤ .013), but for cattle troughs the assemblages detected by swept samples did not differ from those detected by the filtered and sediment samples (p ≥ .050).

Differences between water sources appear to be driven by higher variation in amphibian and bird assemblages in gnamma samples and higher variation in mammal assemblages in cattle troughs (Table 1, Figure 2). For both sources there was a higher variation in the bird assemblage in both swept and filtered samples compared to sediment samples while the higher variation in the mammal assemblage in swept samples compared to filtered samples appeared to be driving the unique groupings observed in these sampling methods (Figure 2).

For gnammas, where the various treatments yielded statistically different species assemblages, a number of native birds were detected (i.e., Manorina flavigula, Northiella haematogaster, and Tadorna tadornoides) in filtered samples compared to swept samples where Eolophus roseicapilla (Australian galah) and a bat (Vespertilionidae sp.) were detected (Table 1). In contrast, Osphranter robustus and a few bird species (Corvus sp., Cinclosoma clarum, Dromaius novaehollandiae, Falco peregrinus, Gymnorhina tibicen and Meliphagidae sp.) were detected from sediment samples (Table 1). Meanwhile, eDNA metabarcoding of cattle trough filtered samples detected a few unique birds (Corvus sp., D. novaehollandiae, Grallina cyanoleuca, Gymnorhina tibicen, Manorina flavigula, Northiella haematogaster and Meliphagidae sp.) and mammals (Felis catus, Ovis aries, and Osphranter robustus) in contrast to sediment samples which yielded other mammals also detected in water samples (Oryctolagus cuniculus and Camelus dromedarius; Table 1).

3.3 Vertebrate taxa richness accumulation with replication effort

Rarefaction EcoTests indicate that accumulation curves differed between sampling methods for both gnammas (z = 20.18; p = .040) and cattle troughs (z = 30.2; p = .045; Figure 3). However, when sediment was omitted from the analysis there was no difference between curves representing filter and sweeping sampling methods for either source (z = 2.82; p = .800). For both gnammas and cattle troughs, mean vertebrate taxon richness per sample was greatest for filtered samples, followed by swept and sediment samples. None of the curves approached an asymptote by the fifth replicate but the gradients in curves from cattle trough samples were lower at the fifth replicate than were those from gnamma samples (Figure 3).

Details are in the caption following the image
Rarefaction curves of vertebrate taxa detected with environmental DNA as a function of sampling effort in four gnammas and four cattle troughs in the Great Western Woodlands, Western Australia from samples collected from sediments, swept water samples, and filtered water samples.

4 DISCUSSION

We detected a variety of mammals, birds, and amphibians from both artificial and natural semi-permanent freshwater sources in the semi-arid Great Western Woodlands, Western Australia. Our results indicate that sediment samples, filtered water samples and membranes swept through the water are all appropriate sampling approaches for detecting vertebrates in waterbodies in a semi-arid environment using eDNA metabarcoding. However, the overall community structure detected varied both with the type of water source and the sampling technique. We observed that five replicates per water source was insufficient to detect all species interacting with these water sources. These findings are consistent with previous studies that have assessed the impact of sample collection on eDNA-based terrestrial biodiversity assessment (e.g., Newton et al., 2022; van der Heyde et al., 2020) and demonstrate that this approach is most effective when replicate samples are collected from multiple water sources using a variety of sampling methods and substrates.

We detected only a relatively small proportion (10.2% of mammals, 6.5% of birds, and 7.15% of amphibians) of the vertebrate diversity recorded from the Great Western Woodland region (Department of Parks and Wildlife, 2013) but we did detect numerous threatened and elusive taxa (e.g., Cinclosoma clarum, Petrogale lateralis, Northiella haematogaster), as well as feral species (36.4% of total feral diversity in GWW) that threaten native biota. These results indicate that eDNA metabarcoding from water sources can contribute to biomonitoring in semi-arid lands, particularly if sample collection involves numerous replicate samples, multiple sampling sources and techniques, and occurs over large spatial scales.

Overall, we detected more vertebrate taxa in samples from artificial water bodies than from natural water bodies (Figure 1 and Figure 2), despite them being found in similar habitats. Differences in accessibility of gnammas and cattle troughs for fauna may account for some of the variation in vertebrate richness and assemblage or that animals may preferentially use certain water bodies. For example, gnammas may present an injury risk with an uneven, slippery surface around the perimeter (Bayly, 1999) and the relative lack of plant cover on outcrops may result in greater vulnerability to predation while drinking for some species (Crosmary et al., 2012; Votto et al., 2022). Terrestrial mammals may therefore prefer to drink from above-ground cattle troughs where the water surface is higher, reducing the risk of falling, and facilitating vigilance. This may be reflected by the greater diversity of mammals we detected from troughs. In contrast, we detected a greater diversity of birds from gnammas than from troughs (Figure 2). It is possible that diurnal birds may avoid cattle troughs due to the prevalence of large mammals, including predators (Fontaine et al., 2006) and humans (Meager et al., 2012) at these artificial water sources, especially since access is less of an issue for flighted birds.

Differences in ambient conditions in artificial and natural water sources related to shape and volume (e.g., diameter, depth, surface area, and edge complexity) may also influence persistence of vertebrate eDNA. For instance, ultraviolet exposure can increase eDNA degradation (Green et al., 2011 but see Bae & Wuertz, 2009) and high temperature denatures DNA molecules both directly and by enhancing microbial activity and enzyme kinetics (Fu et al., 2012; Okabe & Shimazu, 2007). Ledges, overhangs, and a smaller surface to volume ratio may provide protection from the impacts of temperature and ultraviolet for eDNA in gnammas compared to troughs, improving detection of small species which may deposit less eDNA, for example, birds compared to mammals. Our observations are consistent with previous studies that compare detection of terrestrial vertebrate taxa between multiple eDNA sources (Newton et al., 2022; van der Heyde et al., 2020) and support the conclusion that, when aiming to characterize arid or semi-arid vertebrate biodiversity using eDNA metabarcoding, samples should be collected from both natural and artificial water sources to maximize the taxa detected. It is worth noting also that while eDNA is often proposed as a replacement of conventional methods, concomitant use of technology such as camera traps or conventional survey methods will value-add to eDNA data (Newton et al., 2023; Ryan et al., 2022), particularly with regard to behavior of taxa near water holes, for example, just drinking or also bathing in the source (Harper, Handley, et al., 2019; Mas-Carrió et al., 2021).

Sampling protocol can have a major impact on the biological community detected with eDNA metabarcoding (e.g., Stoeckle et al., 2017). Our results for vertebrate assemblages in a semi-arid habitat support this finding. Sediment samples yielded significantly lower vertebrate taxonomic richness (50% of total richness) than did swept (88% of total richness) and filtered (84% of total richness) water samples (Figure 1); no unique taxa were detected in sediment samples while 13 taxa were only detected in water (swept and filtered) samples (Table 1 and Figure 2). Our findings are consistent with a previous study comparing eDNA detection in water and sediments for a single species, the great crested newt (Triturus cristatus; Buxton et al., 2018), but contradict some previous studies comparing eDNA detection in water and sediments (including those from arid lands; Palacios et al., 2020), which found that sampling sediments yielded higher biodiversity than water samples (Palacios et al., 2020; Turner et al., 2015). Environmental DNA can degrade rapidly in water, and the role of sediments in slowing eDNA degradation is well documented (Barnes et al., 2014; Hou et al., 2014; Romanowski et al., 1991). However, a variety of abiotic and biotic factors determine the rate of eDNA degradation and eDNA can be detectable in water for periods of only days to as long as weeks (Barnes et al., 2014) depending on environmental conditions. Our data suggest that either the rates of eDNA degradation in the water at our two sites were low, or, that the species we detected visited the water frequently enough to contribute detectable levels of eDNA still suspended in the water when sampling occurred.

To our knowledge, this is the first successful trial of swept eDNA collection from terrestrial water sources. Filtration through a membrane using a pump is the most common aquatic eDNA sampling method, typically processing large volumes of water rapidly. This approach is considered essential for accurately detecting the breadth of taxa present (Shu et al., 2020) but can be time consuming if the water contains particulates and/or organics. Our findings indicate that comparable taxonomic richness can be detected from filtered samples and swept samples from small semi-arid habitat water sources (Figure 2). Similar findings were reported for fish detection in marine systems, where sample membranes were submerged for 4–24 h (Bessey et al., 2021). Interestingly, we only submerged the membranes for 15 s but still detected considerable faunal diversity. High concentrations of eDNA in isolated water bodies (Buxton et al., 2017) could account for the high vertebrate richness detected by eDNA metabarcoding of water samples collected by sweeping. To better understand the efficacy of sweeping we recommend future studies compare sweeping to filter volumes of 50 mL, 500 mL, and 1000 mL to address whether our results demonstrate effective sampling of different volumes with filtration and sweeping. We recognize that the 50 mL of filtered water we sampled here is considerably less than for most sampling protocols (Hunter et al., 2019 but see Day et al., 2019), but collecting small sample volumes was necessary due to the small volumes present at many of our study sites, and our results confirm that is enough for high eDNA concentration water bodies like the ones in this study. Additionally, the target substrate should also be considered since DNA in open water is more readily recovered using the current methods compared to DNA in sediment bound to clay particles, which requires a targeted approach (Sakata et al., 2020).

Our finding of comparable taxonomic richness between swept and filtered sampling approaches has profound implications for sampling turbid water, sampling in remote regions and for the logistics of collecting and transporting large volumes of samples. Optimal water filtering procedures are among the most important considerations when collecting and filtering turbid water samples. Particulates block water pumps, increasing the time it takes to filter samples and reducing the total number of samples that can be processed (Egeter et al., 2018; Hunter et al., 2019; Klymus et al., 2017; Turner et al., 2014). This issue is eliminated with sweeping. Membranes swept in water can also facilitate effective sample collection in remote regions where transporting pumps and/or large volumes of water samples and sourcing power to run pumps or freeze large volumes of water can be logistically challenging.

The application of eDNA metabarcoding as a biomonitoring tool to assess biological richness in terrestrial environments is becoming more frequent in the literature (e.g., van der Heyde et al., 2022) and commercially (e.g., Gold et al., 2021). However, the level of replication required to ensure that richness is not underestimated remains largely unexplored. Other studies assessing biological richness from arid zone water sources have used 1–12 biological replicates per environmental sample (e.g., Egeter et al., 2018; Furlan et al., 2020; Palacios et al., 2020) with optimal replication varying with species and context (Ficetola et al., 2015). Here we have demonstrated that five biological replicates collected from isolated water sources are insufficient to detect the true vertebrate richness at our study site (Figure 3). Even for the best-performing water samples, rarefaction curves representing the accumulation of vertebrate richness with increasing replication effort continued to increase beyond the fifth replicate, particularly for samples collected from gnammas (Figure 3). Increasing replication reduces false-negative results but may increase the probability of false positives stemming from contamination and inflates workload and analysis costs (Ficetola et al., 2015; Furlan et al., 2020). A need for high sample numbers for eDNA metabarcoding reduces the relative benefits of this approach as a rapid, cost-effective biomonitoring tool compared to more conventional approaches (Ficetola et al., 2015; Furlan et al., 2020).

5 CONCLUSIONS

Our findings indicate that eDNA-based monitoring of vertebrates in a semi-arid habitat was most effective when samples were collected from both natural and artificial water sources using multiple sampling methods. We demonstrated that the water source significantly influenced determination of taxonomic richness and assemblage and that sediment samples had significantly less utility for determining vertebrate richness and were not associated with the identification of unique taxa compared to water samples either filtered or swept. We recommend that future studies should explore the use of sample storage buffers and eDNA extraction protocols that are tailored to reduce inhibition, especially in sediment samples. In contrast, we detected no difference in vertebrate richness between sweeping and filtering approaches for water samples. This finding has profound implications for the application of eDNA methods to inland regions and water sources, especially in remote areas, as sweep sampling dramatically reduces the time and logistics required to process samples. Finally, our observations demonstrated that increasing replication increased the number of vertebrate taxa detected, but >5 replicate samples are required to accurately sample small inland water bodies. In conclusion, eDNA metabarcoding for biomonitoring holds promise for applications in semi-arid and arid lands but the design and execution of these studies must be carefully considered to overcome the unique limitations of this method in these environments.

AUTHOR CONTRIBUTIONS

Rupert Mcdonald: Conceptualization (equal); data curation (equal); formal analysis (equal); investigation (equal); methodology (equal); writing – original draft (equal); writing – review and editing (equal). Bill Bateman: Conceptualization (equal); investigation (equal); writing – original draft (equal); writing – review and editing (equal). Christine Cooper: Conceptualization (equal); formal analysis (equal); investigation (equal); visualization (equal); writing – original draft (equal); writing – review and editing (equal). Mieke Van Der Heyde: Conceptualization (equal); data curation (equal); formal analysis (equal); investigation (equal); methodology (equal); supervision (equal); visualization (equal); writing – original draft (equal); writing – review and editing (equal). Mousavi-Derazmahalleh Mahsa: Data curation (supporting); formal analysis (supporting). Brock Hedges: Investigation (supporting); writing – original draft (supporting); writing – review and editing (supporting). Michelle Guzik: Investigation (supporting); writing – original draft (supporting); writing – review and editing (supporting). Paul Nevill: Conceptualization (equal); funding acquisition (lead); investigation (equal); methodology (supporting); project administration (lead); supervision (lead); writing – original draft (equal); writing – review and editing (equal).

ACKNOWLEDGMENTS

MvdH and MTG were funded by the Australian Research Council (Linkage Grant LP190100555) in partnership with Curtin University, The University of Adelaide, BHP Billiton Iron Ore Pty Ltd, Rio Tinto Ltd, Chevron Australia Pty Ltd, Western Australian Museum, South Australian Museum, the Department for Biodiversity, Conservation and Attractions, the Western Australian Biodiversity Science Institute, Department of Water and Environmental Regulation (Western Australia). BAH was supported by a Commonwealth funded Research Training Program scholarship (The University of Adelaide), the Roy and Marjory Edwards Scholarship as administered by Nature Foundation, and the Justin Costelloe Scholarship as administered by Kati Thanda–Lake Eyre Basin Authority, Department of Agriculture, Water, and the Environment. We would also like to thank members of the Trace and Environmental DNA (TrEnD) Laboratory for support with metabarcoding workflows and bioinformatics, particularly Ethan Ryan. The authors have no conflict of interest to declare. We thank Murray Penter and Independence Group Limited for facilitating and assisting with sample collection. We gratefully acknowledge the Whadjuk, Ngadju, and Wudjari people, Traditional Custodians of the country on which relevant components of this research were carried out. We acknowledge their ongoing connections to land, sea, and community, pay our respects to their Elders past, present, and emerging, and extend that respect to all Aboriginal and Torres Strait Islander peoples today. We would also like to acknowledge and thank the Tjaltjraak Native Title Aboriginal Corporation for hosting us on Wudjari Country and sharing their ecological and cultural knowledge about Gnamma holes.

    FUNDING INFORMATION

    This work was funded by IGO Limited and supported by the Pawsey Supercomputing Centre.

    DATA AVAILABILITY STATEMENT

    Sequencing data and dada2 script are available at the Dryad Digital https://doi.org/10.5061/dryad.wdbrv15rb.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.