Isolation and characterization of macaroni penguin (Eudyptes chrysolophus) microsatellite loci and their utility in other penguin species (Spheniscidae, AVES)
Abstract
We report the characterization of 25 microsatellite loci isolated from the macaroni penguin (Eudyptes chrysolophus). Thirteen loci were arranged into four multiplex sets for future genetic studies of macaroni penguin populations. All 25 loci were tested separately in each of four other penguin species [Adélie penguin (Pygoscelis adeliae), chinstrap penguin (Pygoscelis antarctica), gentoo penguin (Pygoscelis papua) and king penguin (Aptenodytes patagonicus)]. Between eight and 12 loci were polymorphic per species. These loci are expected to be useful for studies of population genetic structure in a range of penguin species.
Many species of penguin are currently in decline (IUCN, 2008). Macaroni penguins, Eudyptes chrysolophus, are the most numerous species of penguin in the Southern Ocean but are now listed as Vulnerable as a result of marked decline in populations in areas such as South Georgia and Marion Island (IUCN, 2008). We report the isolation and characterization of markers suitable for investigating population genetic structure and levels of genetic diversity in the macaroni penguin, with application to a number of other penguin species.
Blood samples were taken from breeding adult macaroni penguins present during the incubation or brood guard stage in the colony located at a place called Macaroni Cwm (54°00′10″S, 38°01′10″W) on Bird Island, South Georgia. Blood sampling was performed as part of work undertaken by the British Antarctic Survey. For each individual, 0.1 mL of blood was taken from the brachial vein and immediately stored in 1.4 mL of absolute ethanol in rubber-sealed screw-topped microfuge tubes. All animal procedures were performed in accordance with UK Home Office guidelines and subject to ethical review. Genomic DNA was extracted using an ammonium acetate extraction protocol (Nicholls et al. 2000). A microsatellite-enriched library was constructed essentially using the method of Armour et al. (1994). The library was enriched for the following di- and tetranucleotide microsatellite motifs and their complements: (GT)n, (CT)n, (GTAA)n, (CTAA)n, (TTTC)n and (GATA)n, which had been bound to magnetic beads by following the procedure described by Glenn & Schable (2005). Transformant colonies were not screened for the presence of a repeat but were directly sequenced.
Two-hundred and eighty-eight bacterial transformant colonies were sequenced in the forward and reverse orientation and a consensus sequence created. A total of 132 unique sequences contained microsatellite motifs (EMBL accession numbers: FM878332–FM878463). Forty-seven sequences contained a minimum of eight repeats. Primer sets were designed for 32 of these sequences using primer3 software (Rozen & Skaletsky 2000).
All loci were amplified individually prior to multiplexing (see Table 1 for multiplex scheme). Primer sets were tested in up to 28 macaroni penguin individuals belonging to the colony located at Macaroni Cwm. Polymerase chain reaction (PCR) amplification was performed in 2-μL reactions (Kenta et al. 2008) containing 10 ng of genomic DNA, 1 μL master mix (Qiagen multiplex PCR kit) and 1 μL of each primer at 0.2 μm concentration. PCR cycling conditions were 15 min at 95 °C, followed by 40 cycles of 94 °C for 30 s, 60 °C for 90 s and 72 °C for 60 s, with a final step of 72 °C for 10 min. PCR products were diluted and allele sizes assigned using an ABI 3730 DNA Analyser (Applied Biosystems) and the size standard ROX 500. After initial primer testing, seven markers were discarded due to unreliable or failed PCR amplification (Ech023, Ech025, Ech049, Ech078, Ech087, Ech109 and Ech122).
Locus | EMBL accession number and clone name | Repeat motif | Primer sequence (5′–3′) and fluoro label | Clone allele size (bp) | PCR multiplex set | N | A | Obs. allele size range (bp) | H O | H E | F (null) | P (HW) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Ech003 | FM878334 MP15A05b | (GT)9 | F: [6FAM]-TCAAAGGCAAGTGAAACCTGACTGR: CGGTGTTTATCTCTGGGCTTTGTC | 177 | — | 26 | 4 | 175–185 | 0.62 | 0.62 | −0.01 | 0.17 |
Ech005 | FM878336 MP15A10b | (CTTTT)16 | F: [HEX]-GCCTTATGCAGGTACAGATGTCTTCTCR: TGCTGAAGGTAAAGAGCAGTGAAATC | 468 | — | 27 | 33 | 281–533 | 0.48 | 0.98 | 0.33 | 0.00 |
Ech007 | FM878338 MP15A12 | (CTTT)16 | F: [6FAM]-CAGTGAAGCATTCTCTCGCAGAACR: AAGCTTGGTGCCCGTATCTTTG | 281 | 1 | 28 | 18 | 254–323 | 0.89 | 0.90 | −0.01 | 0.73 |
Ech008 | FM878339 MP15B03 | (CA)10 | F: [HEX]-TGGGAAGCGTAACGGCCTAAR: TCAGTCACTCTCCCATTTTGTAGGG | 207 | 1 | 28 | 4 | 184–203 | 0.36 | 0.32 | −0.09 | 1.00 |
Ech009 | FM878340 MP15B05 | (AC)14 | F: [6FAM]-CCTCTGTTGTACAAGCTGGATGCR: GCAATTCTGCTTGCCTCTGC | 228 | 1 | 28 | 8 | 217–230 | 0.82 | 0.75 | −0.06 | 0.85 |
Ech010 | FM878341 MP15B07 | (GT)10 | F: [6FAM]-CGTGTTCCACTGCAACCTGACR: CCAGAGGAGGGAGTGAGGAATC | 400 | 1 | 7 | 8 | 396–408 | 0.71 | 0.90 | ND | 0.11 |
Ech011 | FM878342 MP15B08 | (GT)12 | F: [HEX]-CACAGCACTGTGACAGAAGCAR: TTAACTGGTTCGATTTGTCAGTCTTT | 171 | 1 | 27 | 8 | 164–177 | 0.74 | 0.78 | 0.01 | 0.03 |
Ech012 | FM878343 MP15B09 | (CTTT)17 (CT)2 (CTTT)19 | F: [6FAM]-TCATCATTCTTTTGGCTGTCTGAR: AAGCAATGAACAAGAGAGAGAAGGA | 267 | — | 25 | 22 | 216–304 | 0.68 | 0.95 | 0.16 | 0.00 |
Ech014 | FM878345 MP15B11 | (GATA)9 | F: [6FAM]-AACGCCAGAAAGATGCCAGAACR: CGGCACATACTAACATAGTCCCTTCAG | 223 | 3 | 28 | 6 | 218–238 | 0.82 | 0.79 | −0.03 | 0.60 |
Ech020(Z-linked) | FM878351 MP15C08 | (CTTT)16 | F: [6FAM]-GCTCCCTTGCAAACGAAACCR: AAACGTGGATAGACAGATGCAGAGAG | 249 | — | 13F4M | 8 3 | 227–281227–260 | 00 | 00.71 | —ND | —0.03 |
Ech024 | FM878355 MP15D03 | (CT)10 | F: [HEX]-TGTTTCTTGCTTGCTACTTCCACAGR: CCAAATATGTTGTGGTCGACAGTG | 239 | — | 28 | 5 | 184–203 | 0.50 | 0.55 | 0.01 | 0.14 |
Ech029 | FM878360 MP15D09b | (TC)17 | F: [HEX]-TGCAAAGAGGGAGAATCCACAGR: GCTATCCCGCCATCTCTTCTTC | 225 | 3 | 27 | 10 | 217–233 | 0.82 | 0.86 | 0.02 | 0.21 |
Ech030 | FM878361 MP15D10 | (CTAT)14 | F: [HEX]-TGACGCCGCAGGGACTTCR: GCTCAGCTCTTGCTCACAGTTTCAG | 298 | 2 | 28 | 11 | 202–258 | 0.79 | 0.86 | 0.04 | 0.17 |
Ech036 | FM878367 MP15F02 | (GT)18 | F: [HEX]-GAGAGGGTTCAGAATGACATCACGR: GTCCATGGGAGCAGACCTGAG | 211 | 2 | 28 | 16 | 191–231 | 0.89 | 0.94 | 0.02 | 0.31 |
Ech039 | FM878370 MP15F05 | (TTTCT)19 | F: [6FAM]-TCTTCCCAGGCAAGGAATAACR: CTCTGACTAATTGCTAAGAAATACTCCTG | 280 | — | 22 | 25 | 217–359 | 0.91 | 0.96 | 0.02 | 0.27 |
Ech050 | FM878381 MP15H07 | (AC)12 | F: [HEX]-TGTCCAAGTCAGCAAAGCATCCR: CGTCTGCTGGCTGGTGAGAG | 341 | 3 | 27 | 8 | 329–343 | 0.82 | 0.82 | −0.01 | 0.26 |
Ech051 | FM878382 MP15H10 | (GT)12 | F: [HEX]-AGTAGTCTACACCATTTCAAACCTCCTGR: TCTCAGCAACCATCCCAAATG | 202 | 2 | 27 | 8 | 199–209 | 0.44 | 0.45 | 0.01 | 0.65 |
Ech060 | FM878391 MP16B01 | (TG)9 | F: [6FAM]-CATGTTGGGTTGCAGAAGCR: AACAGACTGTAACACTATCAAGGTATATCATC | 250 | 2 | 25 | 4 | 242–250 | 0.52 | 0.55 | 0.00 | 0.25 |
Ech063 | FM878394 MP16B10 | (CA)9 | F: [6FAM]-TCTCAATAACAATTATAGCTTTTATCTGCAAACR: CCCCTCTGGCTCCAAGAAGT | 206 | — | 25 | 9 | 173–225 | 0.48 | 0.78 | 0.23 | 0.00 |
Ech065 | FM878396 MP16C03 | (GT)12 | F: [6FAM]-TGACATGTATGGGGAGGAAAGGTTR: ACACTGGGCCTGTGGGAAAA | 162 | 4 | 28 | 14 | 154–182 | 0.86 | 0.88 | 0.01 | 0.24 |
Ech071 | FM878402 MP16D08 | (CTCAT)14 | F: [HEX]-CAGCCCACCGGTCTCTTACAGR: TGCAATGGTCTCTTCAGGAGATG | 250 | 4 | 27 | 11 | 209–254 | 0.93 | 0.89 | −0.03 | 0.69 |
Ech081 | FM878412 MP16F07 | (CAAGG)9 | F: [HEX]-GGTGCAACCCTTGCATGACCR: GGACTGCCGGCACTTCTTAGC | 273 | — | 27 | 7 | 235–257 | 0.96 | 0.73 | −0.20 | 0.34 |
Ech091 | FM878422 MP16H07 | (GT)9 | F: [HEX]-TCCGCAGTTCACGAGGAGTCR: ACAAGCCCTCTGCCTGTCTTG | 396 | 1 | 26 | 10 | 395–406 | 0.73 | 0.84 | 0.06 | 0.00 |
Ech113 | FM878444 MP17E05 | (GT)9 | F: [6FAM]-TGTGTTTACATTCAATCCATCAATGCR: GCAGGTAGGAACATTAAGTGAACCTTG | 210 | 2 | 24 | 7 | 201–227 | 0.25 | 0.64 | 0.45 | 0.00 |
Ech130 | FM878461 MP17H07 | (CA)9 | F: [6FAM]-TCCTCAGTTTATGTGAAGTGGAAGATTTR: CATGCTGACGCTGTCAGAACAT | 196 | — | 13 | 12 | 173–235 | 0.31 | 0.94 | 0.49 | 0.00 |
- N, number of unrelated macaroni penguins Eudyptes chryisolophus genotyped belonging to the colony located at Macaroni Cwm, Bird Island, South Georgia; F, female; M, male; A, number of alleles observed; HO, observed heterozygosity; HE, expected heterozygosity; F (null), estimated frequency of null alleles calculated using CERVUS v3 (Marshall et al. 1998; Kalinowski et al. 2007); ND, not done; P (HW), probability of deviation from Hardy–Weinberg proportions calculated using genepop v4 (Raymond & Rousset 1995; Rousset 2007).
- Note: The majority of loci included in multiplex sets were found to amplify most reliably and were the easiest to score.
Twenty-five loci were polymorphic when tested in 28 individuals sampled from the colony located at Macaroni Cwm (Table 1). Individuals were sexed using the Z-002D primer set (Dawson 2007). Loci were checked for sex linkage based on the genotyping of known-sex macaroni penguins [17 females (ZW) and seven males (ZZ)]. Twenty-four of the 25 polymorphic loci displayed heterozygotes in females and amplified in males indicating they were not sex linked. One locus, Ech020, displayed nine different alleles but was homozygous in all 20 individuals in which it amplified (13 females, four males, three unknown sex). The evidence of homozygous females (ZW) suggests that Ech020 is Z-linked. There was possible evidence for null alleles at this locus as eight individuals failed to amplify (four females, three males, one unknown sex), which may explain why all males were homozygotes. The sequence of locus Ech020 (FM878351) displayed significant homology to sequence located on the chicken Z chromosome, confirming its Z-linked status (homology detected by following the procedure described by Dawson et al. 2006).
Observed and expected heterozygosity and predicted null allele frequency were calculated using CERVUSv3 (Marshall et al. 1998; Kalinowski et al. 2007). Tests for departure from Hardy–Weinberg proportions and linkage disequilibrium were implemented in genepopv4 (Raymond & Rousset 1995; Rousset 2007) for up to 28 individuals sampled from the colony located at Macaroni Cwm. None of the loci showed genotypic disequilibrium after a sequential Bonferroni correction for multiple comparisons (Rice 1989).
The genotypes of six loci showed significant departures from Hardy–Weinberg proportions after a sequential Bonferroni correction for multiple comparisons (P = 0.002; loci Ech005, Ech012, Ech063, Ech091, Ech113 and Ech130; Table 1). Some of these loci may possess null alleles but others may simply require further PCR optimization. Several of these loci show potential utility for genotyping other penguin species (see below).
Seventeen of the polymorphic loci were selected and assessed for multiplexing potential. The selected loci showed no significant deviation from Hardy–Weinberg equilibrium and had a low estimated null allele frequency when amplified separately (as singleplexes). For these loci, the same 28 individuals were amplified individually as a singleplex PCR and then as part of a multiplex set. Four loci (Ech003, Ech024, Ech050 and Ech051) were difficult to score or did not amplify well when part of their assigned multiplex set. However, 13 loci were easily scorable and the same genotypes were obtained following singleplex and multiplex genotyping. The mean observed and expected heterozygosity of these 13 loci measured in up to 28 individuals sampled from the colony located at Macaroni Cwm was 0.64 and 0.76, respectively, and the combined first and second parent exclusion probability was better than 99.9%.
Prior to this study, a total of only 39 penguin microsatellite loci had been isolated and from few penguin species: Humboldt penguin (Spheniscus humboldti) (Schlosser et al. 2003), little blue penguin (Eudyptula minor) (Billing et al. 2007), yellow-eyed penguin (Megadyptes antipodes) (Boessenkool et al. 2008) and Adélie penguin (Pygoscelis adeliae) (Roeder et al. 2001, 2002). We therefore tested all 25 loci for their cross-species utility in other species of penguin (Table 2). Three unrelated individuals from king penguin (Aptenodytes patagonicus), gentoo penguin (Pygoscelis papua), chinstrap penguin (Pygoscelis antarctica) and Adélie penguin (Pygoscelis adeliae) were genotyped. Seventeen markers were polymorphic in at least one of the four other penguin species tested (Table 2). Eight loci (Ech008, Ech012, Ech030, Ech051, Ech060, Ech071, Ech081 and Ech091) were polymorphic in at least three of the four additional species tested (Table 2). Despite testing only three individuals, between eight and 12 new polymorphic microsatellite loci were identified for each species (Table 2). If necessary, amplicons could be sequenced and new species-specific primers designed. These macaroni penguin microsatellite loci are expected to be useful for population genetic studies in a variety of penguin species.
Locus* | Ech003 | Ech005 | Ech007 | Ech008 | Ech009 | Ech010 | Ech011 | Ech012 | Ech014 | Ech020 (Z-linked) | Ech024 | Ech029 | Ech030 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Expected allele size range (bp)† | 175–185 | 281–533 | 254–323 | 184–203 | 217–230 | 396–408 | 164–177 | 216–304 | 218–238 | 227–281 | 184–203 | 217–233 | 202–258 |
Species | |||||||||||||
King penguin | 173 | [318, 334, 382, 384, 527] | 253 | 203, 205 | 214 | 398 | 170, 180, 182, 186, 188 | X | X | [240, 322, 346] | 235 | 205 | 270, 278, 282 |
Gentoo penguin | 173 | [477, 487, 507, 525] | 265 | 203 | 212 | 392 | 170 | 274, 286, 290, 304, 316, 326 | X | [226, 270, 378, 388] | 237 | 207 | 270, 274, 278, 282, 290 |
Chinstrap penguin | 173 | [605, 664] | 265 | 207, 209 | 212, 222 | 392, 396 | 160 | 316, 332, 336, 340, 348, 350 | X | [292, 322, 406] | X | 207 | 278, 286, 298, 302, 314 |
Adélie penguin | 173 | [501, 619, 643] | 265, 269, 274 | 203, 205 | 260, 268, 272, 292 | 392, 396 | 170, 172 | 260, 268, 272, 292 | X | [226, 252] | 234, 238, 240 | 207 | 282 |
Presumed (same) microsatellite amplified | Yes | No (/Highly variable) | Yes | Yes | Yes | Yes | Yes | Yes | — | No | Yes | Yes | Yes |
Number of species polymorphic | 0 | NM | 1 | 3 | 2 | 2 | 2 | 3 | X | NM | 1 | 0 | 3 |
- ‘X’, no amplification; NM, nonmicrosatellite/nonmatching macaroni penguin observed allele sizes. For three loci (Ech005, Ech020, Ech039), the increments in the observed allele size suggest that the amplicon was not a microsatellite. The sizes of some of the alleles amplified differed significantly from the size expected based on the cloned macaroni penguin sequence and the size of the alleles amplified in macaroni penguin, and may suggest that a different locus has been amplified in other species. The alleles of these loci are therefore presented in parentheses.
- Species tested: king penguin Aptenodytes patagonicus, gentoo penguin Pygoscelis papua, chinstrap penguin Pygoscelis antarctica and Adélie penguin Pygoscelis adeliae.
- *Each locus was tested separately (singleplexed) in three individuals per species; the alleles of polymorphic loci which are in the expected size range are presented in bold.
- †Based on the genotyping of 28 macaroni penguin (Eudyptes chrysolophus) individuals.
Acknowledgements
Blood and DNA samples were kindly provided by Robert Buckland, Ian Hartley, Fiona Hunter, Jagroop Pandhal, Richard Philips and Douglas Ross. We thank Andy Krupa for genotyping advice. Terry Burke provided comments on the manuscript. This work was performed at the Molecular Genetics Facility at Sheffield supported by the Natural Environment Research Council, UK.