Isolation and characterization of simple sequence repeat markers for the tetraploid forage grass Dactylis glomerata
Communicated by O. A. Rognli
Abstract
With 1 table
To isolate simple sequence repeat (SSR) markers from Dactylis glomerata, four SSR-enriched genomic libraries for different repeat motifs from genomic DNA of a Japanese orchardgrass variety, ‘Akimidori II’, were constructed. A total of 969 unique SSR-containing clones were identified and used for primer design. A BLAST search revealed that 63 of the sequences showed significant homology with sequences of other plant species including 18 that showed significant gene homology. After screening of a panel consisting of eight orchardgrass individuals, 606 SSR markers showed polymorphism, 70 generated monomorphic bands and the other primers either generated multiple bands or did not provide any amplification product. In the 606 polymorphic SSR markers, most had a large number of alleles, ranging from 1 to 13 with an average of 4.5 alleles per SSR primer pair. The number of alleles per individual ranged from 1 to 4. These SSR markers developed in this study will have broad utility for genotyping applications in this species, including gene targeting, quantitative trait locus mapping, variety or species identification and marker-assisted selection.
Dactylis glomerata (orchardgrass or cocksfoot) is an important forage species in temperate regions and is native to Europe, Asia and North Africa. It has also been introduced into North America. It is used as a hay grass and for pastures because of its high yields, moderate shade tolerance and high sugar content. Among Japan’s cultivated forage species, it is second only to timothy (Phleum pratense) in its cultivated acreage.
Orchardgrass has 28 chromosomes and is a mostly cross-pollinated, auto-tetraploid species with a large genome (4312 Mbp) (Creber et al. 1994, Bennett and Leitch 1997). Because of its auto-tetraploid status and large genome, the genetic markers that were reported for this species are mainly non-species-specific types such as isozymes (Tosun et al. 2002), randomly amplified polymorphic DNA (Kölliker et al. 1999, Tuna et al. 2004), amplified fragment length polymorphisms (AFLP) (Peng et al. 2006, 2008), inter-simple sequence repeats (ISSRs) (Zeng et al. 2006) and sequence-related amplified polymorphisms (Zeng et al. 2008).
The genomes of all eukaryotes contain a class of sequences referred to as simple sequence repeats (SSRs) (Tautz et al. 1986) or microsatellites (Litt and Luty 1989). SSRs with tandem repeats of a basic motif of ≤6 bp have emerged as important sources of ubiquitous genetic markers for many eukaryotic genomes (Wang et al. 1994). SSR markers have the advantages of being polymerase chain reaction (PCR)-based, multi-allelic, co-dominant and highly polymorphic. Especially because of their multi-allelic status, SSR markers will be highly useful in genetic studies of auto-polyploid species. Until the present study, no species-specific orchardgrass SSR markers had been reported, although some cross-species amplification tests using SSR markers from Lolium multiflorum × Festuca glaucescens and tall fescue (Festuca arundinacea) have been conducted (Rouf Mian et al. 2005, Litrico et al. 2009).
Here, we report the isolation and characterization of 969 orchardgrass SSR markers from four SSR-enriched genomic libraries. One screening panel consisting of eight orchardgrass individuals was tested with all 969 SSR markers to determine their amplification ability, detect polymorphism levels and identify a set of loci suitable for framework mapping.
Four SSR-enriched genomic libraries (CA/TG, GA/TC, AAG/TTC and TAGA/TCTA) were constructed by Genetic Identification Services (Chatsworth, CA, USA) from one individual of ‘Akimidori II’, a Japanese orchardgrass variety, and the sequences (about 700 bp long) were obtained at Dragon Genomics (Yokkaichi, Mie, Japan). Detailed methods were described in Cai et al. (2003). After sequencing, the Phred values (Brent et al. 1998) were calculated, and clones with Phred values >15 and longer than 100 bp were used for primer design.
A total of 4000 clones (1000 from each of the four libraries) were sequenced. After excluding a few clones whose sequences flanking the SSR motifs were too short to allow us to design both forward and reverse primers, a total of 969 unique SSR clones (sequences containing at least seven dinucleotide repeats or five trinucleotide repeats or three tetranucleotide repeats or two pentanucleotide repeats) were identified and used for design primers. The efficiency of SSR marker isolation was 24.2% (of the sequenced clones), and it was similar to that in zoysiagrass (19.2%, Cai et al. 2005) but lower than the results obtained in sunflower (49.2%, Tang et al. 2002), Italian ryegrass (25.6%, Hirata et al. 2006) and timothy (33.3%, Cai et al. 2003) through the use of SSR-enriched libraries produced by Genetic Identification Services, perhaps because our libraries have a high level of redundancy. The percentage of unique SSR clones in library B (motif GA/TC, 55.6%) was higher than in the other libraries (10.7–21.2%); this result agreed with that in timothy (Cai et al. 2003) and zoysiagrass (Cai et al. 2005).
The 969 unique SSR sequences were searched against the DDBJ nucleotide databases to identify the relationships between the SSR sequences for known genes. BLASTX searches were run on 22 June 2009, and a bits score >100 and an E-value ≤1 × 10−5 were considered to be significant. And a total of 63 sequences were found that showed significant homology with the sequences of other plant species. Of these, 34 showed significant homology with genomic or cDNA sequences with unknown functions from rice and other plant species, but 18 showed significant homology with known genes (Table 1), five showed significant homology with transposable elements, five were similar to microsatellite sequences from two other species and one was similar to a chloroplast gene in the Agrostis stolonifera cultivar ‘Penn A-4’ and also had a high and significant homology with rice genomic DNA.
Clone Name | Matched clone | Putative identification | Bits score and E-value |
---|---|---|---|
a01c20 | AF544047 | Hordeum vulgare knotted 4 (kn4) gene, partial cds | 194 bits (98), Expect = 1e−46 |
a01g10 | AP008209 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 3 | 115 bits (58), Expect = 2e−22 |
a01h18 | EF067844 | Hordeum vulgare vrs1 locus, complete sequence; Hox1 gene, complete cds | 202 bits (102), Expect = 1e−48 |
a01h20 | AP008209 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 3 | 153 bits (77), Expect = 8e−34 |
a01m08 | EF067844 | Hordeum vulgare vrs1 locus, complete sequence; Hox1 gene, complete cds | 165 bits (83), Expect = 3e−37 |
a01p11 | AP008213 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 7 | 105 bits (53), Expect = 2e−19 |
a02g09 | X56004 | Wheat gstA2 gene for glutathione-S-transferase | 260 bits (131), Expect = 4e−66 |
a02g17 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 153 bits (77), Expect = 9e−34 |
a03g12 | AB108174 | Phleum pratense DNA, SSR, microsatellite A08-D10 | 125 bits (63), Expect = 2e−25 |
a03n22 | EU282577 | Hordeum vulgare subsp. vulgare cultivar Galleon marker ZIG genomic sequence | 109 bits (55), Expect = 1e−20 |
a04a06 | M63224 | Wheat germin 9f-3.8 gene, complete cds | 107 bits (54), Expect = 5e−20 |
a04m05 | AK248621 | Hordeum vulgare subsp. vulgare cDNA clone:FLbaf142j22, mRNA sequence | 119 bits (60), Expect = 1e−23 |
a119 | AF474373 | Hordeum vulgare subsp. vulgare BAC 259I16, complete sequence | 107 bits (54), Expect = 6e−20 |
a157 | AJ872227 | Festuca glaucescens × Lolium multiflorum microsatellite DNA, clone B2-G3 | 141 bits (71), Expect = 4e−30 |
a167 | AJ872227 | Festuca glaucescens × Lolium multiflorum microsatellite DNA, clone B2-G3 | 137 bits (69), Expect = 4e−29 |
b01b04 | AP008207 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 1 | 149 bits (75), Expect = 2e−32 |
b01d09 | EU835980 | Triticum aestivum clone BAC 502E09, complete sequence | 107 bits (54), Expect = 6e−20 |
b01e04 | AB061327 | Avena sativa LTR-retrotransposon OARE-1 gag-pol pseudogene for polyprotein | 281 bits (142), Expect = 2e−72 |
b01h12 | AB061327 | Avena sativa LTR-retrotransposon OARE-1 gag-pol pseudogene for polyprotein | 270 bits (136), Expect = 5e−69 |
b01h13 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 226 bits (114), Expect = 1e−55 |
b01m05 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 480 bits (242), Expect = e−132 |
b02a06 | AP008217 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 11 | 123 bits (62), Expect = 9e−25 |
b02d15 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 163 bits (82), Expect = 1e−36 |
b02i20 | AP008215 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 9 | 105 bits (53), Expect = 2e−19 |
b02j02 | AJ872261 | Festuca glaucescens × Lolium multiflorum microsatellite DNA, clone B4-D9 | 170 bits (86), Expect = 2e−39 |
b02m15 | AP008208 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 2 | 161 bits (81), Expect = 4e−36 |
b02p06 | AK252159 | Hordeum vulgare subsp. vulgare cDNA clone: FLbaf149g10, mRNA sequence | 101 bits (51), Expect = 1e−18 |
b02p12 | AP008215 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 9 | 172 bits (87), Expect = 1e−39 |
b03c22 | AB061327 | Avena sativa LTR-retrotransposon OARE-1 gag-polpseudogene for polyprotein | 206 bits (104), Expect = 9e−50 |
b03g21 | EU944527 | Zea mays clone 221566 mRNA sequence | 238 bits (120), Expect = 2e−59 |
b03m21 | DQ900685 | Hordeum vulgare clone BAC 761F4, complete sequence | 190 bits (96), Expect = 4e−45 |
b03n15 | AK251621 | Hordeum vulgare subsp. vulgare cDNA clone: FLbaf119d05, mRNA sequence | 103 bits (52), Expect = 7e−19 |
b04e13 | AP008214 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 8 | 333 bits (168), Expect = 5e−88 |
b04g03 | AK253068 | Hordeum vulgare subsp. vulgare cDNA clone: FLbaf82i11, mRNA sequence | 246 bits (124), Expect = 7e−62 |
b04m16 | EF115543 | Agrostis stolonifera cultivar Penn A-4 chloroplast, complete genome | 178 bits (90), Expect = 2e−41 |
b05b14 | DQ680854 | Avena strigosa cytochrome P450 CYP51H11 (Cyp51H11) gene, complete cds | 99.6 bits (50), Expect = 7e−18 |
b05b22 | AP008209 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 3 | 103 bits (52), Expect = 8e−19 |
b05c05 | AK251223 | Hordeum vulgare subsp. vulgare cDNA clone: FLbaf111m19, mRNA sequence | 133 bits (67), Expect = 1e−27 |
b05e16 | AB108302 | Phleum pratense DNA, SSR, microsatellite B01-G12 | 276 bits (139), Expect = 9e−71 |
b05m12 | FJ234838 | Brachypodium sylvaticum clone BAC 37D5 hardness locus region genomic sequence | 145 bits (73), Expect = 2e−31 |
b05p14 | AF474071 | Hordeum vulgare subsp. vulgare cultivar Morex BAC clone 745c13, complete sequence | 125 bits (63), Expect = 2e−25 |
b06a08 | BT042060 | Zea mays full-length cDNA clone ZM_BFb0123F08 mRNA, complete cds | 101 bits (51), Expect = 4e−18 |
b06c18 | AP008210 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 4 | 129 bits (65), Expect = 1e−26 |
b06d06 | AM932685 | Triticum aestivum 3B chromosome, clone BAC TA3B95F5 | 194 bits (98), Expect = 4e−46 |
b06h15 | EU835198 | Triticum turgidum subsp. dicoccoides clones BAC 391M13 and BAC 1144M20 genomic sequence | 109 bits (55), Expect = 1e−20 |
b06l02 | AP008209 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 3 | 151 bits (76), Expect = 5e−33 |
b06m23 | AP008216 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 10 | 103 bits (52), Expect = 7e−19 |
b06n09 | AY038013 | Avena sativa isolate Pc68LrkC5 sequence containing retrotransposon and repetitive DNA linked to receptor kinase gene | 123 bits (62), Expect = 7e−25 |
b06n20 | AK251328 | Hordeum vulgare subsp. vulgare cDNA clone: FLbaf110k10, mRNA sequence | 103 bits (52), Expect = 4e−19 |
b6 | AK248140 | Hordeum vulgare subsp. vulgare cDNA clone: FLbaf16k19, mRNA sequence | 113 bits (57), Expect = 9e−22 |
c01b11 | AY038013 | Avena sativa isolate Pc68LrkC5 sequence containing retrotransposon and repetitive DNA linked to receptor kinase gene | 151 bits (76), Expect = 4e−33 |
c01b16 | AY642926 | Hordeum vulgare BAC CC24_14, complete sequence | 105 bits (53), Expect = 3e−19 |
c01g14 | EU601173 | Agrotis ipsilon cytoplasmic actin mRNA, complete cds | 226 bits (114), Expect = 6e−56 |
c01h19 | DQ537337 | Triticum aestivum clone BAC 897M20, complete sequence | 99.6 bits (50), Expect = 1e−17 |
c01k17 | FJ194457 | Triticum aestivum bZip type transcription factor bZIP1 mRNA, complete cds | 119 bits (60), Expect = 1e−23 |
c01k22 | AP008207 | Oryza sativa (japonica cultivar group) genomic DNA, chromosome 1 | 141 bits (71), Expect = 3e−30 |
c01l10 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 232 bits (117), Expect = 2e−57 |
c02b13 | FJ829742 | Avena strigosa clone L7M2.1 putative resistance protein gene, complete cds | 331 bits (167), Expect = 2e−87 |
c02j03 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 543 bits (274), Expect = e−151 |
c03f03 | AM489608 | Lolium perenne rk1 gene, px1 gene and hd1 gene | 99.6 bits (50), Expect = 1e−17 |
c03f12 | DQ680849 | Avena strigosa beta-amyrin synthase (Sad1) and cytochrome P450 CYP51H10 (Sad2) genes, complete cds | 256 bits (129), Expect = 9e−65 |
c9 | AY642926 | Hordeum vulgare BAC CC24_14, complete sequence | 117 bits (59), Expect = 5e−23 |
d02o04 | CT835259 | Oryza sativa (indica cultivar group) cDNA clone:OSIGCSN044C14, full insert sequence | 119 bits (60), Expect = 1e−23 |
The SSR loci were classified by repeat type and structure into four categories: perfect, imperfect, interrupted or compound repeat defined by Jones et al. (2001). All four libraries contained perfect clones at very high frequencies, with an average of 88.4% (ranging from 83.4% to 93.4%). Compound repeats were contained at higher frequencies in library D (14.4%) than in the others (3.3–8.8%). Imperfect clones were contained at low frequencies (≤1.0%) in each library. In all four libraries, the predominant motif was the expected type (CA/TG for A, GA/TC for B, AAG/TTC for C and TAGA/TCTA for D). In the most frequent perfect clones that contained dinucleotide, trinucleotide and tetranucleotide motifs, the average repeat numbers were 19.1 (CA/TG), 22.3 (GA/TC), 13.7 (AAG/TTC) and 12.0 (TAGA/TCTA).
Although AT/TA is the most common dinucleotide motif in plants (Cardle et al. 2000), this motif is not usually used in SSR-enrichment procedures owing to its self-complementary nature; therefore, we did not use AT/TA in our study. Of 857 perfect SSR clones, the motifs occurring at the highest rates were GA/TC (58.7%), CA/TG (20.6%), AAG/TTC (12.9%) and TAGA/TCTA (7.8%). These results and those in timothy (Cai et al. 2003) and zoysiagrass (Cai et al. 2005) are similar to those reported by Cardle et al. (2000), who reported that the most common dinucleotide motif found in plants is AT/TA, followed by GA/TC and CA/TG, and that the most common trinucleotide motifs were AAT/TAA and ATC/TAG, whereas AAG/TTC was dominant in Arabidopsis thaliana.
A panel consisting of eight orchardgrass individuals was used for testing PCR amplification. The panel included three individuals of ‘Akimidori II’, two individuals of ‘Loke’ (a Swedish variety) and three other orchardgrass individuals collected from natural grassland at Nasushiobara, Tochigi, Japan. The primers design, PCR and fragment analysis were according to Cai et al. (2003). Of the 969 primer pairs tested, 606 (62.5%) amplified polymorphic products in the eight orchardgrass individuals used, 70 (7.2%) amplified monomorphic products, 56 (5.8%) amplified multiple bands, and 237 (24.5%) amplified no bands. The percentage of functional primers (62.5% polymorphic and 7.2% monomorphic) in our study was 69.7%, which is higher than those in wheat (47.1%, Röder et al. 1998), and similar to those in timothy (74%, Cai et al. 2003), but lower than those in barley (83.4%, Ramsay et al. 2000), Italian ryegrass (90.4%, Hirata et al. 2006), perennial ryegrass (81%, Jones et al. 2001), sunflower (87.9%, Tang et al. 2002) and zoysiagrass (89.7%, Cai et al. 2005). The screening panel included three individuals of ‘Akimidori II’, two individuals of ‘Loke’ and three other orchardgrass individuals collected at same region (Nasushiobara, Tochigi, Japan), the close relationship of these individuals may be the reason of low rate of polymorphism.
The percentage of polymorphic primers in library B was 73.8% (398/539)—higher than those observed in the other three libraries (about 50%). Most markers had a large number of alleles, ranging from 1 to 13 (Table S1). The number of primer pairs that produced one allele per individual was 66 (10.9%) vs. 277 (45.7%) for two alleles, 184 (30.4%) for three alleles and 79 (13.0%) for four alleles (Table S1).
A pseudo-testcross F1 population consisting of 88 individuals derived from a cross between two orchard grass cultivars, ‘Akimidori II’ and ‘Loke’, was used to detect polymorphism of the mapping population. The efficiency of the polymorphic markers in a mapping population was also tested by using the 606 SSR markers in a screening between the two parents of the mapping population and revealed that 53% (321) of the markers were polymorphic.
We isolated and characterized 969 SSR sequences from four SSR-enriched libraries and used them to develop 606 polymorphic SSR markers by screening eight orchardgrass individuals. Orchardgrass is one of the best-documented examples of a natural intraspecific polyploidy complex, but before these SSR markers were developed, most studies were based on the geographical distribution, comparative morphology, cytogenetic structure and interploidal hybridization of the species. When combined with recent isozyme and other non-species-specific molecular markers, the SSR markers developed in this study will be powerful tools for studying old problems such as origin, differentiation, gene flow and gametic non-reduction in Dactylis, the oldest studied grass species. The markers will also provide an ideal marker system for studying new challenges in modern breeding, such as gene targeting, quantitative trait locus mapping, variety or species identification and marker-assisted selection in Dactylis species.
Acknowledgements
This study was financially supported by a grant from the Japan Racing Association. We also thank two anonymous reviewers for their valuable comments on this manuscript.