Divergent genes in potential inoculant Sinorhizobium strains are related to DNA replication, recombination, and repair
Abstract
To serve as inoculants of legumes, nitrogen-fixing rhizobium strains should be competitive and tolerant of diverse environments. We hybridized the genomes of symbiotically efficient and salt tolerant Sinorhizobium inoculant strains onto the Sinorhizobium meliloti Rm1021 microarray. The number of variable genes, that is, divergent or putatively multiplied genes, ranged from 503 to 1556 for S. meliloti AK23, S. meliloti STM 1064 and S. arboris HAMBI 1552. The numbers of divergent genes affiliated with the symbiosis plasmid pSymA and related to DNA replication, recombination and repair were significantly higher than expected. The variation was mainly in the accessory genome, implying that it was important in shaping the adaptability of the strains.
Introduction
With growing areas of infertile and degraded land taken into use, the need for well-adapted nitrogen-fixing symbiotic leguminous plants is increasing. Degraded soils are often devoid of compatible rhizobia and legume plants must therefore be inoculated with appropriate strains 1. Apart from being efficient nitrogen fixers the inoculant strains should be competitive and tolerant of diverse environments. Because of the complex requirements, no successful laboratory tests have so far been developed for the selection of strains with desired properties.
Sinorhizobium meliloti is widely used for the production of inoculants for temperate Medicago species, mainly for M. sativa. However, also many tropical plants form symbioses with Sinorhizobium species 2. There is thus a demand for Sinorhizobium inoculants possibly suited for inoculation of plants in arid area. S. meliloti Rm1021, the model strain for molecular work, is an antibiotic resistant derivative of the inoculant strain SU47 3. Rm1021 is less efficient in symbiosis than its parent strain SU47 3, 4. In 2001, the genome sequence of S. meliloti Rm1021 (synonym Ensifer meliloti Rm1021) was published 5, paving the way for microarray development. With the microarrays, differential gene expression to various stress factors were determined for Rm1021 at a genome level 6-8, but the genetic basis for stress tolerance in wild-type inoculant strains was hardly studied so far.
We hypothesized that efficient inoculant strains should carry genes varying from those of Rm1021. A Rm1021 microarray was used for comparative genome hybridisations (CGH) with three salt tolerant inoculant strains. The aim was to unravel genomic features in the wild-type strains that could be related to their symbiotic efficiency and salt tolerance. We compared the physical and the functional genome composition of the reference strain Rm1021 and the wild-type strains by determining the numbers and locations of variable, that is, divergent or putatively multiplied, genes as distributed on the Rm1021 replicons (chromosome, pSymA, pSymB), and by looking for variation in clusters of orthologous groups (COGs) and in genes expressed differentially by Rm1021 after prolonged exposure to NaCl and osmotic shock.
Materials and methods
S. meliloti AK23 (RSAM1774) isolated from M. officinalis (Kazakhstan), nodulates effectively Medicago officinalis and M. sativa 9. Sinorhizobium meliloti STM 1064 from M. minima (Algeria), nodulates effectively M. minima, M. sativa, and M. ciliaris (de Lajudie, unpublished results). S. arboris HAMBI 1552 isolated from Prosopis chilensis (Sudan) is one of the most efficient symbionts of several Acacia sp. and Prosopis sp. 2. In liquid medium, AK23 and HAMBI 1552 tolerate 600 and 750 mmol l−1 NaCl, respectively 10, 11, whereas Rm1021 grows poorly on LB agar with 400 mmol l−1 NaCl 12. The salt tolerance of STM 1064 was estimated by growing the strain in liquid M9 medium with 0, 600, and 800 mmol l−1 NaCl. Initial OD600 was adjusted to 0.1. STM 1064 grew in 600 mmol l−1 NaCl, while in 800 mmol l−1 NaCl there was no detectable growth.
For microarray experiments, the strains were cultivated in TY medium 13 at 30 °C in 50 ml aliquots at 180 rpm. Rm1021 was cultivated with appropriate antibiotics (streptomycin, 600 μg ml−1; nalidixic acid, 8 μg ml−1; kanamycin, 200 μg ml−1). Cultures were harvested after 19–22 h of incubation at OD600 = 0.8. DNA was extracted with the FastDNA Kit (MP Biomedicals) according to the manufacturer's instructions and quantified by spectrophotometry (Biophotometer, Eppendorf AG, Germany). Eight microgram of DNA was labelled by incorporation of Cy5-dCTP and Cy3-dCTP fluorescent dyes (test DNA/reference DNA (Rm1021) and vice versa) (Amersham Biosciences, UK) using the RadPrime DNA labelling kit (Invitrogen) and purified using PCR purification kit (Qiagen, Netherlands). The test and reference DNA were combined with 10 µg Herring sperm DNA, 20 µg poly-(dA) and 100 µg ribonuclease free yeast tRNA, vacuum dried, resuspended in 80 µl hybridisation buffer (50% formamide/6xSSC/0.5% SDS/5xDenhart's) and hybridized to the Sm6kOligo microarray 7 at 42 °C overnight. Arrays were scanned with GenePix Autoloader 4200A (Molecular Devices, LLC). The images were processed using GenePix 6.0 software.
Systematic variation normalization was performed using the R software 14. The divergent genes were identified using permutation-T-tests where the parameter for false discovery rate was set to zero. The probability of a gene to be randomly picked as divergent was calculated. Two deletion mutants of S. meliloti Rm1021, namely S. meliloti Rm1021 ΔsitA and S. meliloti Rm1021 Δfur, were used to optimize the CGH experiment. Both the reference strain and the test strains were labelled with both dyes and the experiment was done in duplicate. However the signal intensity obtained in the alternating dyes, particularly on the test strains was not proportional. Since swapping the dye did not produce the expected result we decided to label the reference strain with Cy5 and the test strains with Cy3 in subsequent experiments.
ImaGene 5.0 software (Biodiscovery Inc., Los Angeles, CA) was used for spot detection, image segmentation, and signal quantification. The log2 value of the intensity ratios was calculated for each spot with Mi = log2(Ri/Gi), where Ri = Ich1i-Bgch1i and Gi = Ich2i-Bgch2i with Ich1i and Ich2i being the intensity of a spot in channel l or channel 2, and Bgch1i and Bgch2i being the background intensity of a spot in channel 1 or channel 2, respectively. The mean intensity was calculated for each spot with Ai = log2(RiGi) 0.5. Normalization and t-statistics were carried out using the Emma 1.1 software 15. A gene was considered significantly different if p ≤ 0.05, the log2 ratio of the intensities (M-value) was ≥1 or ≤−1, and the mean intensity (A-value) was ≥8.
For K-means clustering analysis of the microarray data, the Genesis© software was used 16. Clustering with TMEV (version 3.1) software revealed that the replicate slides behaved in almost identical manner. The MA plot generated after the T-permutation test also confirmed the clustering. We calculated whether the distribution of divergent genes on different replicons, in different COG categories, and in genes affected by prolonged cultivation with 380 mmol l−1 NaCl 8 and by osmotic shock 6 were significant in comparison with the total number of genes with the Chi-square (χ2) test. The COG categories are as listed at Galardini et al. 17 (http://www.genomebiology.com/content/supplementary/1471-2164-12-235-s2.docx).
Results and discussion
To search for genetic determinants for effective nodulation and salt stress tolerance, we compared the genomes of three salt tolerant inoculant Sinorhizobium strains to the genome of S. meliloti Rm1021. In earlier studies, the number of variable genes in wild-type S. meliloti strains ranged from 170 to 653 18, 19. The variation was connected neither to original host nor to site of isolation 19. In our study, S. arboris HAMBI 1552 had more variable genes than the phylogenetically more closely related S. meliloti strains AK23 and STM 1064 (Table 1). It should be noted that CGH gives information only on the genes present in the Rm1021 genome. As in earlier studies 18, 19, the majority of the variable genes of the wild type strains showed decreased hybridisation intensity indicating gene deletion or sequence divergence (referred as divergent genes hereafter) in the probe region (Table 1).
Rm1021 | AK23 | STM1064 | HAMBI 1552 | ||||
---|---|---|---|---|---|---|---|
Assignmenta | No. genes | Divergent genesb (%) | Multiplied genesb (%) | Divergent genesb (%) | Multiplied genesb (%) | Divergent genesb (%) | Multiplied genesb (%) |
Location | |||||||
Genome | 6208 | 6.4 | 1.7 | 9.0 | 2.0 | 16.9 | 8.1 |
Chromosome | 3343 | 3.5 | 1.4 | 3.2 | 1.9 | 5.8 | 11.8* |
pSymA | 1293 | 19.8* | 1.4 | 29.0* | 1.5 | 53.1* | 1.0 |
pSymB | 1572 | 1.8 | 2.4 | 4.8 | 2.5 | 11.0 | 6.2 |
COGs | |||||||
COG J | 186 | 2.2 | 1.1 | 2.2 | 2.2 | 2.2 | 23.7* |
COG L | 288 | 20.5* | 3.1 | 27.4* | 1.0 | 44.4* | 7.3 |
Transposase genes | 119 | 29.4 | 3.4 | 42.9 | 0.8 | 80.7 | 3.4 |
COG X | 1342 | 9.0 | 1.9 | 13.9* | 2.4 | 25.0* | 6.8 |
- a COG categories: J: translation, ribosomal structure, and biogenesis; L: DNA replication, recombination and repair; X: no assigned COG category.
- b Replicons and COG categories for which the number of variable genes was higher than expected in the Chi-square test and p < 0.05 are marked with an asterisk (*).
For further insight into the genomic variation, we determined the location of the variable genes on the three genetic elements of Rm1021. In general, the housekeeping genes are on the chromosome, the genes related to symbiosis on pSymA and the genes related to survival in the free-living state in soil on pSymB 5. The genes that show increased hybridisation intensity are considered putatively duplicated 20. In S. arboris HAMBI 1552, the putatively duplicated genes were more abundant on the chromosome than on other genetic elements (Table 1). As in earlier studies 18, 19, 21, in all the strains the divergent genes were more abundant on pSymA (Table 1). Contrary to the chromosome and pSymB, on pSymA the proportion of accessory genes is higher than that of the core genes 19. The accessory genes include genes related to environmental adaptation that are likely to explain the phenotypic variation among rhizobial strains 17, 22.
For deeper understanding of the genomic differences, the variable genes were assigned to clusters of orthologous group (COG) categories 23. Earlier studies have shown that phosphate starvation, osmotic shock, and salt stress result in differential gene expression 6-8. Considering both all the Rm1021 genes and those induced by osmotic shock 6, 8, in the wild-type strains the divergent genes were more abundant in COG category L, that is, DNA replication, recombination and repair genes (Tables 1 and 2). With the exception of genes with no assigned COG category (COG X) of STM1064 and HAMBI 1552, the relative proportion of divergent genes in other COGs reflected the overall divergence. Stress induced mutagenesis is important in the adaptation of bacteria to changing conditions 24, and COG L genes are involved in the process. For S. meliloti strains AK83 and BL225C, the genes in the COG L were the most diverse, and COG L and COG X were the only groups where the number of accessory genes was higher than that of core genes 17. In Rm1021, 20 genes in the COG L that were affected by osmotic shock code for transposases 6, out of which all but one duplicated gene were divergent in HAMBI 1552 (Table 1). Eight and nine of the ten transposase genes induced by osmotic shock were divergent in AK23 and STM 1064, respectively (Table 1). In addition, the relative proportion of divergent transposase genes in the wild type strains was 4.5–4.8 times higher than that of all the divergent genes (Table 1). Genomic rearrangement by transposition could be considered to play a major role in adaptation 17. Although we cannot conclude how the divergence among the COG L and transposase genes affects the wild type strains, the observation implies that the strains have a strategy considerably different to that of Rm1021 for responding to osmotic shock and possibly to other stress conditions. In Rm1021, 444 genes were down-regulated under osmotic stress 6. Altogether 40 COG J genes, that is, translation, ribosomal structure, and biogenesis genes, were down-regulated by osmotic shock, presumably because protein biosynthesis is less intense at the lower growth rate caused by stress 6. In all the wild-type strains, the proportion of genes divergent from the down-regulated genes was approximately half of the total divergence (Tables, 2 1), implying that these genes and possibly also the down-regulation response are conserved. Duplication of genes is often associated with functional diversification 25. Altogether 44 COG J genes, including twenty osmotic shock affected genes, were duplicated in HAMBI 1552 (Table 1), reflecting the evolutionary distance between the S. meliloti and S. arboris strains.
Rm1021 | AK23 | STM1064 | HAMBI 1552 | ||||
---|---|---|---|---|---|---|---|
Assignmenta | No. genesb | Divergent genesc (%) | Multiplied genesc (%) | Divergent genesc (%) | Multiplied genesc (%) | Divergent genesc (%) | Multiplied genesc (%) |
Osmotic shock regulated genes | |||||||
Up-regulated | 539 | 7.2 | 1.9 | 8.5 | 1.5 | 16.1 | 6.9 |
COG L | 21 | 47.6* | 9.5 | 52.4* | 0.0 | 52.4 | 4.8 |
Transposase genes | 10 | 80.0 | 10.0 | 90.0 | 0.0 | 90.0 | 10.0 |
Down-regulated | 444 | 3.4 | 1.8 | 3.6 | 2.5 | 7.4 | 13.1 |
COG J | 40 | 2.5 | 2.5 | 2.5 | 7.5 | 0.0 | 50.0* |
COG L | 20 | 5.0 | 5.0 | 15.0 | 5.0 | 55.0* | 20.0 |
Transposase genes | 10 | 0.0 | 0.0 | 20.0 | 0.0 | 100.0 | 0.0 |
Salt stress regulated genes | |||||||
Up-regulated | 52 | 7.7 | 0.0 | 7.7 | 0.0 | 7.7 | 7.7 |
Down-regulated | 85 | 14.1 | 1.2 | 14.1 | 1.2 | 10.6 | 11.8 |
- a COG categories: J: translation, ribosomal structure, and biogenesis; L: DNA replication, recombination, and repair; X: no assigned COG category.
- b The number of genes for osmotic shock and salt stress affected genes from Domínguez-Ferreras et al. 6 and Rüberg et al. 8, respectively.
- c COG categories for which the number of variable genes was higher than expected in the Chi-square test and p < 0.05 are marked with an asterisk (*).
The long term cultivation of Rm1021 in 380 mmol l−1 NaCl up-regulated 52 and down-regulated 85 genes 8. Both AK23 and STM 1064 were divergent from 12 of the down-regulated genes (Table 1). Seven of these were related to synthesis of rhizobactin that is involved in the ability of Rm1021 to survive in iron deficient soils 26. Actually, all the eight rhizobactin operon genes of Rm1021 were divergent in AK23 and STM 1064 (Table S1). Contrary to the wild-type S. meliloti strains, the genes of HAMBI 1552 variable to the salt stress affected genes of Rm1021 included equal number of multiplied and divergent genes, among others, three divergent rhizobactin genes and two multiplied iron uptake genes. The rhizobactin operon is missing from the genomes of Aral sea isolates S. meliloti AK 21 and AK83 17, 27, implying that the wild type strains have alternative, possibly more stress tolerant, systems for iron uptake.
The 22 nodulation linked genes on the Rm1021 microarray include both structural nod genes and the nodD genes which are involved in the activation of Nod factor synthesis 7, 28. The NodD1 and NodD2 in Rm1021 are responsive to different compounds in plant root secretions whereas NodD3 is plant inducer-independent 28. Both AK23 and STM 1064 had two variable nodulation genes (Table 3). The multiplication of nodD3 and Nod factor secretion-linked nodJ in AK23 and STM 1064, respectively, is possibly connected to the efficient nodulation capabilities of the strains. Rm1021 and HAMBI 1552 have different host ranges; they nodulate herbaceous legumes and tropical trees, respectively. Also, the structures of their Nod factors are different 28, 29. In line with these differences, the nodD2, the nodL connected to acetylation of the Nod factor non-reducing terminal and nodE linked to Nod factor acylation by polyunsaturated fatty acids were divergent (Table 3).
Gene | Gene product | AK023 | STM1064 | HAMBI1552 |
---|---|---|---|---|
nodB | Chitooligosaccharide deacetylase | −1.27a | ||
nodD2 | nod-box-dependent transcription activator | −1.06 | ||
nodD3 | nod-box-dependent transcription activator | 1.16 | ||
nodE | β-ketoacyl-ACP synthase | −1.27 | ||
nodJ | ABC transporter, permease | 1.47 | ||
nodL | Nod factor acetyltransferase | −1.02 | ||
nodn | Dehydratase | 1.18 | −1.17 | |
noeB | Host specific nodulation protein | −1.91 | ||
cysElacA/lpxA/nodL | Putative acetyltransferase | −2.65 | ||
putative nodU | Probable carbamoyltransferase | 1.30 |
- a log2 ratio of the intensities (M-value). M-value ≤−1, divergent gene; M-value ≥1, putatively duplicated gene.
Concluding remarks
Both the physical location and COG category assignment of the divergent genes implied that the divergence was important in shaping the adaptability of the strains. Even though the microarray method successfully revealed the differences between the model strain Rm1021 and the potential inoculant strains, more knowledge on the genetic determinants of effective nodulation and stress tolerance is needed before genetic screening can replace traditional nodulation tests.
Conflict of interest
The authors have declared no conflict of interest.