Volume 37, Issue 7 pp. 743-750

Research Article

Full Access

A Shrinkage Method for Testing the Hardy–Weinberg Equilibrium in Case-Control Studies

Yong Zang,

Yong Zang

Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA

Search for more papers by this author

Ying Yuan,

Corresponding Author

Ying Yuan

Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA

Correspondence to: Ying Yuan, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77230, USA. E-mail: [email protected]Search for more papers by this author

Yong Zang,

Yong Zang

Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA

Search for more papers by this author

Ying Yuan,

Corresponding Author

Ying Yuan

Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA

Correspondence to: Ying Yuan, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77230, USA. E-mail: [email protected]Search for more papers by this author

First published: 11 August 2013

https://doi.org/10.1002/gepi.21753

Citations: 2

Share a link

Email
Wechat
Bluesky

ABSTRACT

Testing for the Hardy–Weinberg equilibrium (HWE) is often used as an initial step for checking the quality of genotyping. When testing the HWE for case-control data, the impact of a potential genetic association between the marker and the disease must be controlled for otherwise the results may be biased. Li and Li [2008] proposed a likelihood ratio test (LRT) that accounts for this potential genetic association and it is more powerful than the commonly used control-only χ² test. However, the LRT is not efficient when the marker is independent of the disease, and also requires numerical optimization to calculate the test statistic. In this article, we propose a novel shrinkage test for assessing the HWE. The proposed shrinkage test yields higher statistical power than the LRT when the marker is independent of or weakly associated with the disease, and converges to the LRT when the marker is strongly associated with the disease. In addition, the proposed shrinkage test has a closed form and can be easily used to test the HWE for large datasets that result from genome-wide association studies. We compare the performance of the shrinkage test with existing methods using simulation studies, and apply the shrinkage test to a genome-wide association dataset for Alzheimer's disease.

Introduction

The Hardy–Weinberg equilibrium (HWE) is one of the most important properties in population genetics. More than a century ago, G. H. Hardy and W. Weinberg individually noted that for a large, self-contained, and randomly mating population, assuming a bi-allelic locus with alleles A and a and corresponding allele frequencies p and q, the genotype frequencies are p², $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0001$ and q² for genotypes $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0002$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0003$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0004$ , respectively [Hardy, 1908; Weinberg, 1908]. As genotyping errors distort the genotype distribution and thus break the HWE, testing for this equilibrium has been routinely conducted as a way of checking the quality of genotyping [Gomes et al., 1999; Hosking et al., 2004; Xu et al., 2002]. Deviation from the HWE often indicates a poor quality of genotyping.

A straightforward way to test the HWE is the χ² test, which is based on pooling samples from the cases and controls [Weir, 1996]. We refer to this test as the pooled χ² test in order to distinguish it from other versions of χ² tests described later. The pooled χ² test has high statistical power but requires the subjects under investigation to be a random sample from the target population. For case-control data, subjects are retrospectively ascertained based on their disease status. Therefore, if a candidate marker is associated with the disease, the corresponding genotypes in the case-control sample are no longer representative of the target population. Consequently, the pooled χ² test may yield misleading results and mistakenly conclude a violation of the HWE when the target population is actually within the HWE [Wittke et al., 2005]. Nevertheless, it is worth noting that if the candidate marker is independent of the disease, the genotypes of the case-control samples are a random sample from the population, and thus the pooled χ² test is a valid and efficient test for assessing the HWE.

Many methods have been proposed to test the HWE for case-control samples without making the strong assumption that the candidate marker is independent of the disease. A widely used approach is to conduct the χ² test using only the controls, while discarding the cases. Unfortunately, this control-only χ² test is (approximately) valid only when the disease prevalence is low, and therefore the controls provide a good approximation of the general population. When the disease prevalence is moderate or high, the control-only χ² test leads to inflated type I errors [Li and Li, 2008]. In addition, because of discarding the information obtained from cases, the control-only χ² test is not efficient. To address these issues, Li and Li [2008] developed a likelihood ratio test (LRT) to assess the HWE using data from both cases and controls, while taking into account the potential association between the marker and the disease. Compared to the control-only χ² test, the LRT is more powerful for detecting departures from the HWE for common diseases and has comparable power for use in analyzing rare diseases. Yu et al. [2009] proposed a similar test based on the likelihood ratio framework. Wang and Shete [2010] developed a bootstrapping test that also accounts for the underlying genetic association. Because the performances of these tests are quite comparable, herein we focus on the LRT.

Although the LRT is valid and more powerful than the control-only χ² test, it has two limitations. First, because the LRT requires the estimation of extra nuisance parameters (e.g. penetrances), it can be substantially less powerful than the pooled χ² test if the marker actually is independent of the disease. This is a concern for some genetic studies, such as a genome-wide association study (GWAS), in which most of the markers are expected to be weakly or not associated with the disease. In addition, the LRT typically requires numerical optimization to calculate the test statistic (i.e. the maximum likelihood estimates); therefore it can be time consuming to apply the LRT to large-scale case-control studies, e.g. a GWAS, in which hundreds of thousands of single nucleotide polymorphisms (SNPs) need to be tested.

In this article, we propose a novel shrinkage test to circumvent the limitations of the LRT. Toward this goal, we first propose an extension of the pooled χ² test, called the generalized χ² test, for assessing the HWE. The generalized χ² test is valid regardless of whether or not the marker is associated with the disease. Based on that characteristic, we then propose a shrinkage test statistic, which takes a form of the weighted average of the pooled χ² test statistic and the generalized χ² test statistic. When the marker is independent of the disease, the proposed shrinkage test converges to the pooled χ² test, and therefore achieves high statistical power. When the marker is associated with the disease, the proposed shrinkage test converges to the generalized χ² test, and therefore remains statistically valid. A simulation study shows that, compared to the LRT, the shrinkage test is more powerful to detect departures from the HWE when the marker is weakly or not associated with the disease, and has comparable power when the marker is strongly associated with the disease. In addition, as the proposed shrinkage test has a closed form, it is easy to calculate and is particularly suitable for testing the HWE for large case-control datasets.

The remainder of this article is organized as follows. We first briefly review the pooled χ² test, and then propose the generalized χ² test and the shrinkage test. We compare the performances of the proposed test to existing methods using simulation studies and a GWAS dataset. We conclude the article with a brief discussion.

Methods

Consider a bi-allelic candidate marker with two alleles, A and a, having frequencies p and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0005$ , respectively, where p is the minor allele frequency (MAF). Denote three genotypes by $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0006$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0007$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0008$ with genotype frequencies $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0009$ for $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0010$ When the HWE holds, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0011$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0012$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0013$ . Denote the penetrance by $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0014$ , and the disease prevalence by $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0015$ , then the genotype frequencies in the cases and controls are given by $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0016$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0017$ , respectively, for $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0018$ . Define the genetic relative risks (GRRs) as $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0019$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0020$ , to characterize the underlying genetic association. When the candidate marker is not associated with the disease, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0021$ ; otherwise, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0022$ with at least one inequality holding. When the genetic association is present, the genetic model can be used to describe the relationship between $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0023$ 's. A genetic model is called recessive (REC) if $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0024$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0025$ ; additive (ADD) if $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0026$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0027$ , and dominant (DOM) if $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0028$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0029$ [Sasieni, 1997].

Consider case-control data consisting of r cases and s controls. Let $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0030$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0031$ denote the genotype counts of G₀, G₁, and G₂ in cases and controls, respectively. Let $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0032$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0033$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0034$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0035$ . The genotype counts for the cases and controls are displayed in Table 1.

Table 1. Genotype data for the case-control study of a bi-allelic candidate marker

	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0036$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0037$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0038$	Total
Case	r₀	r₁	r₂	r
Control	s₀	s₁	s₂	s
Total	n₀	n₁	n₂	n

The Pooled χ² Test

Weir [1996] proposed a (pooled) χ² test to assess the HWE through the Hardy–Weinberg disequilibrium coefficient, defined as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0039$

Obviously, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0040$ when the HWE holds. For case-control data, when the candidate marker is not associated with the disease, genotypes in the case-control samples are a random sample from the general population. Therefore, we can pool the cases and controls together and estimate $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0041$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0042$ , and

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0043$

The pooled χ² test statistic is defined as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0044$

which asymptotically follows a χ² distribution with 1 degree of freedom under the null hypothesis that the HWE holds.

The pooled χ² test is efficient and possesses high statistical power because of the use of the pooled data from cases and controls. However, when the candidate marker is associated with the disease, the χ² test is invalid because the genotypes in the case-control sample are no longer a random sample from the general population and therefore $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0045$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0046$ , and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0047$ are biased. Under this circumstance, the control-only χ² test, which uses only control data, is often used to examine the HWE.

The Generalized χ² Test

Before we introduce the shrinkage test, we propose a new χ² test for assessing the HWE, upon which the shrinkage test will be built. For convenience, we call this new test the generalized χ² test because it makes the χ² test [Weir, 1996] generally applicable to case-control data regardless of the association between the marker and the disease. The generalized χ² test is developed based on the basis of the following key observations: for the case-control data, the minor allele frequency p can be expressed as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0048$

and the genotype frequency g₂ can be expressed as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0049$

Similar to the LRT [Li and Li, 2008], we assume that the disease prevalence k is known. This assumption is plausible because disease prevalence can often be assessed from data external to that of the case-control study. Given the case-control data shown in Table 1, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0050$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0051$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0052$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0053$ can be consistently estimated by $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0054$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0055$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0056$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0057$ , respectively. Therefore, we estimate p and g₂ by

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0058$

Then, the generalized estimate of the Hardy–Weinberg disequilibrium (HWD) coefficient is given by

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0059$

and correspondingly, the generalized χ² test statistic for the HWE is defined as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0060$

(see the Appendix for derivation of a closed form of $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0061$ ) Under the null hypothesis that the HWE holds, T_gen asymptotically follows a χ² distribution with 1 degree of freedom.

Because the development of the generalized χ² test does not require any assumptions of the association between the marker and the disease, this test is generally applicable to case-control samples. Actually, the generalized χ² test is a Wald test that is asymptotically equivalent to the LRT. Compared to the LRT, the main advantage of the generalized χ² test is its computational simplicity. The generalized χ² test statistic has a closed form and thus is more suitable to test the HWE for modern large-scale case-control studies involving millions of markers. However, like the LRT, if the marker is not associated with the disease, the generalized χ² test is less powerful than the pooled χ² test. To address this issue, we propose a shrinkage test as follows.

The Shrinkage Test

The shrinkage test is based upon the following shrinkage estimate of the HWD coefficient

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0062$

which takes a form of the weighted average of the generalized estimate of the HWD coefficient $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0063$ and pooled estimate of the HWD coefficient $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0064$ . The shrinkage factor or weight w ( $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0065$ ) controls how much $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0066$ shrinks toward $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0067$ or $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0068$ . As described later, we select w in a data-adaptive way so that $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0069$ inherits the merits from both the pooled χ² test and the generalized χ² test. Specifically, we require that when the marker is associated with the disease, w goes to 1 and thus $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0070$ converges to $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0071$ to ensure the validity of the test; and when the marker is not associated with the disease, w goes to 0 and thus $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0072$ converges to $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0073$ to achieve the high statistical power of the pooled χ² test.

Correspondingly, we define the shrinkage test statistic as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0074$

A closed form of $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0075$ can be obtained through the Delta method (see the Appendix for details). Under the null hypothesis that the HWE holds, T_shrink follows a χ² distribution with 1 degree of freedom.

We now discuss how to construct the shrinkage factor w. In order to ensure that the value of w adaptively changes with the strength of the marker-disease association, we first define a measure of the marker-disease association and then use that as a basis for constructing the shrinkage factor.

We measure the strength of the marker-disease association using the Bayes factor. The Bayes factor is the cornerstone of Bayesian hypothesis testing [Jeffreys, 1961; Kass and Raftery, 1995] and provides an evidence-based measure of the likelihood of a hypothesis being true. As the standard Bayes factor involves high-dimensional integration and is sensitive to the prior of the unknown parameters, we herein adopt a variation of the Bayes factor, called the approximate Bayes factor (ABF; Wakefield [2007]; Xu et al. [2012]). The main difference between the ABF and the standard Bayes factor is that the ABF is based on the likelihood of a test statistic, whereas the standard Bayes factor is based on the likelihood of the observed data [Johnson, 2005; Wakefield, 2007].

Letting T denote a test statistic for testing the null hypothesis $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0076$ vs. the alternative $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0077$ , and assuming that the distribution of T depends only on θ, the ABF is defined as

Because the ABF does not depend on the nuisance parameters in the model, it is easy to calculate and is also immune to the influence of the prior of the nuisance parameters. There is a close relationship between the ABF and the posterior probabilities of the hypotheses. Letting $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0079$ denote the prior odds of H₁ vs. H₀ being true, the posterior probability of H₁ being true is

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0080$ (1)

We gauge the strength of the marker-disease association using the ABF based on the trend test statistic. Given the case-control data listed in Table 1, the trend test statistic under the additive model is defined as [Sasieni, 1997]

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0081$

Let β denote the logarithm of the odds ratio of the disease penetrance and define $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0082$ (no marker-disease association) and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0083$ (having marker-disease association). Under H₀, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0084$ and under H₁, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0085$ . Following Xu et al. [2012], we assign β a normal prior distribution $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0086$ with $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0087$ . The ABF based on Z_tt is given by

The value of the ABF is between 0 and ∞. It converges to 0 in the absence of a marker-disease association and to ∞ in the presence of such an association [Johnson, 2005].

With the ABF at hand, a nature choice for the shrinkage factor w is the posterior probability of the marker being associated with the disease, i.e. $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0089$ as given in (1), because of its appealing feature that $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0090$ in the presence of a marker-disease association (as $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0091$ goes to 1) and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0092$ in the absence of such an association (as $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0093$ goes to 0). To use this shrinkage factor, we found that the conventional noninformative prior $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0094$ (i.e. there is an equal prior probability that the marker is associated or not associated with the disease) did not work well in finite samples and led to inflated type I errors (see Table 1 in the Supplementary Materials). This is because in finite samples, the value of $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0095$ can be substantially smaller than 1 under H₁ although it asymptotically converge to 1. As a result, even there is a marker-disease association, T_pool still contributes to T_shrink, which causes inflated type I errors. One way to address this issue is to assign a higher prior probability to H₁, thus a lower prior weight to T_pool, to control the influence of T_pool. Based on the simulation study, we recommend $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0096$ with an addition set-off of 0.1 for w because it consistently yields good operating characteristics in terms of both the type I error and power. That is,

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0097$

Obviously, with this finite-sample adjustment, the value of w sometimes can be larger than 1 if $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0098$ is very close to 1. In this case, we simply round w to 1 so that $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0099$ .

Simulation Studies

We carried out comprehensive simulation studies to investigate the performance of the proposed shrinkage test and compare our proposed approach to the control-only χ² test and the LRT. We first investigated the type I error rate of the different methods under the null hypothesis that the HWE holds. We assumed the MAF $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0100$ , disease prevalence $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0101$ or 0.1, GRR $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0102$ , 1.5 or 2.0, and sample size $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0103$ . We considered three genetic models, including REC, ADD, and DOM, under which the values of λ₁ were determined.

The genotype distributions among the case and control groups were determined according to p, λ₁, λ₂, and k under the constraint of the HWE. Under each simulation condition, 10,000 replicates were used to evaluate the empirical type I error rate for the tests.

To evaluate the power of the methods to detect different departures from the HWE, following Li and Li [2008], we simulated case-control data from two genotyping error models (denoted as S1 and S2) introduced by Leal [2005]. Model S1 assumes that heterozygosity may be incorrectly genotyped as homozygosity. In this case, given the MAF p and error rate δ, the observed genotype probabilities are $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0104$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0105$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0106$ . In S2, homozygosity may be incorrectly genotyped as heterozygosity with $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0107$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0108$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0109$ . The parameters setting for evaluating the power were the same as those for evaluating the type I error rate except that we considered $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0110$ , 1.25 or 1.5. We specified an error rate of $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0111$ or 0.075 and evaluated the empirical power using 10,000 replicates under a significance level 0.05.

Figure 1 shows the type I error rates of the control-only χ² test, LRT and shrinkage test under different simulation conditions. We can see that the LRT and the shrinkage test consistently controlled the type I error rates at the nominal value (5%) across all conditions. In contrast, although the control-only χ² test performed reasonably well when the disease prevalence was low $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0112$ , it led to inflated type I error rates under REC and DOM with modest disease prevalence and strong genetic association (e.g. $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0113$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0114$ ). For example, the type I error rate of the control-only χ² test was inflated up to 10.84% under REC with $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0115$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0116$ . Therefore, if we were to use the control-only χ² test to evaluate the genotyping quality for large-sample studies (e.g. GWAS), we might falsely exclude the important candidate markers from further study.

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Empirical type I error rates (%) of the control-only χ² test, LRT, and the shrinkage test based on 10,000 replicates with nominal level of 5% (dashed line).

In terms of power to detect departures from the HWE, the proposed shrinkage test outperformed the LRT and control-only χ² test, especially when the marker was weakly or not associated with the disease, as shown in Table 2. For example, when the marker is not associated with the disease, under genotyping error model S2 with disease prevalence $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0117$ , the power of the shrinkage test was about 12% and 14% higher than that of the LRT when the error rates are $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0118$ and 0.075, respectively. Such improvement stems from the fact that the shrinkage test automatically converged toward the pooled χ² test in the absence of a marker-disease association.

Table 2. Empirical power (%) of the control-only χ² test, LRT and the shrinkage test based on 10,000 replicates. The significance level is 0.05

model	λ₂	model	δ	Control	LRT	Shrinkage	Control	LRT	Shrinkage
				$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0119$			$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0120$
Genetic		Error
No	1.0	S1	0.05	34.2	34.2	43.2	35.4	41.1	48.6
association			0.075	65.7	66.0	78.4	65.1	73.2	82.4
		S2	0.05	35.0	35.8	47.8	35.6	41.8	51.4
			0.075	65.3	66.5	80.9	65.5	74.2	84.4
REC	1.25	S1	0.05	35.0	35.2	52.2	28.5	41.6	54.5
			0.075	63.8	65.7	79.3	57.6	73.5	83.7
		S2	0.05	35.7	36.0	37.0	41.2	41.5	41.6
			0.075	66.1	67.1	71.3	71.9	74.8	77.4
	1.5	S1	0.05	33.2	34.5	43.1	22.3	40.1	46.5
			0.075	63.9	65.9	71.8	50.9	74.9	78.1
		S2	0.05	36.6	36.5	34.5	49.2	43.0	41.4
			0.075	67.6	66.4	67.2	77.3	74.8	77.5
ADD	1.25	S1	0.05	34.3	34.3	38.4	34.6	41.3	43.8
			0.075	65.0	65.2	70.6	64.7	73.6	76.6
		S2	0.05	35.1	36.2	43.7	35.3	42.6	48.2
			0.075	64.6	65.9	74.6	65.1	75.2	80.3
	1.5	S1	0.05	33.9	33.9	33.8	33.7	40.6	39.9
			0.075	65.0	65.6	65.5	62.7	72.7	72.2
		S2	0.05	35.3	35.9	38.8	34.7	42.4	44.1
			0.075	66.3	66.8	70.0	65.7	75.7	77.5
DOM	1.25	S1	0.05	35.4	34.8	33.8	40.6	40.7	39.4
			0.075	66.1	65.8	66.2	69.9	72.9	72.8
		S2	0.05	34.3	36.0	43.4	29.5	41.8	47.3
			0.075	66.4	67.7	74.8	59.4	75.5	79.9
	1.5	S1	0.05	35.8	34.9	34.5	45.7	40.7	39.6
			0.075	66.1	65.3	64.6	73.6	73.2	72.8
		S2	0.05	33.5	35.2	36.9	25.5	42.3	44.0
			0.075	65.1	67.5	69.1	54.1	75.7	76.7

As the association between the marker and the disease becomes stronger (e.g. $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0121$ or 1.5), the power gain using the shrinkage test becomes smaller because the shrinkage test converges toward the generalized χ² test, which is asymptotically equivalent to the LRT. Even so, in general, we observed that the shrinkage test was slightly more powerful than the LRT in many cases, especially under the S2 genotyping error model. For instance, under the ADD and the moderate association with $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0122$ and disease prevalence $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0123$ , the shrinkage test was 7.5% and 8.7% more powerful than the LRT when the error rates were $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0124$ and 0.075 under genotyping error model S2.

The proposed shrinkage test also has a substantial edge over the LRT in terms of computing time. It took about 4 min to conduct 10,000 simulations using the LRT; whereas it took only 15 seconds using the proposed shrinkage test on a personal computer with a 3.20 GHZ Intel Core i5 CPU and 4.00 GB memory.

To further understand the behavior of the proposed shrinkage test, we investigated the relationship between the shrinkage factor w and strength of the marker-disease association (i.e. λ₂). As shown in Figure 2, across different genetic models, the value of the shrinkage factor w automatically adjusted according to the strength of the marker-disease association. When the association was weak (i.e. the value of λ₂ was close to 1), the value of the shrinkage factor w was small, thereby strongly converging the shrinkage test toward the pooled χ² test to achieve high statistical power. When λ₂ increased, the shrinkage factor w approached 1, thereby converging the shrinkage test toward the generalized χ² test to maintain the validity of the test. These results explain the underlying reason the shrinkage test yields higher power than the LRT, while also controlling the type I error rate at the nominal value, as described previously.

In this simulation studies, we focus on the single-SNP case-control studies. Actually, the proposed shrinkage test can also be used in the GWAS study where millions of SNPs are being tested. The only modification is that we should select a much stricter significance level (i.e. 0.0001) for the GWAS study due to the multiple comparison issue. In the following section, we applied the shrinkage test to a real GWAS dataset to investigate its performance in the GWAS study.

Application

We applied the proposed method to a GWAS dataset from the Genome Medicine Database of Japan (GeMDBJ) [Yoshida et al., 2003]. This dataset contains information from 763 patients diagnosed with Alzheimer's disease and 1,422 healthy volunteers from Japan. A total of 577,728 SNPs were genotyped to identify patterns of genomic variation associated with Alzheimer's disease. According to Matsui et al. [2009], the incidence rate of Alzheimer's disease was 14.6/1,000 persons in the Japanese population. To assess the genotyping quality of the data, we applied the proposed method to the SNPs. We first screened out the SNPs with estimated MAF < 0.05, and then applied the LRT and the shrinkage test to assess the HWE for the remaining 469,225 SNPs.

As shown in Table 3, across different cutoffs (of the p-value) for significance, compared to the LRT, the proposed shrinkage test identified more SNPs that had significant departure from the HWE, which suggests that the shrinkage test has higher power than the LRT. For example, with 0.05 as the significance cutoff, the LRT detected 33,006 significant SNPs, whereas the shrinkage test detected 34,952 significant SNPs. To obtain more insight into the variations in performance between the shrinkage test and the LRT, we calculated the p-values of the SNP-disease association for each of the SNPs based on the trend test (e.g. PV_assoc), and then stratified the SNPs into four groups according to PV_assoc; see Table 4. Consistent with our simulation results, compared to the statistical power of the LRT, the power gained by using the shrinkage test depended on the strength of the SNP-disease association. When the SNP-disease association was weak (i.e. PV_assoc ⩾ 0.05), the shrinkage test identified 1,896 more significant SNPs than the LRT (31,268 vs. 33,164). As the strength of the association became stronger, the difference between these two methods diminished. The shrinkage test identified 41, 8, and 1 more significant SNPs than the LRT when $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0125$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0126$ , and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0127$ , respectively. Given the fact that most candidate SNPs in the GWAS are not associated with the disease, the shrinkage test is preferred to the LRT for checking the quality of genotyping in the GWAS. In addition, the proposed shrinkage test required a substantially shorter computing time. To analyze this dataset, the LRT took more than 2.5 hr, whereas the shrinkage test took only about 10 min.

Table 3. Number of SNPs showing significant departure from HWE under different p-value cutoffs

	p-value less than
Method	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0128$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0129$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0130$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0131$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0132$
LRT	396	437	673	2,636	33,006
Shrinkage	1,411	1,669	2,149	4,605	34,952

Table 4. Number of SNPs showing significant departure from HWE, stratified by the p-value of the marker-disease association, based on the LRT and shrinkage test. The significance level for the LRT and shrinkage test is 0.05

	p-value for marker-disease association
Method	[0, 10⁻⁴)	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0133$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0134$	$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0135$	Total
LRT	3	374	1,361	31,268	33,006
Shrinkage	4	382	1,402	33,164	34,952

Conclusion

In this article, we propose a shrinkage test for assessing the HWE in case-control data. The proposed shrinkage test is more powerful than the LRT when the marker is independent of or weakly associated with the disease, and remains valid when the marker is strongly associated with the disease. Specifically, we propose a generalized χ² test that is asymptotically equivalent to the LRT but easier to calculate. Then, we develop a shrinkage test that takes the form of the weighted average of the pooled χ² test and the generalized χ² test. We construct the weight (or shrinkage factor) based on the ABF so that the weight adaptively shrinks the test toward the pooled χ² test or the generalized χ² test according to the strength of the marker-disease association. When the marker is independent of the disease, the shrinkage test converges to the pooled χ² test to achieve high statistical power; and when the marker is associated with the disease, the shrinkage test converges to the generalized χ² test to guarantee the validity of the test. In addition, the shrinkage test has a closed form and is easy to use. A simulation study and real data application show that the shrinkage test outperforms the existing methods with higher statistical power to detect departures from the HWE. The associated R code to implement the proposed shrinkage test can be downloaded from http://odin.mdacc.tmc.edu/yyuan/Software_release/HWE/simu.R

Acknowledgments

The authors thank two referees for their helpful comments and LeeAnn Chastain for her editorial assistance.

Appendix

Expressions for $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0136$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0137$

Use $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0138$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0139$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0140$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0141$ to represent $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0142$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0143$ , $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0144$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0145$ respectively and denote $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0146$ . Thus, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0147$ can be rewritten as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0148$

Define

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0149$

and the estimate of the covariance matrix of $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0150$ as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0151$

Thus, by using the Delta method, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0152$ can be expressed as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0153$

Similarly, define $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0154$ that can be considered as the estimated “disease prevalence” from the case-control sample, we have

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0155$

Denote $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0156$ , then

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0157$

Again, $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0159$ can be expressed as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0160$

and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0161$ can be obtained as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0162$

Finally, the variance estimate of $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0163$ ca be derived as

$urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0164$

Supporting Information

References

Gomes I, Collins A, Lonjou C, Thomas NS, Wilkinson J, Watson M, Morton N. 1999. Hardy-Weinberg quality control. Ann Hum Genet 63: 535–538.
10.1046/j.1469-1809.1999.6360535.x
CAS PubMed Web of Science® Google Scholar
Hardy GH. 1908. Mendelian proportions in a mixed population. Science 18: 49–50.
10.1126/science.28.706.49
Google Scholar
Hosking L, Lumsden S, Lewis K, Yeo A, McCarthy L, Bansal A, Riley J, Purvis I, Xu CF. 2004. Detection of genotyping errors by Hardy–Weinberg equilibrium testing. Eur J Hum Genet 12: 395–399.
10.1038/sj.ejhg.5201164
CAS PubMed Web of Science® Google Scholar
Jeffreys H. 1961. Theory of Probability. Oxford: Oxford University Press.
Google Scholar
Johnson VE. 2005. Bayes factors based on test statistics. J Roy Stat Soc, Ser B 67: 689–701.
10.1111/j.1467-9868.2005.00521.x
Web of Science® Google Scholar
Kass RE, Raftery AE. 1995. Bayes factors. J Am Statist Ass 90: 773–795.
10.1080/01621459.1995.10476572
Web of Science® Google Scholar
Leal S. 2005. Detection of genotyping errors and pseudo-SNPs via deviations from Hardy–Weinberg equilibrium. Genet Epidemiol 29: 204–214.
10.1002/gepi.20086
PubMed Google Scholar
Li M, Li C. 2008. Assessing departure from Hardy–Weinberg equilibrium in the presence of disease association. Genet Epidemiol 32: 589–599.
10.1002/gepi.20335
PubMed Web of Science® Google Scholar
Matsui Y, Tanizaki Y, Arima H, Yonemoto K, Doi Y, Ninomiya T, Sasaki K, Iida M, Iwaki T, Kanba S and others. 2009. Incidence and survival of dementia in a general population of Japanese elderly: the Hisayama study. J Neurol Neurosurg Psychiatry 80: 366–370.
10.1136/jnnp.2008.155481
CAS PubMed Web of Science® Google Scholar
Sasieni PD. 1997. From genotypes to genes: doubling the sample size. Biometrics 53: 1253–1261.
10.2307/2533494
CAS PubMed Web of Science® Google Scholar
Wakefield J. 2007. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am J Hum Genet 81: 208–227.
10.1086/519024
CAS PubMed Web of Science® Google Scholar
Wang J, Shete S. 2010. Using both cases and controls for testing Hardy–Weinberg proportions in a genetic association study. Hum Hered 69: 212–218.
10.1159/000289597
PubMed Web of Science® Google Scholar
Weinberg W. 1908. On the demonstration of heredity in man. In: SH Boyer, editor. Papers on Human Genetics. Englewood Cliffs, NJ: Prentice-Hall.
Google Scholar
Weir BS. 1996. Genetic Data Analysis II: Methods for Discrete Population Genetic Data. Sunderland, MA: Sinauer Associates.
Google Scholar
Wittke-Thompson JK, Pluzhnikov A, Cox NJ. 2005. Rational inference about departures from Hardy–Weinberg equilibrium. Am J Hum Genet 76: 967–986.
10.1086/430507
PubMed Web of Science® Google Scholar
Xu J, Turner A, Little J, Bleecker ER, Meyers DA. 2002. Positive results in association studies are associated with departure from Hardy–Weinberg equilibrium: hint for genotyping error? Hum Genet 111: 573–574.
10.1007/s00439-002-0819-y
PubMed Web of Science® Google Scholar
Xu J, Yuan A, Zheng G. 2012. Bayes factor based on the trend test incorporating Hardy–Weinberg disequilibrium: more powerful to detect genetic association. Ann Hum Genet 76: 301–311.
10.1111/j.1469-1809.2012.00714.x
CAS PubMed Web of Science® Google Scholar
Yoshida T, Yoshimura K. 2003. Outline of disease gene hunting approaches in the Millennium Genome Project of Japan. Proc Japan Acad 79: 34–50.
10.2183/pjab.79B.34
Web of Science® Google Scholar
Yu C, Zhang S, Zhou C, Sile S. 2009. A likelihood ratio test of population Hardy–Weinberg equilibrium for case-control studies. Genet Epidemiol 33: 275–280.
10.1002/gepi.20381
PubMed Web of Science® Google Scholar

Citing Literature

Volume37, Issue7

November 2013

Pages 743-750

A Shrinkage Method for Testing the Hardy–Weinberg Equilibrium in Case-Control Studies

ABSTRACT

Introduction

Methods

The Pooled χ² Test

The Generalized χ² Test

The Shrinkage Test

Simulation Studies

Application

Conclusion

Acknowledgments

Appendix

Expressions for $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0136$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0137$

Supporting Information

References

Citing Literature

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

A Shrinkage Method for Testing the Hardy–Weinberg Equilibrium in Case-Control Studies

ABSTRACT

Introduction

Methods

The Pooled χ2 Test

The Generalized χ2 Test

The Shrinkage Test

Simulation Studies

Application

Conclusion

Acknowledgments

Appendix

Expressions for and

Supporting Information

References

Citing Literature

Figures

References

Related

Information

The Pooled χ² Test

The Generalized χ² Test

Expressions for $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0136$ and $urn:x-wiley:07410395:media:gepi21753:gepi21753-math-0137$