Volume 18, Issue 1 pp. 137-146
NOTE
Full Access

On bootstrap validity for specification tests with weak instruments

Firmin Doko Tchatoka

Firmin Doko Tchatoka

School of Economics, University of Adelaide, 10 Pulteney Street, Adelaide, SA, 5005 Australia

Search for more papers by this author
First published: 04 December 2014
Citations: 11

Summary

We study the asymptotic validity of the bootstrap for Durbin–Wu–Hausman tests of exogeneity, with or without identification. We provide an analysis of the limiting distributions of the proposed bootstrap statistics under both the null hypothesis of exogeneity (size) and the alternative hypothesis of endogeneity (power). We show that when identification is strong, the bootstrap provides a high-order approximation of the null limiting distributions of the statistics and is consistent under the alternative hypothesis if the endogeneity parameter is fixed. However, the bootstrap only provides a first-order approximation when instruments are weak. Moreover, we provide the necessary and sufficient condition under which the proposed bootstrap tests exhibit power under (fixed) endogeneity and weak instruments. The latter condition may still hold over a wide range of cases as long as at least one instrument is relevant. Nevertheless, all bootstrap tests have low power when all instruments are irrelevant, a case of little interest in empirical work. We present a Monte Carlo experiment that confirms our theoretical findings.

1. INTRODUCTION

Exogeneity tests of the type proposed by Durbin (1954), Wu (1973) and Hausman (1978), henceforth DWH tests, are widely used in applied work to determine whether the ordinary least-squares (OLS) or the instrumental variable (IV) method is appropriate. There is now a considerable body of research on this topic, and most studies often impose identifying assumptions on model coefficients, thus leaving out issues associated with weak instruments. It is well known that IV estimators can be imprecise and that inference procedures (such as tests and confidence sets) can be highly unreliable in the presence of weak instruments. In recent years, concerns have been raised about the reliability of DWH procedures in the presence of weak instruments because they mainly rely on IV estimators; see Staiger and Stock (1997), Guggenberger (2010), Hahn et al. (2010), Doko Tchatoka and Dufour (2011) and Kiviet and Niemczyk (2007), among others.

Staiger and Stock (1997) show that the limiting distributions of Hausman (1978) type statistics depend on the concentration matrix, which usually determines the strength of the identification. Doko Tchatoka and Dufour (2011) show that all DWH exogeneity statistics, including Wu (1973) T3 and alternative Hausman (1978) type statistics, are identification-robust even in a finite sample with or without Gaussian errors. However, applying the usual χ2 critical values to the T3 and Hausman (1978) statistic can lead to overly conservative procedures when identification is weak. Size correction can be achieved by resorting to the method of exact Monte Carlo tests, such as in Dufour (2006). However, the exact Monte Carlo method requires that the conditional distribution of the structural disturbance, given the instruments, be specified. In practice, researchers usually do not know the distribution of the errors even conditionally on available instruments. So, implementing the exact Monte Carlo tests can be difficult.

In this paper, we examine whether a distribution-free method, such as a bootstrap method, can improve the properties of the DWH statistics, especially when identification is not very strong. To be more specific, we exploit the score interpretation of these statistics (see Engle, 1982, and Smith, 1983) to suggest a bootstrap method similar to those of Moreira et al. (2009) for the score test of the null hypothesis in the structural parameters. Our results provide some new insights and extensions of earlier studies.

We show that when identification is strong, the bootstrap method offers a high-order approximation of the null limiting distributions of the DWH statistics. Furthermore, the bootstrap test consistency holds under fixed alternative hypotheses (i.e. when endogeneity is present and does not depend on the sample size). However, the bootstrap only provides a first-order approximation when identification is weak. Moreover, we provide the necessary and sufficient condition under which the proposed bootstrap tests exhibit power under fixed endogeneity and weak instruments. The latter condition may still hold over a wide range of cases, provided that at least one instrument is not irrelevant. However, all the proposed bootstrap DWH tests have low power when all instruments are irrelevant or close to irrelevant.

This paper is organized as follows. In Section 2, the model and assumptions are formulated, and the studied statistics are presented. In Section 3, the proposed bootstrap method is discussed and the limiting distributions of the corresponding DWH statistics are characterized. In Section 4, the Monte Carlo experiment is presented, and the auxiliary theorem and proofs are provided in the Appendix. Throughout the paper, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0001 stands for the identity matrix of order q. For any full-column rank urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0002 matrix A, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0003 is the projection matrix on the space of A, and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0004. The notation vec(A) is the urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0005 dimensional column vectorization of A. urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0006 for a squared matrix B means that B is positive definite. Convergence almost surely is symbolized by urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0007, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0008 stands for convergence in probability, while urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0009 means convergence in distribution. The usual orders of magnitude are denoted by urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0010 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0011 O(1) and o(1). urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0012 denotes the usual Euclidian or Frobenius norm for a matrix U, while rank(U) is the rank of U. For any set urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0013, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0014 is the boundary of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0015 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0016 is the ε-neighbourhood of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0017. Finally, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0018 is the supremum norm on the space of bounded continuous real functions, with topological space Ω.

2. FRAMEWORK

We consider a standard linear structural model described by the following equations,
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0019(2.1)
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0020(2.2)
where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0021 is a vector of observations on a dependent variable, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0022 is a vector of observations on a (possibly) endogenous explanatory variable, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0023 is a matrix of observations on strictly exogenous variables, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0024 is a matrix of excluded exogenous variables from 2.1 (instruments), urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0025 is a vector of structural disturbances, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0026 is a vector of reduced-form disturbances, β and γ are unknown fixed coefficients (structural parameters) and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0027 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0028 are vectors of reduced-form coefficients.
Let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0029, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0030 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0031 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0032, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0033 [urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0034 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0035] are elements of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0036. The sample mean of the first n observations of any random variable X is denoted by urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0037. We suppose that Z has full-column rank k with probability one and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0038. The full-column rank condition of Z ensures the existence of unique least-squares estimates in 2.2 when y2 is regressed on each column of Z. As long as Z has full-column rank with probability one and the conditional distribution of y2 given Z is absolutely continuous (with respect to the Lebesgue measure), urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0039 also has full-column rank with probability one. So, the least-squares estimates of β and γ in 2.1 are also unique. From 2.1 and 2.2, we can express the reduced forms for y1 and y2 as
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0040(2.3)
where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0041. If u and v2 have zero means and if Z has full-column rank with probability one, then the usual necessary and sufficient condition for the identification of model 2.12.2 is urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0042. If urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0043, then Z2 is irrelevant, and β and γ are completely unidentified. If π2 is close to zero, then β and γ are ill-determined by the data, a situation often called weak identification in the literature; see Dufour (2003).
We make the following generic assumptions on the model variables, where
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0044(2.4)

Assumption 2.1.urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0045 for some urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0046 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0047.

Assumption 2.2.For sample size urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0048, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0049 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0050, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0051 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0052, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0053, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0054 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0055.

Assumption 2.1 is similar to Assumptions 2 and 3 of Moreira et al. (2009) with urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0056 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0057. It requires that urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0058 has second moments or greater, and that its characteristic function be bounded above by 1. In particular, the second moments of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0059 exist if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0060 for some urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0061. The bound on the characteristic function of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0062 is the commonly used Cramér condition; see Bhattacharya and Ghosh (1978). The first two convergences in Assumption 2.2 are the strong law of large numbers (SLLN) property of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0063 and Z, respectively, while the last one is the central limit theorem (CLT) property. In the remainder of the paper, we use the following additional definitions and notations:
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0064(2.5)
All the parameters and variables are defined in 2.4 and Assumption 2.2, and π02 is a constant vector such that urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0065.
Under Assumption 2.2, the exogeneity of y2 in 2.12.2 can be expressed as
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0066(2.6)
In this paper, we are concerned with the validity of the bootstrap for the DWH tests of H0 without any identifying assumptions of model 2.12.2. To investigate this, we consider six alternative versions of the DWH statistics, namely, the three statistics urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0067 (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0068, 3, 4) by Wu (1973) and three alternative Durbin–Hausman statistics (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0069, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0070, 2, 3). All statistics can be expressed in a unified way as
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0071(2.7)
where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0072 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0073 are the OLS and IV estimators of β, respectively, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0074, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0075, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0076, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0077, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0078, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0079, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0080, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0081, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0082, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0083, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0084, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0085, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0086 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0087.

Engle (1982) and Smith (1983) show that T2, T4 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0088 in 2.7 are score (LM) statistics, while T3, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0089 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0090 are quasi-Wald statistics. Staiger and Stock (1997) and Doko Tchatoka and Dufour (2011), among others, show that the quasi-Wald statistics can be overly conservative when IVs are weak. We investigate whether the bootstrap can improve the size and power of these tests, especially when identification is not very strong.

It is worth noting that the exogeneity tests in 2.7 also have their own shortcomings. Indeed, Moreira (2009, footnote 1) shows that testing urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0091 is equivalent to test urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0092 in model 2.12.2, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0093, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0094, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0095 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0096 are given in 2.3. This means that doing a pre-test on urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0097 may imply important size distortions when making inference on β using a t-type test after the pre-test. Guggenberger (2010) shows that the asymptotic size of the two-stage t-test where a DWH pre-test is used equals 1 for some choices of the parameter space. Despite this issue, the DWH tests are widely used in many empirical works. This paper studies only the asymptotic validity of the bootstrap for the DWH statistics and does not address the issues of pre-testing.

3. BOOTSTRAP VALIDITY FOR THE DURBIN–WU–HAUSMAN TESTS

We now wish to describe the proposed bootstrap method for the DWH exogeneity statistics. Let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0098 denote the first-stage OLS estimate of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0099 in 2.2 and let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0100 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0101 be the OLS estimates of β and γ from the structural equation 2.1. We adapt the bootstrap procedure by Moreira et al. (2009) to DWH exogeneity statistics as follows.
  • Step 1. From the observed data, compute urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0102 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0103 along with all other things necessary to obtain the realizations of the statistics urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0104, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0105, and the residuals from the reduced-form equation 2.3: urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0106, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0107. These residuals are then re-centred by subtracting sample means to yield urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0108.
  • Step 2. For each bootstrap sample urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0109, the data are generated following
    urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0110(3.1.)
    where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0111 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0112 are drawn independently from the joint empirical distribution of Z and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0113. The corresponding bootstrap statistics urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0114 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0115 are then computed for each bootstrap sample urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0116.
  • Step 3. The simulated bootstrap p-value of each statistic is obtained as the proportion of bootstrap statistics that are more extreme than the computed statistic from the observed data.
  • Step 4. The corresponding bootstrap test rejects exogeneity at level α if its p-value is less than α.

Although the above bootstrap steps are similar to those in Moreira et al. (2009), it is worth noting that there is a substantial difference. In contrast to Moreira et al. (2009), where the two-stage least-squares (2SLS) or the limited information maximum likelihood (LIML) estimators are suggested as the pseudo-true value of β under the bootstrap data-generating process (DGP), our algorithm uses the OLS estimator of β in 2.1. Indeed, Moreira et al. (2009) show that the validity of their bootstrap requires using an estimator urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0117 that satisfies urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0118 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0119 (i.e. urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0120 is a (strong) consistent estimator of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0121). In a linear classical setting of this paper, both the 2SLS and LIML estimators satisfy the sufficient conditions for strong consistency; see footnote 3 of Moreira et al. (2009, p. 55). The OLS estimator urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0122 is not qualified for strong consistency when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0123 (endogeneity). However, when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0124 (exogeneity), urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0125 is consistent and efficient even when IVs are weak. Based on this fact, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0126 is preferred to an alternative 2SLS or LIML estimator because the choice of the latter should imply a sizable efficiency loss under H0. Moreover, choosing urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0127 as the pseudo-true value of β when H0 is false is suggested in Horowitz (2001) to approximate the bootstrap power.

In the remainder of the paper, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0128 denotes the empirical distribution of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0129, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0130 given urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0131, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0132 is the probability under the empirical distribution function (given urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0133) and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0134 is its corresponding expectation operator. Also, let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0135 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0136 be the cumulative density function (cdf) and the probability density function (pdf) of a χ2-distributed random variable with one degree of freedom. To ease the exposition of our results, we shall deal separately with the case where identification is strong and the case where it is weak.

3.1. Strong identification

We focus here on the case where identification is strong and we study the limiting distributions of the bootstrap DWH statistics under both the null and alternative hypotheses. Let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0137 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0138 (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0139 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0140) denote the empirical cumulative distributions of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0141 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0142, respectively, evaluated at τ. Theorem 3.1 states the bootstrap validity for the DWH statistics under strong instruments.

Theorem 3.1.Suppose that Assumptions 2.1 and 2.2 are satisfied and that urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0143 is fixed. Then for some integer urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0144, we have (a) urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0145, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0146 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0147 if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0148; (b) urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0149, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0150 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0151 as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0152 if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0153 is fixed. Here, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0154, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0155 are polynomials in τ with coefficients depending on urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0156, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0157 and the moments of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0158

The proof of Theorem 3.1(a) follows the same steps as Theorem 3 in Moreira et al. (2009) and is therefore omitted. The proof of Theorem 3.1(b) follows similar steps to those of Lemma A.1(b) of the online Appendix, and thus it is omitted. Theorem 3.1(a) shows that the bootstrap estimates and the urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0159-term empirical Edgeworth expansion in Theorem A.1(a) (see the Appendix) for all statistics are asymptotically equivalent up to the urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0160 order under H0. Furthermore, the bootstrap makes an error of size urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0161 under H0, which is smaller as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0162 than both urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0163 and the error made by the first-order asymptotic approximations. The bootstrap provides a greater accuracy than the urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0164 order because all the DWH statistics are quadratic functions of symmetric pivotal statistics (see Horowitz, 2001, Chapter 52, equation (3.13)) under exogeneity (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0165) and strong identification (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0166). Theorem 3.1(b) implies that all bootstrap tests are consistent when endogeneity is fixed and model identification is strong. Note that the bootstrap DWH test consistency holds under fixed endogeneity and strong identification no matter which critical value τ is used in the bootstrap procedure. In particular, this would be the case if the bootstrap critical value or the empirical size-corrected critical value were used in the bootstrap procedure, as suggested by Horowitz (2001). Although Theorem 3.1(b) only considers fixed alternative hypotheses, there is no impediment to expanding it to local to zero alternatives of the form urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0167 (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0168). This proof is omitted in order to shorten the exposition.

3.2. Local to zero weak instruments

We now analyse the Staiger and Stock (1997) local to zero weak instruments framework. To be more specific, we assume that urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0169, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0170 is a fixed vector (possibly zero). Because of the lack of identification, an Edgeworth expansion, such as in Theorem 3.1(a), is no longer valid. Indeed, we can express urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0171, for example, as a quadratic function in urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0172, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0173 itself is a function urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0174 of the sample. However, this function urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0175 is not differentiable when IVs are weak. Nonetheless, we can prove the first-order validity of the bootstrap under both H0 and the alternative hypothesis (endogeneity).

Theorem 3.2.Suppose that Assumption 2.2 holds and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0176 (exogeneity). Let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0177, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0178 is fixed. If, for some urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0179, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0180, then urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0181, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0182, conditional on urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0183, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0184 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0185 are given in 2.5.

Theorem 3.3.Suppose that Assumption 2.2 is satisfied and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0186 is fixed (endogeneity). Let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0187, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0188 is fixed. If, for some urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0189, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0190, then the necessary and sufficient condition under which urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0191 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0192 exhibit power is that urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0193. More precisely, we have (a) urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0194; urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0195 conditional on urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0196, if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0197, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0198 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0199 are given in 2.5; (b) urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0200; urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0201, conditional on urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0202, if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0203.

The proofs of Theorems 3.2 and 3.3 follow directly from those of Lemmata A.5 and A.6 in the online Appendix, and thus are omitted. Theorem 3.2 shows that the bootstrap provides a first-order approximation of the empirical distributions of all DWH statistics. Therefore, the bootstrap DWH tests almost surely have the correct size, despite the lack of identification. Because of the score nature of T2, T4 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0204, the validity of the bootstrap for these statistics can be viewed as an extension of Moreira et al. (2009) to LM tests for exogeneity. However, the bootstrap validity for the Wald-type tests T3, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0205 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0206 is not intuitive. In general, the bootstrap often fails for Wald-type statistics when instruments are weak because their limiting distributions often involve nuisance parameters; see Moreira et al. (2009), among others. The validity of the bootstrap for the DWH statistics is mainly justified by the fact that they do not directly depend on the unidentified structural coefficient β, even when endogeneity is present; see Section 3 of Wu (1973). So, the lack of identification of β has no impact on the size of the bootstrap tests. This is not, however, the case for the power of the bootstrap tests. Indeed, Theorem 3.3(b) shows that the power of the bootstrap tests cannot exceed the nominal level if IVs are irrelevant (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0207), no matter which critical value is used in the bootstrap procedure. This is because the limiting distributions of all bootstrap statistics are the same as when H0 holds, although urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0208 is fixed. This result is intuitive because urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0209 is not identifiable if β is not identifiable (see Doko Tchatoka and Dufour, 2014), which is the case when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0210. Clearly, all values of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0211 are observationally equivalent when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0212 so that the bootstrap tests fail to discriminate between urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0213 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0214. However, the bootstrap tests exhibit power, provided that at least one instrument is not irrelevant (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0215).

4. MONTE CARLO EXPERIMENT

We use simulation to examine the finite-sample performance of the proposed bootstrap DWH tests. The DGP is described by 2.1 and 2.2 with urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0216 (no exogenous instruments in 2.1), Z2 contains urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0217 instruments, each generated i.i.d. urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0218 and independent of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0219 for all urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0220. The true value of β is set at 2 and the reduced-form coefficient π2 is chosen as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0221, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0222 is a vector of ones and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0223 characterizes the strength of the instruments. More precisely, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0224 is a design of complete non-identification (irrelevant instruments), urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0225 designs weak identification, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0226 is the set-up of moderate identification, and finally urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0227 symbolizes strong identification. The errors urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0228 are generated as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0229, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0230,
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0231
and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0232 measures the endogeneity of y2. The exogeneity of y2 is then expressed as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0233. In this experiment, we vary urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0234 in urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0235, but the results do not change qualitatively for alternative values of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0236. The rejection frequencies are computed using 1000 replications for the standard DWH tests, while those of the bootstrap tests are obtained with urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0237 replications and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0238 bootstrap pseudo-samples of size urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0239. Table 1 presents the empirical rejection frequencies of both the bootstrap and standard DWH tests. First, we observe that the empirical rejections are close to the 5% nominal level for all bootstrap tests when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0240 (exogeneity), irrespective of the value of λ (quality of the IVs). Meanwhile, only the LM versions of the standard DWH tests have correct size when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0241 (weak or moderate IVs). The quasi-Wald versions of the standard DWH tests are overly conservative under weak IVs. As expected, the rejections under exogeneity of the standard DWH tests are similar to those of the bootstrap tests and are close to the 5% nominal level with strong identification (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0242). Second, when identification is strong and endogeneity is large (urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0243 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0244), both the bootstrap and standard tests have rejections approaching or equal to 100%. However, all tests have low power when identification is very weak. Nevertheless, even the quasi-Wald bootstrap DWH tests exhibit power with weak IVs and large endogeneity. For example, the rejections of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0245, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0246 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0247 when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0248 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0249 are around 24%, which nearly triple those of T3, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0250 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0251.
Table 1. Rejection frequencies (in %) of the standard DWH tests
Bootstrap DWH tests
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0252 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0253 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0254 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0255
Statistics↓ urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0256 0 0.05 0.1 1 0 0.05 0.1 1 0 0.05 0.1 1 0 0.05 0.1 1
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0257 6.8 17.9 73.4 100 5.2 6.6 5.8 4.5 4.6 11.2 35.7 100 6.2 24.1 82.2 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0258 5.5 21.0 71.7 100 4.4 4.4 6.6 4.4 4.1 5.9 35.5 100 4.0 24.4 77.1 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0259 6.8 17.9 73.4 100 5.2 6.6 5.8 4.5 4.6 11.2 35.7 100 6.2 24.1 82.2 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0260 5.5 21.0 71.7 100 4.4 4.4 6.6 4.4 4.1 5.9 35.5 100 4.0 24.4 77.1 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0261 5.5 21.0 71.7 100 4.4 4.4 6.6 4.4 4.1 5.9 35.5 100 4.0 24.4 77.1 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0262 6.8 17.9 73.4 100 5.2 6.6 5.8 4.5 4.6 11.2 35.7 100 6.2 24.1 82.2 100
Standard DWH test
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0263 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0264 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0265 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0266
Statistics↓ urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0267 0 0.05 0.1 1 0 0.05 0.1 1 0 0.05 0.1 1 0 0.05 0.1 1
T2 5.0 19.3 74.8 100 4.8 6.2 6.0 4.9 3.8 12.0 38.4 100 5.1 25.6 82.8 100
T3 0.2 6.7 64.3 100 0.2 0.9 3.0 4.9 0.4 2.1 26.9 100 0.4 8.0 74.4 100
T4 4.9 19.0 74.7 100 4.8 6.2 5.9 4.9 3.8 11.4 38.2 100 4.9 25.3 82.7 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0268 0.2 6.6 64.0 100 0.2 0.8 3.0 4.8 0.4 2.0 26.7 100 0.4 7.7 73.7 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0269 0.2 6.7 64.4 100 0.2 0.9 3.0 4.9 0.4 2.3 27.2 100 0.4 8.1 74.8 100
urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0270 5.0 19.0 74.7 100 4.8 6.2 5.9 4.9 3.8 11.7 38.3 100 4.9 25.3 82.7 100

ACKNOWLEDGEMENTS

We are grateful to Professor Richard J. Smith, Managing Editor of the Econometrics Journal and to two anonymous referees for their constructive comments and suggestions. We also thank Jean-Marie Dufour, Mardi Dungey, Ngoc Thien Ahn Pham and Robert Garrard for several useful comments. This project is supported by a School of Economics and Finance (University of Tasmania) research grant, and I am grateful for the support.

    Appendix A: AUXILIARY THEOREM

    Let urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0271 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0272, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0273, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0274, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0275 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0276 are given in 2.7.

    Theorem A.1.Suppose that Assumptions 2.1 and 2.2 are satisfied and that urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0277 is fixed. Then for some integer urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0278, we have (a) urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0279, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0280 urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0281 if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0282; (b) urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0283, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0284 as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0285 if urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0286 is fixed, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0287 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0288 depend on urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0289, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0290, and the moments of the distribution F of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0291.

    Proof.First, we can write urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0292 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0293 as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0294 (with urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0295 as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0296) and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0297. (a) Suppose that H0 is satisfied. We want to approximate urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0298 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0299 uniformly in τ. First, we can write both urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0300 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0301 as urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0302 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0303, where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0304 are convex sets. From Bhattacharya and Rao (1976, Corollary 3.2), we have urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0305 for some constant d and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0306. So, Theorem 1 of Bhattacharya and Ghosh (1978) holds with urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0307 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0308. By using the approximation of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0309 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0310 in Lemma A.1(a) of the online Appendix and the definition of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0311, Theorem A.1(a) follows directly from the fact that the odd terms of the quadratic expansion are even. (b) When urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0312 is fixed, the proof follows similar steps to those of Lemma A.1(b) of the online Appendix.urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0313

  1. 1 The assumption of zero means can also be replaced by another location assumption, such as zero medians.
  2. 2 See Smith (1983) for the score interpretation (equations (6) and (9)) and for the quasi-Wald one (equations (7), (8) and (10)).
  3. 3 Also, see Bickel and Freedman (1981) for further developments on the asymptotic validity of the bootstrap.
  4. 4 See equation (A.1) of the online Appendix for more details.
  5. 5 For example, when urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0314 is fixed, the limit of the derivative of H with respect to the argument of urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0315 corresponding to urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0316 (where urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0317) takes the form: urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0318. It is easy to see that this derivative is not well defined at urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0319. In addition, the limits of this derivative at urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0320 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0321, urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0322, are given by urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0323 and urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0324, respectively. These expressions are also equal to directional derivatives at urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0325 establishing that no limit exists at urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0326 for any sequence urn:x-wiley:13684221:media:ectj12042:ectj12042-math-0327 (similar to Moreira et al., 2009).
    • The full text of this article hosted at iucr.org is unavailable due to technical difficulties.