Generalizations of the point-optimal panel unit root tests of Moon, Perron and Phillips (MPP) are developed to cover cases of serially correlated errors. The resulting statistics involve two modifications relative to those of MPP: (a) the error variance is replaced by the long-run variance; (b) centring of the statistic is adjusted to correct for second-order bias effects induced by the correlation between the error and lagged dependent variable.

1. Introduction

There has been much recent interest in testing for the presence of stochastic trends in large panels; see, e.g., Breitung and Pesaran (2008) and Breitung and Westerlund (2013). A prototypical model consists of a deterministic trend component $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0001$ and an (unobserved) stochastic component $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0002$ for some observable panel observations $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0003$ for individual $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0004$ in period $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0005$ satisfying

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0006$ (1.1)

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0007$ is an error term that has zero mean and is stationary over time, and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0008$ for simplicity. Dynamic panel models with incidental trend components of this type arise in many applications in microeconometrics, multicountry growth studies and international finance. Empirical interest often centres on the individual dynamics and on whether there is commonality and persistence across individuals (i.e. that the autoregressive parameters $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0009$ are all unity) or whether such commonality occurs for certain subgroups of individuals.

Moon et al. (2007, hereafter MPP) developed tests that are point optimal against a specific alternative hypothesis. MPP adopted a local-alternative set-up, specifying the autoregressive parameter as lying in a local vicinity of unity whose width narrows as the sample size increases, according to the form

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0010$ (1.2)

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0011$ is a sequence of i.i.d. random variables and κ is a parameter defining the width of the vicinity as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0012$ . The null hypothesis of interest is then

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0013$ (1.3)

with the alternative

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0014$ (1.4)

The MPP tests are point optimal in the sense of giving highest power against a specific set of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0015$ . These tests were derived under the assumption that the error term $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0016$ is independent across individual units and over time. They represent a panel extension of the work of Elliott et al. (1996) in the time-series case where the autoregressive parameter converges to unity at a rate of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0017$ , regardless of the deterministic component in the model.

Independence assumptions are not realistic in many empirical applications and in this work we extend the MPP tests by allowing for serially correlated errors $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0018$ . In their Section 6.4 (p. 436), MPP briefly mentioned this extension. Here, we provide explicit test statistics that have optimal asymptotic properties. The modified tests replace estimated variances of the errors in MPP with estimated long-run variances, and adjust centring terms. Our main purpose is to provide the form of the modified tests and to give their asymptotic properties so that they can be used in empirical work.

The paper is organized as follows. In Section 2., we show how to construct the tests, give results for cases with no fixed effects, fixed effects and incidental trends, and discuss implementation. In Section 3., we report some simulation findings. We conclude in Section 4, and in the Appendix we provide technical derivations and supporting lemmata.

2. Tests under Serial Correlation

Following MPP, in the following analysis, we consider three deterministic trend cases: (a) no individual effects (i.e. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0019$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0020$ ); (b) fixed effects (i.e. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0021$ ); (c) heterogeneous or incidental linear trends (i.e. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0022$ ). In each case, we proceed in three steps. First, we define the likelihood ratio ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0023$ ) statistic under Gaussianity, which is known to be optimal by the Neyman–Pearson lemma when the null and alternative hypotheses are simple. Then, we show that this statistic can be approximated by a simpler version with parameters that are consistently estimable. Finally, we derive the asymptotic distribution of this approximation (with appropriate recentring). In all three cases, this asymptotic distribution coincides with the one in MPP.

Our notation is similar to that of MPP. Denote by Z, D, Y, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0024$ and U the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0025$ observation matrices whose $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0026$ th elements are $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0027$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0028$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0029$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0030$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0031$ , respectively. Define the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0032$ vectors $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0033$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0034$ and set $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0035$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0036$ . Define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0037$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0038$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0039$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0040$ . Let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0041$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0042$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0043$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0044$ denote the transpose of the ith row of Z, Y, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0045$ and U, respectively. With this notation, the model has the matrix form

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0046$

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0047$ .

Define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0048$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0049$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0050$ as the variance of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0051$ , the long-run variance of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0052$ and the one-sided long-run variance of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0053$ , respectively, so that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0054$ . Let Σ, Ω and Λ be the diagonal matrices with elements $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0055$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0056$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0057$ , respectively. Define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0058$ as the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0059$ covariance matrix of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0060$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0061$ as the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0062$ covariance matrix of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0063$ . As in MPP, we assume that the errors $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0064$ are cross-section independent over i.

We assume that the localizing coefficient $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0065$ in the local alternative 1.2 is a sequence of i.i.d. random variables with bounded support.1 Let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0066$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0067$ and define the quasi-difference operator

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0068$

Set $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0069$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0070$ .

The quasi-log-likelihood function of the panel Z, which we use in defining the likelihood ratio test statistic, has the form

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0071$

for some weight matrix B.

Throughout the paper, we assume panel linear process errors with conditions similar to those in the literature (e.g. Phillips and Moon, 1999). Let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0072$ , let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0073$ be the spectral density of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0074$ , and let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0075$ .

Assumption 2.1.(a) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0076$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0077$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0078$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0079$ for some $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0080$ ; (b) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0081$ for some $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0082$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0083$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0084$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0085$ ; (c) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0086$ and for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0087$ and some constant M.

These conditions extend the serial dependence restrictions of Elliott et al. (1996) (e.g. their Condition A) to heterogeneous panels. Assumption 2.1(a) assumes that the error term $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0088$ follows a linear process that is heterogeneous across i. The higher moments are needed to ensure the large $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0089$ asymptotics of panel data that are heterogeneous across i and serially correlated over t. Under cross-sectional homoscedasticity, these moment conditions could be weakened. Assumptions 2.1(b) and 2.1(c) restrict the temporal dependence of the error term $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0090$ to be ‘weak’ uniformly across i. These restrictions exclude long memory type strong dependence. The conditions in Assumption 2.1 are quite weak and are satisfied by many parametric weak dependent processes, such as stationary and invertible ARMA processes.

While Assumption 2.1 is quite general in terms of the serial correlation that is allowed, it is restrictive in that it assumes that all cross-sectional units are independent. This assumption is not reasonable for many interesting empirical data sets, such as cross-country studies where business-cycle effects are likely to induce correlation across countries. As in MPP, we conjecture that the procedures proposed below are valid after appropriate orthogonalization is applied, for example, after the removal of common factors as in Moon and Perron (2004), Bai and Ng (2004) or Phillips and Sul (2003). Moreover, the development of optimal procedures under cross-sectional dependence is beyond the scope of the current paper.

2.1. No fixed effect: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0091$

When $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0092$ , the model becomes

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0093$

Following MPP, in this case we consider local neighbourhoods of unity that shrink at the rate of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0094$ , so that the rate coefficient $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0095$ , and one-sided alternatives in which the support of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0096$ is a bounded interval $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0097$ for some $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0098$ so that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0099$ under this alternative. In terms of the first moment of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0100$ , the hypotheses about $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0101$ are

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0102$ (2.1)

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0103$ (2.2)

Suppose that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0104$ are Gaussian so that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0105$ with known $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0106$ and the initial conditions $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0107$ are all zeros.2 By the Neyman–Pearson lemma, rejecting a small value of the log-likelihood ratio test statistic

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0108$ (2.3)

would be the uniformly most powerful test for the null $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0109$ for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0110$ against the simple alternative $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0111$ for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0112$ . When the alternative is 1.4 with 2.2, this becomes a point-optimal test.

In order to implement the optimal test statistic 2.3, we need an estimate of the entire $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0113$ covariance matrix $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0114$ . This is a huge high-dimensional covariance estimation problem in a non-parametric set-up. The following theorem provides an approximation of the likelihood ratio test statistic in 2.3 with a statistic where the unknown nuisance parameters are consistently estimable.

Theorem 2.1.Let Assumption 2.1 hold with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0115$ . Assume that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0116$ → 0 as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0117$ . Then, for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0118$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0119$

Note that the approximate likelihood ratio statistic

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0120$ (2.4)

in Theorem 2.1 employs the Gaussian log-likelihood based on the long-run variance $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0121$ with an adjustment of the one-sided long run variance $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0122$ . The one-sided long-run drift correction appears to be a result of the correlation between the stationary error $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0123$ and the lagged dependent variable $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0124$ . The main advantage of this formulation is that it involves quantities (Ω and Λ) that can be estimated consistently.

The test statistic we propose is to use the approximated log-likelihood ratio 2.4 with appropriate centring. Define

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0125$

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0126$ is the sum vector and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0127$ .

Theorem 2.2.Let Assumption 2.1 hold and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0128$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0129$ . Then, under the local alternative $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0130$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0131$

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0132$

Remark 2.1.We can interpret the test statistic $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0133$ as an asymptotic version of the point-optimal test for panel unit roots with possible serial correlation of unknown form in the error term.

Remark 2.2.Compared to the corresponding statistic in MPP, which makes no allowance for serial correlation, there are two differences in $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0134$ . First, as discussed in MPP, we use the long-run covariance matrix $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0135$ instead of the variance matrix $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0136$ as the weight matrix. In addition, we recentre the statistic by subtracting the term $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0137$ , which corrects for the correlation between the stationary error $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0138$ and the lagged dependent variable $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0139$ . This term is not required for the test under temporal independence.

Remark 2.3.The limit distribution of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0140$ is the same limit as in MPP (see their Theorem 6).

2.2. Time-invariant fixed effects: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0141$

In this section, we consider the case where the incidental trends $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0142$ are fixed over time. This corresponds to the standard fixed effects model. In this case, the model has matrix form

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0143$

As before, suppose that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0144$ with known $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0145$ and the initial conditions $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0146$ are all zeros. Then, rejecting a small value of the test statistic,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0147$ (2.5)

for the null $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0148$ for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0149$ and the alternative $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0150$ for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0151$ is known as the uniformly most powerful invariant test that is invariant with respect to the transformation $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0152$ for arbitrary $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0153$ . Against the alternative in 1.4, this becomes a point-optimal invariant test (e.g. Dufour and King, 1991).

As mentioned in the previous section, this statistic is difficult to implement because of the presence of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0154$ , the full $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0155$ covariance matrix of the error. This again motivates the use of an approximation.

Theorem 2.3.Let Assumption 2.1 hold with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0156$ and let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0157$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0158$ . Then, for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0159$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0160$

Remark 2.4.This approximation is derived under the stronger rate condition $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0161$ → 0 as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0162$ in place of the condition $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0163$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0164$ that is used without fixed effects.

Remark 2.5.The approximation involves the same correction for second-order bias as in the case without fixed effects.

Again, the test statistic we propose is the approximate log-likelihood ratio 2.4 with appropriate centring. Define

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0165$

Theorem 2.4.Let Assumption 2.1 hold and let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0166$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0167$ . Then, under the local alternative $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0168$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0169$

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0170$ .

This asymptotic distribution is the same as without fixed effects and as in MPP (see their Theorem 9).

2.3. Incidental trends: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0171$

Under heterogeneous linear trends, we follow MPP and use local neighbourhoods of unity that shrink at the slower rate of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0172$ , so that the rate coefficient is $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0173$ . The alternative might be two-sided; that is, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0174$ with mean $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0175$ and variance $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0176$ , with a support that is a subset of a bounded interval $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0177$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0178$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0179$ . The slower rate of shrinkage in the local neighbourhoods of unity is the result of the presence of heterogeneous trend effects in the panel. The presence of these incidental trends reduces discriminatory power in testing for the presence of common stochastic trends, so wider localizing intervals are needed to attain non-trivial power functions.

Under these conditions, hypotheses 1.3 and 1.4 can be re-expressed as

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0180$ (2.6)

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0181$ (2.7)

Again, suppose that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0182$ with known $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0183$ and the initial conditions $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0184$ are all zeros. Then, similar to the case of time-invariant fixed effects, rejecting a small value of the test statistic,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0185$

for the null $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0186$ for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0187$ and the alternative $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0188$ for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0189$ , is known as the uniformly most powerful invariant test (with respect to the linear transformation $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0190$ for arbitrary $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0191$ ), and against the alternative in 1.4, it becomes a point-optimal invariant test. As before, we start by proving the validity of an approximation to this log-likelihood ratio.

Theorem 2.5.Let Assumption 2.1 hold with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0192$ and let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0193$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0194$ . Then, for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0195$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0196$

Remark 2.6.This approximation is derived under the condition $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0197$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0198$ , which is a stronger rate condition than that used for the intercepts case.

Remark 2.7.As before, the correction is a result of the presence of a second-order bias term arising from the correlation between the lagged dependent variables and the error term.

Again, we propose to use the approximate log-likelihood ratio with appropriate centring as a test statistic. Define

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0199$

where

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0200$

Theorem 2.6.Let Assumption 2.1 hold and let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0201$ as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0202$ . Then, under the local alternative $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0203$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0204$

As before, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0205$ reduces to the statistic from MPP when there is no serial correlation, and it has the same asymptotic distribution as in Theorem 13 of MPP.

2.4. Implementation of the tests

The test statistics $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0206$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0207$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0208$ depend on unknown parameters $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0209$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0210$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0211$ . Let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0212$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0213$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0214$ be consistent estimators of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0215$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0216$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0217$ , respectively. Similarly define the diagonal matrices of these elements as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0218$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0219$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0220$ . To implement these tests, we can replace Σ, Ω and Λ in $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0221$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0222$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0223$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0224$ $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0225$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0226$ and we denote the test statistics as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0227$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0228$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0229$ . We assume the following regarding these estimators.

Assumption 2.2.Under the local alternative, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0230$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0231$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0232$ .

Remark 2.8.An example of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0233$ that satisfies Assumption 2.2 is the time-series sample variance of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0234$

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0235$

Remark 2.9.When kernel spectral density estimation is used for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0236$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0237$ with bandwidth h, Assumption 2.2 is satisfied if: (a) the kernel function $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0238$ is continuous at zero and all but a finite number of other points, satisfying $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0239$ $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0240$ $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0241$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0242$ for some $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0243$ where parameter m is defined in Assumption 2.1(b); (b) the bandwidth h satisfies

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0244$ (2.8)

as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0245$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0246$ ; see, e.g., Moon and Perron (2004). If $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0247$ for some $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0248$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0249$ , then the bandwidth condition 2.8 is satisfied if

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0250$

that is,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0251$

Theorem 2.7.Under Assumptions 2.1 and 2.2, as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0252$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0253$ we have $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0254$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0255$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0256$ under the local alternative.

Implementation of the tests also requires the choice of an alternative to define the likelihood ratio, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0257$ . MPP have shown that the optimal choice of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0258$ is $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0259$ . With this choice, the likelihood ratio statistics above attain the power envelope. However, this choice is infeasible because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0260$ are the parameters under test. MPP proposed the assumption of a common $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0261$ for all cross-sectional units, and they called this test a common point-optimal (CPO) test. With this choice, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0262$ , and we can deduce from Theorems 2.2, 2.4 and 2.6 that under the null hypothesis,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0263$

while under the alternative hypothesis,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0264$

The surprising result here is that neither distribution depends on the choice of c used to construct the test. This feature implies that the power is the same for all choices of c asymptotically, although that choice matters in finite samples. Based on the simulation evidence provided in MPP, we set $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0265$ in the simulation below. Of course, this choice of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0266$ is not optimal unless the alternative hypothesis is homogeneous ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0267$ for all i) and results in a power loss relative to the power envelope.

3. Monte Carlo simulations

In this section, we report the results of a Monte Carlo experiment designed to assess the finite-sample properties of the tests presented above and compare them with other existing results. For this purpose, we use the same DGP as MPP but employ either an AR(1) or MA(1) process for the innovations. Thus, the generating model has the following form,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0268$

where the innovations follow either an AR(1) process

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0269$

or an MA(1) process

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0270$

We have also looked at some ARMA(1,1) cases but we do not report those results in order to ease presentation (these results are available from the authors upon request). We consider five specifications for serial correlation: white noise ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0271$ ), positive AR ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0272$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0273$ ), negative AR ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0274$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0275$ ), positive MA ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0276$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0277$ ) and negative MA ( $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0278$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0279$ ).

In all cases, we allow for heterogeneity and draw the idiosyncratic variance $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0280$ from a uniform distribution, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0281$ . This variance is scaled such that the scale of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0282$ is the same for all cases. In both the incidental intercepts case $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0283$ and the incidental trends case $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0284$ , the parameters are drawn from $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0285$ .

We focus the study on the size and size-adjusted power of the CPO test with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0286$ for all i, because MPP advocated that choice. For size calculations, we set $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0287$ for all i, which corresponds to $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0288$ for all i in our local-to-unity framework. For power, we draw $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0289$ from a uniform distribution between 0 and 8, as in one of the experiments of MPP. This specification ensures that power should be roughly constant as N and T increase.

We draw comparisons with two existing tests, those of Levin et al. (2002), hereafter LLC, and Im et al. (2003), hereafter IPS. We take three values for n (10, 25 and 100) and two values of T (100 and 250). All tests are conducted at the 5% significance level, and the number of replications is set at 2,000.

Estimation of the long-run variance and one-sided long-run variance is critical to the performance of the CPO test. We estimate these quantities in two ways based on demeaned first differences, as in Remark 2.8. The first method is a non-parametric estimator with quadratic spectral kernel and bandwidth selected in a data-based manner using the Andrews (1991) rule with no pre-whitening (PW). The second method uses pre-whitening where the appropriate model is chosen using BIC among the AR(1), MA(1), ARMA(1,1) and constant-only models. In the case of the LLC test, we follow the recommendation of Levin et al. (2002) and use a Bartlett kernel with a bandwidth equal to $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0290$ . Westerlund (2009) has shown that this choice gives the LLC test higher power than selecting the bandwidth in a data-dependent way. Similarly, for the IPS and LLC tests, the choice of lag augmentation is critical for performance. We choose this in a data-dependent way by BIC with a maximum of six lags. For both of these tests, we use the finite-sample adjustments provided in the original papers.

The size results are reported in Tables 1 and 3 for the incidental intercepts and trends cases respectively, while size-adjusted power is in Tables 2 and 4. For each of the five serial correlation specifications, each row corresponds to a different test.

Table 1. Size of tests: incidental intercepts

		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0291$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0292$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0293$
		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0294$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0295$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0296$
		100	250	500	100	250	500	100	250	500
White noise	This paper, no PW	1.6	2.2	2.6	2.8	3.2	3.6	4.2	3.7	5.2
	This paper, PW	2.4	2.4	2.5	3.0	3.8	4.5	0.6	2.5	4.4
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0297$	MPP (2007)	2.9	2.4	2.5	4.8	3.7	4.4	4.3	3.6	4.7
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0298$	IPS	6.4	3.7	4.9	7.0	5.2	5.2	8.4	6.9	5.1
	LLC	5.8	4.4	4.1	5.8	5.3	3.4	6.0	5.0	3.5
Positive AR	This paper, no PW	1.7	1.8	2.9	2.4	2.8	3.3	4.5	4.5	4.9
	This paper, PW	2.3	2.1	3.0	1.7	2.3	3.0	0.7	0.4	2.4
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0299$	MPP (2007)	0.3	0.5	0.8	0.1	0.1	0.1	0.0	0.0	0.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0300$	IPS	5.0	4.4	3.6	4.7	3.9	3.5	3.1	3.1	3.5
	LLC	5.0	4.2	4.0	4.3	4.3	3.7	2.6	2.8	3.9
Negative AR	This paper, no PW	1.7	2.1	2.2	2.5	3.2	4.0	5.4	4.2	5.2
	This paper, PW	2.0	2.5	2.1	1.2	2.6	3.3	0.3	1.5	2.2
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0301$	MPP (2007)	12.7	13.2	13.6	30.7	30.3	32.4	82.2	81.6	83.9
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0302$	IPS	12.8	7.3	5.4	20.4	9.4	7.2	42.6	15.3	9.3
	LLC	9.0	5.8	4.2	12.0	7.0	4.0	20.8	8.5	4.3
Positive MA	This paper, no PW	1.4	1.7	2.6	2.1	3.0	3.7	4.3	4.5	4.8
	This paper, PW	2.5	2.6	3.0	2.7	3.2	4.1	1.6	3.5	4.5
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0303$	MPP (2007)	0.7	0.7	0.6	0.3	0.2	0.6	0.0	0.0	0.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0304$	IPS	6.6	6.1	5.7	6.8	6.4	6.0	7.1	7.4	8.4
	LLC	6.1	5.9	4.7	5.4	6.0	4.6	5.2	7.2	5.6
Negative MA	This paper, no PW	1.3	2.0	2.6	2.3	3.5	3.6	4.6	4.5	4.3
	This paper, PW	1.5	2.4	3.0	0.9	3.1	4.0	0.0	1.6	3.5
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0305$	MPP (2007)	17.0	16.7	17.6	40.7	44.1	40.7	93.0	91.8	93.1
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0306$	IPS	25.0	13.8	10.2	40.4	22.6	14.3	85.0	50.0	32.9
	LLC	15.6	9.4	6.9	22.2	12.3	6.5	56.6	25.9	14.4

Note

The table reports the rejection frequency (in %) of a 5% test for a panel unit root. “This paper, no PW” refers to the common point optimal (CPO) tests proposed in this paper with no pre-whitening used when estimating long-run variances and c = 1. “This paper, PW” refers to the CPO tests in this paper with pre-whitening when estimating long-run variances. “MPP (2007)” refers to the CPO tests with c = 1 in MPP that do not allow for serial correlation. “IPS” is the t-bar test of Im et al. (2003) and “LLC” is the test of Levin et al. (2002).

Table 2. Size-adjusted power of tests - Incidental intercepts

		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0307$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0308$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0309$
		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0310$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0311$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0312$
		100	250	500	100	250	500	100	250	500
White noise	This paper, no PW	41.6	49.3	52.9	53.0	62.6	62.8	66.4	75.2	71.7
	This paper, PW	43.2	54.8	52.5	45.1	62.2	62.6	40.5	73.0	73.4
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0313$	MPP (2007)	50.5	55.1	54.1	59.4	66.4	62.5	76.7	78.7	74.6
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0314$	IPS	13.2	18.1	12.9	16.8	17.5	16.9	21.6	18.6	21.3
	LLC	3.7	1.9	1.1	5.3	3.3	2.8	5.7	3.8	3.1
Positive AR	This paper, no PW	42.9	51.8	45.3	54.8	62.8	63.6	63.7	69.5	73.4
	This paper, PW	48.8	53.1	48.0	59.5	64.1	62.2	63.5	73.5	74.8
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0315$	MPP (2007)	52.0	54.5	45.4	62.4	60.7	63.9	72.2	73.1	73.7
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0316$	IPS	12.0	13.0	16.1	12.4	17.2	14.8	21.3	19.5	20.7
	LLC	3.1	2.3	1.8	3.5	2.7	1.6	6.0	5.2	3.5
Negative AR	This paper, no PW	42.8	48.3	51.3	53.2	60.3	61.1	63.3	72.0	72.4
	This paper, PW	45.0	52.5	51.4	47.2	59.8	61.0	40.0	69.6	76.3
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0317$	MPP (2007)	48.4	47.8	50.5	62.9	59.6	60.6	70.1	71.5	74.7
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0318$	IPS	13.0	14.6	14.4	19.7	16.1	15.8	22.0	20.0	19.0
	LLC	5.2	2.6	1.3	6.1	2.3	2.4	7.9	4.6	3.2
Positive MA	This paper, no PW	44.2	48.0	51.3	52.8	63.2	59.8	63.6	72.8	73.0
	This paper, PW	47.6	51.8	53.5	59.0	64.5	61.2	58.4	74.7	74.8
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0319$	MPP (2007)	52.2	54.0	50.9	61.3	66.4	64.2	72.3	75.8	75.2
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0320$	IPS	10.4	12.7	12.1	14.3	16.4	17.4	17.9	20.8	18.7
	LLC	2.6	1.9	1.1	4.6	2.7	1.9	5.5	4.0	3.4
Negative MA	This paper, no PW	41.4	49.4	47.5	54.8	55.5	59.9	65.9	72.7	77.9
	This paper, PW	46.7	51.8	48.7	43.0	57.6	62.2	32.6	68.1	77.8
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0321$	MPP (2007)	48.6	48.7	46.3	57.9	57.2	58.4	68.8	70.0	72.1
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0322$	IPS	15.0	16.7	14.2	21.2	15.3	18.8	23.9	26.7	22.5
	LLC	8.1	2.6	1.6	8.5	4.3	3.7	11.8	6.7	3.6

Table 3. Size of tests: incidental trends

		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0323$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0324$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0325$
		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0326$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0327$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0328$
		100	250	500	100	250	500	100	250	500
White noise	This paper, no PW	0.0	0.1	0.5	0.0	0.6	1.5	0.0	0.5	0.9
	This paper, PW	1.0	1.0	1.4	1.8	2.3	2.7	1.4	4.1	4.5
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0329$	MPP (2007)	1.7	1.3	1.4	4.6	2.5	2.9	7.2	5.1	4.4
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0330$	IPS	8.2	5.5	5.2	10.6	5.6	4.4	14.9	7.5	6.4
	LLC	6.4	4.5	2.1	5.8	4.3	1.1	4.8	4.4	0.5
Positive AR	This paper, no PW	0.0	0.0	0.5	0.0	0.4	1.5	0.0	0.2	1.2
	This paper, PW	0.8	0.4	1.0	1.6	2.2	2.2	2.1	2.1	2.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0331$	MPP (2007)	0.0	0.0	0.1	0.0	0.0	0.0	0.0	0.0	0.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0332$	IPS	5.3	3.5	4.4	4.2	3.0	4.0	3.8	2.8	3.3
	LLC	3.7	2.7	1.7	2.0	2.6	0.9	0.7	1.2	0.2
Negative AR	This paper, no PW	0.0	0.1	0.5	0.0	0.2	1.0	0.0	0.3	1.3
	This paper, PW	0.4	0.6	0.7	0.5	1.2	1.9	0.2	1.7	2.1
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0333$	MPP (2007)	22.2	19.8	20.3	61.7	58.4	56.8	99.3	99.3	99.1
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0334$	IPS	25.3	10.7	7.1	40.6	15.6	8.9	84.1	31.7	13.7
	LLC	16.9	7.6	2.8	24.5	9.4	1.6	53.8	14.5	0.5
Positive MA	This paper, no PW	0.0	0.2	0.5	0.0	0.6	1.2	0.0	0.2	1.8
	This paper, PW	1.0	1.3	1.2	3.7	3.4	2.6	6.5	6.5	5.3
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0335$	MPP (2007)	0.1	0.1	0.1	0.1	0.1	0.0	0.0	0.0	0.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0336$	IPS	8.1	6.6	6.0	10.7	7.2	6.7	12.4	11.4	8.4
	LLC	5.5	5.5	1.7	5.6	5.9	1.8	2.6	7.0	0.6
Negative MA	This paper, no PW	0.0	0.1	0.4	0.0	0.3	1.1	0.0	0.4	0.9
	This paper, PW	0.1	0.7	0.9	0.4	2.4	2.9	0.2	4.0	3.9
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0337$	MPP (2007)	31.7	29.1	29.7	77.1	74.8	73.3	100.0	100.0	100.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0338$	IPS	48.2	23.5	15.3	77.6	42.2	26.6	99.9	86.0	63.0
	LLC	38.1	17.7	6.1	61.1	30.1	7.0	96.6	67.8	12.2

Table 4. Size-adjusted power of tests: incidental intercepts

		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0339$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0340$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0341$
		$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0342$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0343$			$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0344$
		100	250	500	100	250	500	100	250	500
White noise	This paper, no PW	10.8	17.0	15.8	12.9	17.3	17.2	17.5	23.2	26.1
	This paper, PW	15.6	17.6	16.2	14.2	20.5	19.3	19.9	27.9	28.2
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0345$	MPP (2007)	18.6	19.1	15.6	17.7	20.9	19.4	28.1	27.5	28.9
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0346$	IPS	10.5	10.1	10.1	9.8	12.8	12.7	13.9	14.0	11.9
	LLC	8.5	6.8	4.9	9.2	7.9	5.1	11.6	8.4	5.8
Positive AR	This paper, no PW	11.0	15.6	16.4	10.9	18.1	19.5	18.1	26.0	27.5
	This paper, PW	17.1	18.7	16.5	18.8	22.3	19.8	26.2	29.9	29.5
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0347$	MPP (2007)	17.7	16.8	15.4	18.7	20.3	19.3	27.6	27.7	28.6
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0348$	IPS	8.5	9.9	8.7	11.1	13.5	11.3	12.1	12.6	13.3
	LLC	6.2	6.6	3.9	8.5	7.0	6.0	8.6	5.6	5.0
Negative AR	This paper, no PW	10.7	15.1	15.2	14.1	16.8	18.0	15.8	23.3	23.8
	This paper, PW	13.9	16.6	16.9	16.7	20.1	19.0	20.1	28.6	25.9
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0349$	MPP (2007)	16.5	16.6	16.3	21.8	17.5	19.2	26.5	24.3	27.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0350$	IPS	12.0	9.6	10.4	11.2	9.9	10.9	11.9	10.9	13.8
	LLC	10.7	7.3	4.6	10.7	6.1	5.5	11.0	7.5	6.8
Positive MA	This paper, no PW	12.0	14.6	18.4	11.9	18.0	22.3	19.3	25.8	30.0
	This paper, PW	20.2	17.9	18.3	18.6	21.2	23.8	25.8	30.4	31.0
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0351$	MPP (2007)	19.0	16.2	19.3	19.4	20.7	24.4	32.1	28.1	30.3
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0352$	IPS	9.1	11.2	10.3	9.9	11.2	11.0	14.0	12.6	16.0
	LLC	8.3	7.7	6.0	7.8	8.0	5.1	11.5	7.0	6.9
Negative MA	This paper, no PW	10.7	16.3	18.0	13.0	19.3	22.0	15.0	23.2	29.6
	This paper, PW	14.3	16.7	17.5	11.8	18.7	20.1	12.2	23.7	29.4
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0353$	MPP (2007)	14.0	14.6	16.3	18.1	18.3	21.2	24.1	23.1	27.7
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0354$	IPS	9.9	9.4	10.2	11.2	12.2	11.7	15.5	12.6	14.2
	LLC	11.1	6.7	5.2	11.5	8.3	5.5	12.7	7.6	6.9

In general, we see that our CPO test is conservative. This is especially true in the incidental trends case. The test is better behaved without pre-whitening in estimating the long-run variance in the incidental intercepts case, but the reverse is true in the trends case. It is evident that the original MPP test is not robust to the presence of serial correlation with size that can be as low as 0 or as high as 1, and that the extensions proposed here are therefore needed.

Table 2 reports results for size-adjusted power in the intercept case. Size adjustment is needed, given some of the large size distortions reported in Table 1. We see that the size-adjusted power of the CPO tests robust to serial correlation is typically lower than that of uncorrected tests, but the difference becomes smaller as N and T increase, as predicted by theory because the tests have the same asymptotic distribution. Also, we see that these tests have much higher size-adjusted power than either the LLC or IPS tests. The LLC test has poor power because it corrects for bias by adjusting the numerator of the pooled OLS estimator, as pointed out by Moon and Perron (2008) and Breitung and Westerlund (2013).3

Table 4 presents size-adjusted power for the incidental trends case. Note that the alternative considered in this scenario is further from the unit root null than in Table 2 because of the different definition of local neighbourhoods. While the CPO tests have lower power in this case, the same conclusions remain as in the intercept case.

4. Conclusion

In this paper we develop generalizations of the point-optimal panel unit root tests of MPP to cover the case where the error term is serially correlated. The resulting statistics have two simple modifications relative to those in MPP. First, the variance of the errors is replaced by the long-run variance. Second, the centring of the statistic is adjusted to accommodate the second-order bias induced by the correlation between the error and lagged values of the dependent variable. Simulations show that these two adjustments lead to appropriately sized tests in most cases.

Acknowledgements

We thank Vanessa Smith for raising with us questions about the performance of the original point-optimal statistics in MPP when there are serial correlated errors and about the need for possibly different correction factors in that case. B. Perron acknowledges financial support from the SSHRC and FQRSC. P. C. B. Phillips acknowledges partial support from the National Science Foundation under Grant Nos. SES-0956687 and SES-1258258.

1 As mentioned in MPP, the assumption of a bounded support for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0633$ is made for convenience, and could be relaxed at the cost of stronger moment conditions. It is also convenient to assume that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0634$ are identically distributed, and this assumption could be relaxed as long as cross-sectional averages of the moments $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0635$ have well-defined limits.

2 This assumption of a zero initial value is strong. The treatment of initial values in panel unit root tests is still an open problem. In MPP, we showed that the assumption that the initial observation is drawn from the unconditional distribution cannot be easily extended to the panel case because the resulting test statistic diverges to infinity with probability 1.

3 Note that Moon and Perron (2008) incorrectly labelled their $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0636$ statistic as equivalent to the LLC statistic, while they should have labelled their $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0637$ statistic (which has lower power) as equivalent to the LLC statistic.

APPENDIX A:: PROOFS OF THE APPROXIMATIONS IN THEOREMS 2.1, 2.3 AND 2.5

We provide three appendices. Here, in Appendix A, we provide proofs of Theorems 2.1, 2.3 and 2.5 that approximate the Gaussian log-likelihood ratio statistic. In Appendix B, we provide sketches of the proofs of the limit distribution results in Theorems 2.2, 2.4 and 2.6. In Appendix C, we provide a heuristic proof of Theorem 2.7. We only provide sketches of the proofs in Appendices B and C because the details are similar to those of the corresponding theorems in MPP and can be established with only minor modifications. Throughout the appendices, M denotes a generic (finite) constant.

Proof of Theorem 2.1.Here, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0355$ . Let Assumption 2.1 hold and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0356$ . By definition, we can write

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0357$ (A.1)

Write

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0358$ (A.2)

where

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0359$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0360$ (A.3)

where

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0361$

In the following subsections, we show that under Assumption 2.1, as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0362$

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0363$ (A.4)

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0364$ (A.5)

Then, using (A.1)–(A.5) with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0365$ , we deduce that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0366$

as required. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0367$

Proof of Theorem 2.3.Here, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0368$ . Let Assumption 2.1 hold and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0369$ . By definition, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0370$

Using (A.2), (A.3), (A.4) and (A.5), we can approximate the first term as

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0371$

Then, the required result for the theorem follows because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0372$ (A.6)

for any $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0373$ such that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0374$ for some constant M. The proof of (A.6) is available in the following subsection. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0375$

Proof of Theorem 2.5.Here, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0376$ . Let Assumption 2.1 hold and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0377$ . By definition, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0378$

Using (A.2), (A.3), (A.4) and (A.5), we can approximate the first term as

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0379$

Also, in the following subsection, we show that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0380$ (A.7)

for any $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0381$ such that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0382$ for some constant M. Then, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0383$

Combining these expressions gives the required result:

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0384$

A.1 Supplementary results

A.1.1. A useful lemma

Before we start the proof of (A.4) and (A.5), we introduce a useful technical result. When A is a matrix, we use three different norms, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0385$ (where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0386$ denotes the maximum eigenvalue), $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0387$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0388$ (where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0389$ is the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0390$ th element of A). It is well known that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0391$

By definition, the covariance matrix of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0392$ is $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0393$ . Let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0394$ be the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0395$ matrix whose $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0396$ element $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0397$ is $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0398$ if $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0399$ and zero, if $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0400$ . Let

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0401$ (A.8)

Lemma A.1.Let Assumption 2.1 hold. Then, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0402$ for some constant M.

Proof.For the desired result, we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0403$

By definition,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0404$

where the inequality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0405$ under Assumption 2.1 and by the triangle inequality.

First, we show that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0406$ . Define

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0407$

By adding and subtracting the terms and the triangle inequality, we can bound

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0408$

where

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0409$

For term I₁, note that because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0410$

and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0411$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0412$

where the second inequality uses $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0413$ , the last inequality uses $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0414$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0415$ and the equality uses the Taylor representation of the exponential function. Then,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0416$

For term I₂, note that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0417$

where the first inequality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0418$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0419$ and the last inequality holds by the mean-value theorem and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0420$ . Then,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0421$

For term I₃, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0422$

where the last line holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0423$ . Finally, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0424$

where the second inequality holds because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0425$

By combining terms $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0426$ , we have the required result

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0427$

The proof of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0428$ follows in a similar fashion and is omitted. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0429$

Proof of (A.5).We prove the required result when $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0430$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0431$ . Because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0432$

the required result follows if we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0433$ (A.9)

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0434$ (A.10)

For A.9, we follow similar arguments used in proving $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0435$ on p. 831 (in the proof of Lemma A2) of Elliott et al. (1996), and have for some constant M

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0436$

where the second inequality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0437$ and by Lemma A.1, and the last inequality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0438$

For A.10, we also follow similar arguments to those used in proving $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0439$ on p. 831 (in the proof of Lemma A2) of Elliott et al. (1996), and have for some constant M

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0440$

Proof of (A.4).We prove the required result when $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0441$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0442$ . By replacing $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0443$ in $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0444$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0445$ , we can decompose $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0446$ as

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0447$

where

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0448$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0449$

First, similar arguments to those in the proof of (A.5) lead to

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0450$

Then, the required result follows if

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0451$

which follows if

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0452$

Note that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0453$

Using similar arguments to those used for the proof of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0454$ , we can show that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0455$

For $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0456$ , we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0457$ (A.11)

Because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0458$ , the desired result follows. Because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0459$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0460$

Because tr $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0461$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0462$

For term I, we can bound

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0463$

Then, for some constant $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0464$ , we can bound

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0465$

as required. Next,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0466$

and, by Assumption 2.1(c),

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0467$

because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0468$ , as required. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0469$

A.1.2. More preliminary results

In this section, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0470$ and let Assumption 2.1 hold. Define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0471$ to be the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0472$ matrix whose $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0473$ ^th element is $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0474$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0475$ is defined in Assumption 2.1.

Define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0476$ . Direct calculations show that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0477$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0478$ (A.12)

Define

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0479$

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0480$ is the tth element of the vector x and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0481$

Lemma A.2.(a) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0482$ for all k; (b) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0483$ for some finite constant M.

Proof.Part (a). By definition, for $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0484$

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0485$

For $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0486$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0487$

as required.

Part (b). By definition,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0488$

as required. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0489$

Lemma A.3.Suppose that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0490$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0491$ are $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0492$ vectors such that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0493$ is bounded, where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0494$ is the (t)^th element of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0495$ . Then, (a) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0496$ ; (b) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0497$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0498$ is defined in A.8.

Proof.Part (a). The proof is similar to that of Lemma A1 of Elliott et al. (1996) and is omitted.

Part (b). We replace $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0499$ in the proof of Lemma A.1 with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0500$ . Then, the required result follows if we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0501$

For Part (b1), by definition, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0502$

By Lemma A.2(b), the first term is bounded by

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0503$

as required. Under Assumption 2.1(c), the second term is bounded by

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0504$

as required.

For Part (b2), we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0505$

By Part (a), we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0506$

Using a similar argument to that used in the proof of Part (b1), we can bound the second term by

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0507$

Combining these, we have the required result for Part (b2). $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0508$

For $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0509$ , we define

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0510$

We define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0511$ to be the $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0512$ th element of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0513$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0514$ to be the kth element of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0515$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0516$ . Similarly, we define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0517$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0518$ .

Lemma A.4.Under Assumptions 2.1, the following hold: (a) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0519$ $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0520$ ; (b) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0521$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0522$ ; (c) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0523$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0524$ ; (d) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0525$ , $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0526$ ; (e) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0527$ ; (f) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0528$ ; (g) $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0529$ .

Proof.Part (a). A direct calculation shows that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0530$ . We bound $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0531$ by

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0532$

By Lemma A.3(a), we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0533$

By Lemma A.2 and Assumption 2.1, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0534$

Finally, the last term is

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0535$

as required.

Part (b) is an immediate corollary of Part (a).

Part (c). First, note that under Assumption 2.1 we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0536$

where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0537$ . It follows immediately that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0538$

as required. Also, the desired result follows because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0539$

where the second equality holds by Part (a).

Part (d). The desired result $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0540$ follows from (A.12) and by direct calculation. For the second desired result, note that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0541$ . First, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0542$ since $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0543$ by $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0544$ and by Part (g), which we prove later. Next, by definition,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0545$

Because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0546$ , where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0547$ is defined above Lemma A.1, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0548$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0549$

Therefore, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0550$

as required.

Part (e). Note that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0551$

The required result follows because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0552$

as required.

Part (f) follows by Lemma A.3(b2).

Part (g). By definition, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0553$

Here, the second equality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0554$ where $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0555$ is defined above Lemma A.1, and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0556$ is defined in A.8 :

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0557$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0558$

Combining the bounds of $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0559$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0560$ we have the desired result for Part (g). $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0561$

Proof of (A.6).The required result follows if we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0562$

Note that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0563$

The first term is bounded by

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0564$

where the first equality holds by Lemma A.4(c)–(e) and the last equality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0565$ . The second term is bounded by

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0566$

where the first equality holds by Lemma A.4(c), (d) and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0567$ , and the last equality holds because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0568$ . Combining these two, we have the required result

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0569$

The second term follows in similar fashion and we omit it. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0570$

Proof of (A.7).The required result follows, if we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0571$

which is established by the following three steps.

Step 1. We show
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0572$
Step 2. By (A.6), we have
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0573$
Step 3. We show
$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0574$

Proof of Step 1.Note that because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0575$ is a diagonal matrix,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0576$

Then, the required result for Step 1 follows if we show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0577$

The required result follows because

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0578$

where the last line holds by Lemma A.4(b)–(d) and the condition $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0579$ $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0580$

Proof of Step 3.We show

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0581$

The other required result $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0582$ follows in similar fashion and we omit the derivation. Note that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0583$

For the first term, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0584$

where the first equality holds by Lemma A.4(c), (d) and (g) and the last equality holds by the condition $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0585$ . For the second term, note that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0586$

where the second equality holds by Lemma A.4(c), (d) and (f), and the last equality holds by the condition $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0587$ . Then, we have all of the desired results for Part (c). $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0588$

APPENDIX B:: PROOFS OF THE LIMIT DISTRIBUTION RESULTS, THEOREMS 2.2, 2.4 AND 2.6

In this section, we provide proofs of Theorems 2.2, 2.4 and 2.6. These proofs are very similar to the proofs of the corresponding results in MPP and therefore we provide just an outline of the proofs here.

Proof of Theorem 2.2.Because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0589$ , we can write

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0590$

Direct calculation shows that under the assumptions of the theorem, we have the following joint limits

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0591$

and the central limit theorem (e.g. Moon and Phillips, 1999)

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0592$

thereby giving the required result. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0593$

Proof of Theorem 2.4.For the required result of the theorem, it is enough to show that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0594$

Let $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0595$ . Then, $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0596$ , and we can rewrite $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0597$ as

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0598$

where

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0599$

We can follow the proof in MPP (pp. 449–450) and deduce that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0600$

as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0601$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0602$ , which proves the desired result. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0603$

Proof of Theorem 2.6.The required result is a consequence of Lemmata A.5 and A.6. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0604$

Lemma A.5.Let Assumption 2.1 hold. Then, as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0605$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0606$ , we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0607$

Proof.The proof is similar to the proof of Lemma 11 of MPP and is omitted. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0608$

Lemma A.6.Let Assumption 2.1 hold. Then, as $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0609$ with $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0610$ the following hold:

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0611$

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0612$

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0613$

Proof.The proofs of Parts (b) and (c) are similar to those of Lemma 12(b) and (c) of MPP, and are omitted.

Part (a). First, note from

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0614$

that

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0615$

Because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0616$ we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0617$

Then,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0618$

Under the assumptions of the lemma,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0619$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0620$

leading to the required result for Part (a). $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0621$

APPENDIX C:: PROOF OF THEOREM 2.7

Proof of Theorem 2.7.We provide a sketch of the proof. Note that under Assumption 2.2, the following hold:

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0622$

Define $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0623$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0624$ Then,

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0625$

and

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0626$

Because $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0627$ under Assumption 2.1, we have

$urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0628$

with probability approaching one. These imply that $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0629$ satisfies the properties in Lemmata 8, 10 and 14 of MPP, while $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0630$ and $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0631$ satisfy the properties in Lemmata 8(a), 8(b), 10(a) and 14(a)–(d) of MPP. The desired results follow by similar arguments to those used in Theorems 8, 10 and 15 of MPP. $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0632$

Supporting Information

REFERENCES

Andrews, D. W. K. (1991). Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59, 817–58.
10.2307/2938229
Web of Science® Google Scholar
Bai, J. and S. Ng (2004). A PANIC attack on unit roots and cointegration. Econometrica 72, 1127–77.
10.1111/j.1468-0262.2004.00528.x
Web of Science® Google Scholar
Breitung, J. and J. Westerlund (2013). Lessons from a decade of IPS and LLC. Econometric Reviews 32, 547–91.
10.1080/07474938.2013.741023
Web of Science® Google Scholar
Breitung, J. and M. H. Pesaran (2008). Unit roots and cointegration in panels. In L. Matyas and P. Sevestre (Eds. ), The Econometrics of Panel Data, 279–322. Berlin: Springer.
10.1007/978-3-540-75892-1_9
Google Scholar
Dufour, J. and M. King (1991). Optimal Invariant tests for the autocorrelation coefficient in linear regressions with stationary or nonstationary AR(1) errors. Journal of Econometrics 47, 115–43.
10.1016/0304-4076(91)90080-W
Web of Science® Google Scholar
Elliott, G., T. Rothenberg and J. Stock (1996). Efficient tests for an autoregressive unit root. Econometrica 64, 813–36.
10.2307/2171846
Web of Science® Google Scholar
Im, K., H. Pesaran and Y. Shin (2003). Testing for unit roots in heterogeneous panels. Journal of Econometrics 115, 53–74.
10.1016/S0304-4076(03)00092-7
Web of Science® Google Scholar
Levin, A., F. Lin and C. Chu (2002). Unit root tests in panel data: asymptotic and finite-sample properties. Journal of Econometrics 108, 1–24.
10.1016/S0304-4076(01)00098-7
Web of Science® Google Scholar
Moon, H. R. and B. Perron (2004). Testing for a unit root in panels with dynamic factors. Journal of Econometrics 122, 81–126.
10.1016/j.jeconom.2003.10.020
Web of Science® Google Scholar
Moon, H. R., B. Perron and P. C. B. Phillips (2007). Incidental trends and the power of panel unit root tests. Journal of Econometrics 141, 416–59.
10.1016/j.jeconom.2006.10.003
Web of Science® Google Scholar
Phillips, P. C. B. and H. R. Moon (1999). Linear regression limit theory for nonstationary panel data. Econometrica 67, 1057–111.
10.1111/1468-0262.00070
Web of Science® Google Scholar
Phillips, P. C. B. and D. Sul (2003). Dynamic panel estimation and homogeneity testing under cross-section dependence. Econometrics Journal 6, 217–39.
10.1111/1368-423X.00108
Google Scholar
Westerlund, J. (2009). A note on the use of the LLC panel unit root test. Empirical Economics 37, 517–31.
10.1007/s00181-008-0244-8
Web of Science® Google Scholar

Citing Literature

Volume17, Issue3

October 2014

Pages 338-372

Point-optimal panel unit root tests with serially correlated errors

Summary

1. Introduction

2. Tests under Serial Correlation

2.1. No fixed effect: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0091$

2.2. Time-invariant fixed effects: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0141$

2.3. Incidental trends: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0171$

2.4. Implementation of the tests

3. Monte Carlo simulations

Note

4. Conclusion

Acknowledgements

APPENDIX A:: PROOFS OF THE APPROXIMATIONS IN THEOREMS 2.1, 2.3 AND 2.5

A.1 Supplementary results

A.1.1. A useful lemma

A.1.2. More preliminary results

APPENDIX B:: PROOFS OF THE LIMIT DISTRIBUTION RESULTS, THEOREMS 2.2, 2.4 AND 2.6

APPENDIX C:: PROOF OF THEOREM 2.7

Supporting Information

REFERENCES

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Point-optimal panel unit root tests with serially correlated errors

Summary

1. Introduction

2. Tests under Serial Correlation

2.1. No fixed effect:

2.2. Time-invariant fixed effects:

2.3. Incidental trends:

2.4. Implementation of the tests

3. Monte Carlo simulations

Note

4. Conclusion

Acknowledgements

APPENDIX A:: PROOFS OF THE APPROXIMATIONS IN THEOREMS 2.1, 2.3 AND 2.5

A.1 Supplementary results

A.1.1. A useful lemma

A.1.2. More preliminary results

APPENDIX B:: PROOFS OF THE LIMIT DISTRIBUTION RESULTS, THEOREMS 2.2, 2.4 AND 2.6

APPENDIX C:: PROOF OF THEOREM 2.7

Supporting Information

REFERENCES

Citing Literature

References

Related

Information

2.1. No fixed effect: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0091$

2.2. Time-invariant fixed effects: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0141$

2.3. Incidental trends: $urn:x-wiley:13684221:media:ectj12030:ectj12030-math-0171$