Volume 66, Issue 1 pp. 196-204

Impact of Population Substructure on Trend Tests for Genetic Case–Control Association Studies

Gang Zheng

Corresponding Author

Gang Zheng

Office of Biostatistics Research, DPPS, National Heart, Lung and Blood Institute, 6701 Rockledge Drive, MSC 7913, Bethesda, Maryland 20892-7913, U.S.A.

email: [email protected]Search for more papers by this author
Zhaohai Li

Zhaohai Li

Department of Statistics, George Washington University, Washington DC 20052, U.S.A.

Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 6120 Executive Blvd, EPS, Bethesda, Maryland 20892, U.S.A.

Search for more papers by this author
Mitchell H. Gail

Mitchell H. Gail

Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 6120 Executive Blvd, EPS, Bethesda, Maryland 20892, U.S.A.

Search for more papers by this author
Joseph L. Gastwirth

Joseph L. Gastwirth

Department of Statistics, George Washington University, Washington DC 20052, U.S.A.

Search for more papers by this author
First published: 17 March 2010
Citations: 6

Abstract

Summary Hidden population substructure in case–control data has the potential to distort the performance of Cochran–Armitage trend tests (CATTs) for genetic associations. Three possible scenarios that may arise are investigated here: (i) heterogeneity of genotype frequencies across unidentified subpopulations (PSI), (ii) heterogeneity of genotype frequencies and disease risk across unidentified subpopulations (PSII), and (iii) cryptic correlations within unidentified subpopulations. A unified approach is presented for deriving the bias and variance distortion under the three scenarios for any CATT in a general family. Using these analytical formulas, we evaluate the excess type I errors of the CATTs numerically in the presence of population substructure. Our results provide insight into the properties of some proposed corrections for bias and variance distortion and show why they may not fully correct for the effects of population substructure.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.