Volume 71, Issue 2 pp. 508-519
BIOMETRIC PRACTICE

Multilevel quantile function modeling with application to birth outcomes

Luke B. Smith

Corresponding Author

Luke B. Smith

Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.

email: [email protected]Search for more papers by this author
Brian J. Reich

Brian J. Reich

Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.

Search for more papers by this author
Amy H. Herring

Amy H. Herring

Department of Biostatistics and Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7420, U.S.A.

Search for more papers by this author
Peter H. Langlois

Peter H. Langlois

Texas Department of State Health Services, Austin, Texas 78714-9347, U.S.A.

Search for more papers by this author
Montserrat Fuentes

Montserrat Fuentes

Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.

Search for more papers by this author
First published: 11 March 2015
Citations: 10

Summary

Infants born preterm or small for gestational age have elevated rates of morbidity and mortality. Using birth certificate records in Texas from 2002 to 2004 and Environmental Protection Agency air pollution estimates, we relate the quantile functions of birth weight and gestational age to ozone exposure and multiple predictors, including parental age, race, and education level. We introduce a semi-parametric Bayesian quantile approach that models the full quantile function rather than just a few quantile levels. Our multilevel quantile function model establishes relationships between birth weight and the predictors separately for each week of gestational age and between gestational age and the predictors separately across Texas Public Health Regions. We permit these relationships to vary nonlinearly across gestational age, spatial domain and quantile level and we unite them in a hierarchical model via a basis expansion on the regression coefficients that preserves interpretability. Very low birth weight is a primary concern, so we leverage extreme value theory to supplement our model in the tail of the distribution. Gestational ages are recorded in completed weeks of gestation (integer-valued), so we present methodology for modeling quantile functions of discrete response data. In a simulation study we show that pooling information across gestational age and quantile level substantially reduces MSE of predictor effects. We find that ozone is negatively associated with the lower tail of gestational age in south Texas and across the distribution of birth weight for high gestational ages. Our methods are available in the R package BSquare.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.