Multilevel quantile function modeling with application to birth outcomes
Corresponding Author
Luke B. Smith
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.
email: [email protected]Search for more papers by this authorBrian J. Reich
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.
Search for more papers by this authorAmy H. Herring
Department of Biostatistics and Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7420, U.S.A.
Search for more papers by this authorPeter H. Langlois
Texas Department of State Health Services, Austin, Texas 78714-9347, U.S.A.
Search for more papers by this authorMontserrat Fuentes
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.
Search for more papers by this authorCorresponding Author
Luke B. Smith
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.
email: [email protected]Search for more papers by this authorBrian J. Reich
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.
Search for more papers by this authorAmy H. Herring
Department of Biostatistics and Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7420, U.S.A.
Search for more papers by this authorPeter H. Langlois
Texas Department of State Health Services, Austin, Texas 78714-9347, U.S.A.
Search for more papers by this authorMontserrat Fuentes
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-8203, U.S.A.
Search for more papers by this authorSummary
Infants born preterm or small for gestational age have elevated rates of morbidity and mortality. Using birth certificate records in Texas from 2002 to 2004 and Environmental Protection Agency air pollution estimates, we relate the quantile functions of birth weight and gestational age to ozone exposure and multiple predictors, including parental age, race, and education level. We introduce a semi-parametric Bayesian quantile approach that models the full quantile function rather than just a few quantile levels. Our multilevel quantile function model establishes relationships between birth weight and the predictors separately for each week of gestational age and between gestational age and the predictors separately across Texas Public Health Regions. We permit these relationships to vary nonlinearly across gestational age, spatial domain and quantile level and we unite them in a hierarchical model via a basis expansion on the regression coefficients that preserves interpretability. Very low birth weight is a primary concern, so we leverage extreme value theory to supplement our model in the tail of the distribution. Gestational ages are recorded in completed weeks of gestation (integer-valued), so we present methodology for modeling quantile functions of discrete response data. In a simulation study we show that pooling information across gestational age and quantile level substantially reduces MSE of predictor effects. We find that ozone is negatively associated with the lower tail of gestational age in south Texas and across the distribution of birth weight for high gestational ages. Our methods are available in the R package BSquare.
Supporting Information
Additional Supporting Information may be found in the online version of this article.
Filename | Description |
---|---|
biom12294-sup-0001-SuppData.pdf3 MB | Supplementary Materials. |
biom12294-sup-0002-SuppDataCode.zip185.7 KB | Supplementary Materials Code. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
References
- Alexander, G. R., Kogan, M., Bader, D., Carlo, W., Allen, M., and Mor, J. (2003). US birth weight/gestational age-specific neonatal mortality: 1995ā1997 rates for whites, Hispanics, and blacks. Pediatrics 111, e61āe66.
- Barker, D. J. (2006). Adult consequences of fetal growth restriction. Clinical Obstetrics and Gynecology 49, 270ā283.
- Behrens, C. N., Lopes, H. F., and Gamerman, D. (2004). Bayesian analysis of extreme events with threshold estimation. Statistical Modelling 4, 227ā244.
- Berrocal, V. J., Gelfand, A. E., and Holland, D. M. (2010). A bivariate space-time downscaler under space and time misalignment. The Annals of Applied Statistics 4, 1942.
- Bondell, H. D., Reich, B. J., and Wang, H. (2010). Noncrossing quantile regression curve estimation. Biometrika 97, 825ā838.
- Burgette, L. F. and Reiter, J. P. (2012). Modeling adverse birth outcomes via confirmatory factor quantile regression. Biometrics 68, 92ā100.
- Chen, J. and Lazar, N. A. (2010). Quantile estimation for discrete data via empirical likelihood. Journal of Nonparametric Statistics 22, 237ā255.
- Coles, S. (2001). An Introduction to Statistical Modeling of Extreme Values. London: Springer.
- De Gooijer, J. G. and Yuan, A. (2011). Kernel-smoothed conditional quantiles of correlated bivariate discrete data. Statistica Sinica 21, 1611ā1638.
- DeVore, R. A. (1977). Monotone approximation by splines. SIAM Journal on Mathematical Analysis 8, 891ā905.
- Dunson, D. B., Herring, A. H., and Siega-Riz, A. M. (2008). Bayesian inference on changes in response densities over predictor clusters. Journal of the American Statistical Association 103, 1508ā1517.
-
Frigessi, A., Haug, O., and Rue, H.
(2002).
A dynamic mixture model for unsupervised tail estimation without
threshold selection.
Extremes
5, 219ā235.
10.1023/A:1024072610684 Google Scholar
- Gardosi, J., Mongelli, M., Wilcox, M., and Chang, A. (1995). An adjustable fetal weight standard. Ultrasound in Obstetrics & Gynecology 6, 168ā174.
- Garite, T. J., Clark, R., and Thorp, J. A. (2004). Intrauterine growth restriction increases morbidity and mortality among premature neonates. American Journal of Obstetrics and Gynecology 191, 481ā487.
- Hendricks, K. A., Simpson, J. S., and Larsen, R. D. (1999). Neural tube defects along the Texas-Mexico border, 1993ā1995. American Journal of Epidemiology 149, 1119ā1127.
- Hoffman, C. S., Mendola, P., Savitz, D. A., Herring, A. H., Loomis, D., Hartmann, K. E., Singer, P. C., Weinberg, H. S., and Olshan, A. F. (2008a). Drinking water disinfection by-product exposure and fetal growth. Epidemiology 19, 729ā737.
- Hoffman, C. S., Mendola, P., Savitz, D. A., Herring, A. H., Loomis, D., Hartmann, K. E., Singer, P. C., Weinberg, H. S., and Olshan, A. F. (2008b). Drinking water disinfection by-product exposure and duration of gestation. Epidemiology 19, 738ā746.
- Honein, M. A., Kirby, R. S., Meyer, R. E., Xing, J., Skerrette, N. I., Yuskiv, N., et al. (2009). The association between major birth defects and preterm birth. Maternal and Child Health Journal 13, 164ā175.
- Ibrahim, J. G., Chen, M.-H., and Sinha, D. (2005). Bayesian survival analysis. Wiley Online Library.
- Kammann, E. E. and Wand, M. P. (2003). Geoadditive models. Journal of the Royal Statistical Society, Series C 52, 1ā18.
- Katz, J., Lee, A. C., Kozuki, N., Lawn, J. E., Cousens, S., Blencowe, H., et al. (2013). Mortality risk in preterm and small-for-gestational-age infants in low-income and middle-income countries: a pooled country analysis. The Lancet 382, 417ā425.
- Koenker, R. (2005). Quantile Regression. Number 38. New York: Cambridge University Press.
- Koenker, R. and Hallock, K. F. (2001). Quantile regression. Journal of Economic Perspectives 15, 143ā156.
- Koenker, R. W. and Bassett Jr, G. (1978). Regression quantiles. Econometrica 46, 33ā50.
- Machado, J. A. F. and Silva, J. S. (2005). Quantiles for counts. Journal of the American Statistical Association 100, 1226ā1237.
- Mochizuki, M., Maruo, T., Masuko, K., Ohtsu, T. (1984). Effects of smoking on fetoplacental-maternal system during pregnancy. American Journal of Obstetrics and Gynecology 149, 413ā420.
- Morgan, M. A., Goldenberg, R. L., and Schulkin, J. (2008). Obstetrician-gynecologistsā practices regarding preterm birth at the limit of viability. Journal of Maternal-Fetal and Neonatal Medicine 21, 115ā121.
- Narchi, H., Skinner, A., and Williams, B. (2010). Small for gestational age neonates-are we missing some by only using standard population growth standards and does it matter? Journal of Maternal-Fetal and Neonatal Medicine 23, 48ā54.
- Parzen, E. (1979). Nonparametric statistical data modeling. Journal of the American Statistical Association 74, 105ā121.
- Pulver, L. S., Guest-Warnick, G., Stoddard, G. J., Byington, C. L., and Young, P. C. (2009). Weight for gestational age affects the mortality of late preterm infants. Pediatrics 123, e1072āe1077.
-
Ramsay, J.
(1988).
Monotone regression splines in action.
Statistical Science
3, 425ā441.
10.1214/ss/1177012761 Google Scholar
- Reich, B. J., Cooley, D., Foley, K. M., Napelenok, S., and Shaby, B. A. (2013). Extreme value analysis for evaluating ozone control strategies. The Annals of Applied Statistics 7, 739.
- Reich, B. J., Fuentes, M., and Dunson, D. B. (2011). Bayesian spatial quantile regression. Journal of the American Statistical Association 106, 6ā20.
- Reich, B. J. and Smith, L. B. (2013). Bayesian quantile regression for censored data. Biometrics 69, 651ā660.
- Rogers, J. F. and Dunlop, A. L. (2006). Air pollution and very low birth weight infants: A target population? Pediatrics 118, 156ā164.
- Schieve, L. A. and Handler, A. (1996). Preterm delivery and perinatal death among black and white infants in a Chicago-area perinatal registry. Obstetrics & Gynecology 88, 356ā363.
- Schumaker,S. (1981). Spline Functions: Basic Theory. United States of America: John Wiley & Sons, Inc.
- ŠrÔm, R. J., BinkovÔ, B., Dejmek, J., and Bobak, M. (2005). Ambient air pollution and pregnancy outcomes: a review of the literature. Environmental Health Perspectives 113, 375.
- Tokdar, S. and Kadane, J. B. (2011). Simultaneous linear quantile regression: A semiparametric Bayesian approach. Bayesian Analysis 6, 1ā22.
- Tukey, J. W. (1965). Which part of the sample contains the information? Proceedings of the National Academy of Sciences of the United States of America 53, 127.
- Wang, H. and Tsai, C.-L. (2009). Tail index regression. Journal of the American Statistical Association 104, 1233ā1240.
- Wang, H. J., Li, D., and He, X. (2012). Estimation of high conditional quantiles for heavy-tailed distributions. Journal of the American Statistical Association 107, 1453ā1464.
- Warren, J., Fuentes, M., Herring, A., and Langlois, P. (2012). Spatial-temporal modeling of the association between air pollution exposure and preterm birth: Identifying critical windows of exposure. Biometrics 68, 1157ā1167.
- Wilcox, A. J. (2001). On the importance and the unimportance of birthweight. International Journal of Epidemiology 30, 1233ā1241.
- Yu, K. and Moyeed, R. A. (2001). Bayesian quantile regression. Statistics and Probability Letters 54, 437ā447.
- Zhou, H., Lange, K., and Suchard, M. A. (2010). Graphics processing units and high-dimensional optimization. Statistical Science 25, 311ā324.
- Zhou, J., Chang, H. H., and Fuentes, M. (2012). Estimating the health impact of climate change with calibrated climate model output. Journal of Agricultural, Biological, and Environmental statistics 17, 377ā394.