An approximate marginal logistic distribution for the analysis of longitudinal ordinal data
Corresponding Author
Nazanin Nooraee
University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands
email: [email protected]Search for more papers by this authorFentaw Abegaz
Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands
Search for more papers by this authorJohan Ormel
University of Groningen, University Medical Center Groningen, Interdisciplinary Center of Psychopathology and Emotion Regulation, Groningen, The Netherlands
Search for more papers by this authorErnst Wit
Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands
Search for more papers by this authorEdwin R van den Heuvel
University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands
Search for more papers by this authorCorresponding Author
Nazanin Nooraee
University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands
email: [email protected]Search for more papers by this authorFentaw Abegaz
Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands
Search for more papers by this authorJohan Ormel
University of Groningen, University Medical Center Groningen, Interdisciplinary Center of Psychopathology and Emotion Regulation, Groningen, The Netherlands
Search for more papers by this authorErnst Wit
Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands
Search for more papers by this authorEdwin R van den Heuvel
University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands
Search for more papers by this authorSummary
Subject-specific and marginal models have been developed for the analysis of longitudinal ordinal data. Subject-specific models often lack a population-average interpretation of the model parameters due to the conditional formulation of random intercepts and slopes. Marginal models frequently lack an underlying distribution for ordinal data, in particular when generalized estimating equations are applied. To overcome these issues, latent variable models underneath the ordinal outcomes with a multivariate logistic distribution can be applied. In this article, we extend the work of O'Brien and Dunson (2004), who studied the multivariate t-distribution with marginal logistic distributions. We use maximum likelihood, instead of a Bayesian approach, and incorporated covariates in the correlation structure, in addition to the mean model. We compared our method with GEE and demonstrated that it performs better than GEE with respect to the fixed effect parameter estimation when the latent variables have an approximately elliptical distribution, and at least as good as GEE for other types of latent variable distributions.
Supporting Information
Additional Supporting Information may be found in the online version of this article.
Filename | Description |
---|---|
biom12414-sup-0001-SuppData.pdf170 KB | Supplementary Materials. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
References
- Achenbach, T. M. and Rescorla, L. A. (2006). The Achenbach System of Empirically Based Assessment. ed. Archer, R.P. Lawrence Erlbaum Associates Publishers. Mahwah: New Jersey, USA. 229–262.
- Agresti, A. and Natarajan, R. (2001). Modeling clustered ordered categorical data: A survey. International Statistical Review/ Revue Internationale de Statistique 69, 345–371.
- Albert, J. H. and Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association 88, 669–679.
- Bouma, E. M. C., Ormel, J., Verhulst, F. C., and Oldehinkel, A. J. (2008). Stressful life events and depressive problems in early adolescent boys and girls: The influence of parental depression, temperament and family environment. Journal of Affective Disorders 105, 185–193.
- Breslow, N. E. and Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association 88, 9–25.
-
Budden, M.,
Hadavas, P.,
Hoffman, L., and
Pretz, C.
(2007).
Generating Valid
Correlation Matrices. Applied Mathematics E-notes 7, 53–59.
- Clayton, D. (1992). Repeated ordinal measurements: A generalised estimating equation approach. Medical Research Council Biostatistics Unit Technical Reports Cambridge, England.
- Chaganty, N. R. and Joe, H. (2004). Efficiency of generalized estimating equations for binary responses. Journal of the Royal Statistical Society B 66, 851–860.
- Cornish, E. A. (1954). The Multivariate t-distribution associated with a set of normal sample deviates. Australian Journal of Physics 7, 531–542.
- Dale, J. R. (1986). Global cross-ratio models for bivariate, discrete, ordered responses. Biometrics 42, 909–917.
- Daniels, M. J. and Pourahmadi, M. (2009). Modeling covariance matrices via partial autocorrelations. Journal of Multivariate Analysis 100, 2352–2363.
- Fitzmaurice, G., Davidian, M., Verbeke, G., and Molenberghs, G. (2009). Longitudinal Data Analysis. New York: Chapman & Hall/CRC Press.
- Genz, A. and Bretz, F. (1999). Numerical computation of multivariate t-probabilities with application to power calculation of multiple contrasts. Journal of Statistical Computation and Simulation 63, 361–378.
- Genz, A. and Bretz, F. (2002). Comparison of methods for the computation of multivariate t probabilities. Journal of Computational and Graphical Statistics 11, 950–971.
- Gumbel, E. J. (1961). Bivariate logistic distributions. Journal of the American Statistical Association 56, 335–349.
- Graybill, F. A. (1961). An Introduction to Linear Statistical Models. Volume 1. New York: McGrawHill.
- Heagerty, P. J. (1999). Marginally specified logistic-normal models for longitudinal binary data. Biometrics 55, 688–698.
- Heagerty, P. J. (2002). Marginalized transition models and likelihood inference for longitudinal categorical data. Biometrics 58, 342–351.
- Heagerty, P. J. and Zeger, S. C. (1996). Marginal regression models for clustered ordinal measurements. Journal of American Statistical Association 19, 1024–1036.
- Higham, N. J. (1961). Computing a nearest symmetric positive semidefinite matrix. Linear Algebra and its Applications 103, 103–118.
- Kline, R. B. (2011). Principles and Practice of Structural Equation Modeling. 3nd edition. New York: The Guilford Press.
- Kotz, S., Balakrishnan, N., and Johnson, N. L. (2000). Continuous Multivariate Distributions, Models and Applications. 2nd edition. New York: John Wiley & Sons.
- Lee, K. and Daniels, M. J. (2008). Marginalized models for longitudinal ordinal data with application to quality of life studies. Statistics in Medicine 27, 4359–4380.
- Li, Y. and Schafer, D. W. (2008). Likelihood analysis of the multivariate ordinal probit regression model for repeated ordinal responses. Computational Statistics & Data Analysis 52, 3474–3492.
- Malik, H. J. and Abraham, B. (1973). Multivariate logistic distributions. The Annals of Statistics 1, 588–590.
- McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society B 42, 109–142.
- Molenberghs, G. and Lesaffre, E. (1994). Marginal modeling of correlated ordinal data using a multivariate plackett distribution. Journal of the American Statistical Association 89, 633–644.
- Molenberghs, G. and Kenward, M. G. (2007). Missing Data in Clinical Studies. England: John Wiley & Sons.
- Muthén, B. O. (1983). Latent variable structural equation modeling with categorical data. Journal of Econometrics 22, 43–65.
- Muthén, B. O. (1984). A general structural equation model with dichotomous, ordered categorical and continuous latent indicators. Psychometrika 49, 115–132.
- Nelson, R. B. (2006). An introduction to copula. 2nd edition, Portland: Springer-Verlag.
- Nooraee, N., Molenberghs, G., and van den Heuvel, E. R. (2014). GEE for longitudinal ordinal data: Comparing R-geepack, R-multgee, R-repolr, SAS-GENMOD, SPSS-GENLIN. Computational Statistics & Data Analysis 77, 70–83.
- O'Brien, S. M. and Dunson, D. B. (2004). Bayesian multivariate logistic regression. Biometrics 60, 739–746.
- Ormel, J., Oldehinkel, A. J., Sijtsema, J., van Oort, F., Raven, D., Veenstra, R., et al. (2012). The TRacking Adolescents’ Individual Lives Survey (TRAILS): Design, Current Status, and Selected Findings. Journal of the American Academy of Child and Adolescent Psychiatry 51, 1020–1036.
- Olkin, I. (1981). Range restrictions for product-moment correlation matrices. Psychometrika 46, 469–472.
- Parzen, M., Ghosh, S., Lipsitz, S., Sinha, D., Fitzmaurice, G. M., Mallick, B. K., et al. (2011). A generalized linear mixed model for longitudinal binary data with a marginal logit link function. The Annals of Applied Statistics 5, 449–467.
- Robins, J. M., Rotnitzky, A., and Zhao, L. P. (1995). Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. American Statistical Association 90, 106–121.
- Rubin, D. B. (1976). Inference and missing data (with discussion). Biometrika 63, 581–592.
- Rubin, D. B. (1987). Multiple Imputation for Nonresponse in Surveys. NewYork: John Wiley & Sons.
- Scharfstein, D. O., Rotnitzky, A., and Robins, J. M. (1999). Adjusting for nonignorable drop-out using semi-parametric nonresponse models (with comments). American Statistical Association 94, 1096–1146.
- Shanno, D. F. (1970). Conditioning of quasi-newton methods for function minimization. Mathematics of Computation 24, 647–656.
- Stiratelli, R., Laird, N., and Ware, J. H. (1984). Random-effects models for serial observations with binary response. Biometrics 40, 961–971.
- Tan, M., Qu, Y., and Suni Rao, J. (1999). Robustness of the Latent Variable Model for Correlated Binary Data. Biometrics 55, 1541–0420.
- Wang, Z. and Louis, T. A. (2003). Matching conditional and marginal shapes in binary random intercept models using a bridge distribution function. Biometrika 90, 765–775.