Volume 72, Issue 1 pp. 253-261

BIOMETRIC PRACTICE

An approximate marginal logistic distribution for the analysis of longitudinal ordinal data

Nazanin Nooraee,

Corresponding Author

Nazanin Nooraee

University of Groningen, University Medical Center Groningen, Groningen, The Netherlands

Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands

email: [email protected]Search for more papers by this author

Fentaw Abegaz,

Fentaw Abegaz

Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands

Search for more papers by this author

Johan Ormel,

Johan Ormel

University of Groningen, University Medical Center Groningen, Interdisciplinary Center of Psychopathology and Emotion Regulation, Groningen, The Netherlands

Search for more papers by this author

Ernst Wit,

Ernst Wit

Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands

Search for more papers by this author

Edwin R van den Heuvel,

Edwin R van den Heuvel

University of Groningen, University Medical Center Groningen, Groningen, The Netherlands

Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands

Search for more papers by this author

Nazanin Nooraee,

Corresponding Author

Nazanin Nooraee

University of Groningen, University Medical Center Groningen, Groningen, The Netherlands

Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands

email: [email protected]Search for more papers by this author

Fentaw Abegaz,

Fentaw Abegaz

Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands

Search for more papers by this author

Johan Ormel,

Johan Ormel

University of Groningen, University Medical Center Groningen, Interdisciplinary Center of Psychopathology and Emotion Regulation, Groningen, The Netherlands

Search for more papers by this author

Ernst Wit,

Ernst Wit

Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Groningen, The Netherlands

Search for more papers by this author

Edwin R van den Heuvel,

Edwin R van den Heuvel

University of Groningen, University Medical Center Groningen, Groningen, The Netherlands

Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands

Search for more papers by this author

First published: 12 October 2015

https://doi.org/10.1111/biom.12414

Citations: 5

Share a link

Email
Wechat
Bluesky

Summary

Subject-specific and marginal models have been developed for the analysis of longitudinal ordinal data. Subject-specific models often lack a population-average interpretation of the model parameters due to the conditional formulation of random intercepts and slopes. Marginal models frequently lack an underlying distribution for ordinal data, in particular when generalized estimating equations are applied. To overcome these issues, latent variable models underneath the ordinal outcomes with a multivariate logistic distribution can be applied. In this article, we extend the work of O'Brien and Dunson (2004), who studied the multivariate t-distribution with marginal logistic distributions. We use maximum likelihood, instead of a Bayesian approach, and incorporated covariates in the correlation structure, in addition to the mean model. We compared our method with GEE and demonstrated that it performs better than GEE with respect to the fixed effect parameter estimation when the latent variables have an approximately elliptical distribution, and at least as good as GEE for other types of latent variable distributions.

Supporting Information

References

Achenbach, T. M. and Rescorla, L. A. (2006). The Achenbach System of Empirically Based Assessment. ed. Archer, R.P. Lawrence Erlbaum Associates Publishers. Mahwah: New Jersey, USA. 229–262.
Google Scholar
Agresti, A. and Natarajan, R. (2001). Modeling clustered ordered categorical data: A survey. International Statistical Review/ Revue Internationale de Statistique 69, 345–371.
10.1111/j.1751-5823.2001.tb00463.x
Web of Science® Google Scholar
Albert, J. H. and Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association 88, 669–679.
10.1080/01621459.1993.10476321
Web of Science® Google Scholar
Bouma, E. M. C., Ormel, J., Verhulst, F. C., and Oldehinkel, A. J. (2008). Stressful life events and depressive problems in early adolescent boys and girls: The influence of parental depression, temperament and family environment. Journal of Affective Disorders 105, 185–193.
10.1016/j.jad.2007.05.007
PubMed Web of Science® Google Scholar
Breslow, N. E. and Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association 88, 9–25.
10.1080/01621459.1993.10594284
Web of Science® Google Scholar
Budden, M., Hadavas, P., Hoffman, L., and Pretz, C. (2007). Generating Valid $urn:x-wiley:15410420:media:biom12414:biom12414-math-0305$ Correlation Matrices. Applied Mathematics E-notes 7, 53–59.
Google Scholar
Clayton, D. (1992). Repeated ordinal measurements: A generalised estimating equation approach. Medical Research Council Biostatistics Unit Technical Reports Cambridge, England.
Google Scholar
Chaganty, N. R. and Joe, H. (2004). Efficiency of generalized estimating equations for binary responses. Journal of the Royal Statistical Society B 66, 851–860.
10.1111/j.1467-9868.2004.05741.x
Web of Science® Google Scholar
Cornish, E. A. (1954). The Multivariate t-distribution associated with a set of normal sample deviates. Australian Journal of Physics 7, 531–542.
10.1071/PH540531
Web of Science® Google Scholar
Dale, J. R. (1986). Global cross-ratio models for bivariate, discrete, ordered responses. Biometrics 42, 909–917.
10.2307/2530704
CAS PubMed Web of Science® Google Scholar
Daniels, M. J. and Pourahmadi, M. (2009). Modeling covariance matrices via partial autocorrelations. Journal of Multivariate Analysis 100, 2352–2363.
10.1016/j.jmva.2009.04.015
CAS PubMed Web of Science® Google Scholar
Fitzmaurice, G., Davidian, M., Verbeke, G., and Molenberghs, G. (2009). Longitudinal Data Analysis. New York: Chapman & Hall/CRC Press.
Google Scholar
Genz, A. and Bretz, F. (1999). Numerical computation of multivariate t-probabilities with application to power calculation of multiple contrasts. Journal of Statistical Computation and Simulation 63, 361–378.
10.1080/00949659908811962
Web of Science® Google Scholar
Genz, A. and Bretz, F. (2002). Comparison of methods for the computation of multivariate t probabilities. Journal of Computational and Graphical Statistics 11, 950–971.
10.1198/106186002394
Web of Science® Google Scholar
Gumbel, E. J. (1961). Bivariate logistic distributions. Journal of the American Statistical Association 56, 335–349.
10.1080/01621459.1961.10482117
Web of Science® Google Scholar
Graybill, F. A. (1961). An Introduction to Linear Statistical Models. Volume 1. New York: McGrawHill.
Google Scholar
Heagerty, P. J. (1999). Marginally specified logistic-normal models for longitudinal binary data. Biometrics 55, 688–698.
10.1111/j.0006-341X.1999.00688.x
CAS PubMed Web of Science® Google Scholar
Heagerty, P. J. (2002). Marginalized transition models and likelihood inference for longitudinal categorical data. Biometrics 58, 342–351.
10.1111/j.0006-341X.2002.00342.x
PubMed Web of Science® Google Scholar
Heagerty, P. J. and Zeger, S. C. (1996). Marginal regression models for clustered ordinal measurements. Journal of American Statistical Association 19, 1024–1036.
Google Scholar
Higham, N. J. (1961). Computing a nearest symmetric positive semidefinite matrix. Linear Algebra and its Applications 103, 103–118.
10.1016/0024-3795(88)90223-6
Web of Science® Google Scholar
Kline, R. B. (2011). Principles and Practice of Structural Equation Modeling. 3nd edition. New York: The Guilford Press.
Google Scholar
Kotz, S., Balakrishnan, N., and Johnson, N. L. (2000). Continuous Multivariate Distributions, Models and Applications. 2nd edition. New York: John Wiley & Sons.
Google Scholar
Lee, K. and Daniels, M. J. (2008). Marginalized models for longitudinal ordinal data with application to quality of life studies. Statistics in Medicine 27, 4359–4380.
10.1002/sim.3352
CAS PubMed Web of Science® Google Scholar
Li, Y. and Schafer, D. W. (2008). Likelihood analysis of the multivariate ordinal probit regression model for repeated ordinal responses. Computational Statistics & Data Analysis 52, 3474–3492.
10.1016/j.csda.2007.10.025
Web of Science® Google Scholar
Malik, H. J. and Abraham, B. (1973). Multivariate logistic distributions. The Annals of Statistics 1, 588–590.
10.1214/aos/1176342430
Web of Science® Google Scholar
McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society B 42, 109–142.
10.1111/j.2517-6161.1980.tb01109.x
Web of Science® Google Scholar
Molenberghs, G. and Lesaffre, E. (1994). Marginal modeling of correlated ordinal data using a multivariate plackett distribution. Journal of the American Statistical Association 89, 633–644.
Web of Science® Google Scholar
Molenberghs, G. and Kenward, M. G. (2007). Missing Data in Clinical Studies. England: John Wiley & Sons.
Google Scholar
Muthén, B. O. (1983). Latent variable structural equation modeling with categorical data. Journal of Econometrics 22, 43–65.
10.1016/0304-4076(83)90093-3
Web of Science® Google Scholar
Muthén, B. O. (1984). A general structural equation model with dichotomous, ordered categorical and continuous latent indicators. Psychometrika 49, 115–132.
10.1007/BF02294210
Web of Science® Google Scholar
Nelson, R. B. (2006). An introduction to copula. 2nd edition, Portland: Springer-Verlag.
Google Scholar
Nooraee, N., Molenberghs, G., and van den Heuvel, E. R. (2014). GEE for longitudinal ordinal data: Comparing R-geepack, R-multgee, R-repolr, SAS-GENMOD, SPSS-GENLIN. Computational Statistics & Data Analysis 77, 70–83.
10.1016/j.csda.2014.03.009
Web of Science® Google Scholar
O'Brien, S. M. and Dunson, D. B. (2004). Bayesian multivariate logistic regression. Biometrics 60, 739–746.
10.1111/j.0006-341X.2004.00224.x
PubMed Web of Science® Google Scholar
Ormel, J., Oldehinkel, A. J., Sijtsema, J., van Oort, F., Raven, D., Veenstra, R., et al. (2012). The TRacking Adolescents’ Individual Lives Survey (TRAILS): Design, Current Status, and Selected Findings. Journal of the American Academy of Child and Adolescent Psychiatry 51, 1020–1036.
10.1016/j.jaac.2012.08.004
PubMed Web of Science® Google Scholar
Olkin, I. (1981). Range restrictions for product-moment correlation matrices. Psychometrika 46, 469–472.
10.1007/BF02293804
Web of Science® Google Scholar
Parzen, M., Ghosh, S., Lipsitz, S., Sinha, D., Fitzmaurice, G. M., Mallick, B. K., et al. (2011). A generalized linear mixed model for longitudinal binary data with a marginal logit link function. The Annals of Applied Statistics 5, 449–467.
10.1214/10-AOAS390
PubMed Web of Science® Google Scholar
Robins, J. M., Rotnitzky, A., and Zhao, L. P. (1995). Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. American Statistical Association 90, 106–121.
10.1080/01621459.1995.10476493
Web of Science® Google Scholar
Rubin, D. B. (1976). Inference and missing data (with discussion). Biometrika 63, 581–592.
10.1093/biomet/63.3.581
Web of Science® Google Scholar
Rubin, D. B. (1987). Multiple Imputation for Nonresponse in Surveys. NewYork: John Wiley & Sons.
Google Scholar
Scharfstein, D. O., Rotnitzky, A., and Robins, J. M. (1999). Adjusting for nonignorable drop-out using semi-parametric nonresponse models (with comments). American Statistical Association 94, 1096–1146.
10.1080/01621459.1999.10473862
Web of Science® Google Scholar
Shanno, D. F. (1970). Conditioning of quasi-newton methods for function minimization. Mathematics of Computation 24, 647–656.
10.1090/S0025-5718-1970-0274029-X
Web of Science® Google Scholar
Stiratelli, R., Laird, N., and Ware, J. H. (1984). Random-effects models for serial observations with binary response. Biometrics 40, 961–971.
10.2307/2531147
CAS PubMed Web of Science® Google Scholar
Tan, M., Qu, Y., and Suni Rao, J. (1999). Robustness of the Latent Variable Model for Correlated Binary Data. Biometrics 55, 1541–0420.
10.1111/j.0006-341X.1999.00258.x
Web of Science® Google Scholar
Wang, Z. and Louis, T. A. (2003). Matching conditional and marginal shapes in binary random intercept models using a bridge distribution function. Biometrika 90, 765–775.
10.1093/biomet/90.4.765
Web of Science® Google Scholar

Citing Literature

Volume72, Issue1

March 2016

Pages 253-261

An approximate marginal logistic distribution for the analysis of longitudinal ordinal data

Summary

Supporting Information

References

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

An approximate marginal logistic distribution for the analysis of longitudinal ordinal data

Summary

Supporting Information

References

Citing Literature

References

Related

Information