Journal of Data Science,
Statistics, and Visualisation

Comparing model-based unconstrained ordination methods in the analysis of high-dimensional compositional count data

Authors

DOI:

https://doi.org/10.52933/jdssv.v5i6.133

Keywords:

Community-level modeling, copula, latent variable model, overdispersion, zero-inflation

Abstract

Model-based ordination of ecological community data has gained recently significant popularity among practitioners, largely due to increased availability and utilization of computational resources. Specifically, generalized linear latent variable models (GLLVMs)–a factor-analytic and rank-reduced form of mixed effect models–have proven to be both accurate and computationally efficient. GLLVMs have been implemented for a wide range of response types common to ecological community data; presence-absence, biomass, overdispersed and/or zero-inflated counts serving as examples. In this paper, we demonstrate how GLLVMs can be applied in the analysis of high-dimensional compositional count data. These methods are useful for example in the analysis of microbiome data, which are typically collected using modern lab-based sampling tools and are inherently compositional due to the finite capacity of sequencing instruments. We use simulation studies to compare the ordination methods based on GLLVMs with algorithmic compositional data analysis methods that rely on log-transformations. Also recently developed fast model-based ordination methods that utilize Gaussian copula models are included in our comparisons. The methods are illustrated with a microbiome data example.

Author Biographies

Wenqi Tang, University of Jyväskylä

Department of Mathematics and Statistics, University of Jyväskylä

PhD student

Pekka Korhonen, University of Jyväskylä

Department of Mathematics and Statistics, University of Jyväskylä

PhD student

Jenni Niku, University of Jyväskylä

Faculty of Sport and Health Sciences, University of Jyväskylä

University Teacher

Klaus Nordhausen, University of Helsinki

Department of Mathematics and Statistics, University of Helsinki

Professor

Sara Taskinen, University of Jyväskylä

Department of Mathematics and Statistics, University of Jyväskylä

Senior Lecturer

Downloads

Published

2025-05-12

How to Cite

Tang, W., Korhonen, P., Niku, J., Nordhausen, K., & Taskinen, S. (2025). Comparing model-based unconstrained ordination methods in the analysis of high-dimensional compositional count data. Journal of Data Science, Statistics, and Visualisation, 5(6). https://doi.org/10.52933/jdssv.v5i6.133
Journal of Data Science,
Statistics, and Visualisation
Pages