Comparing model-based unconstrained ordination methods in the analysis of high-dimensional compositional count data

Wenqi Tang; Pekka Korhonen; Jenni Niku; Klaus Nordhausen; Sara Taskinen

doi:10.52933/jdssv.v5i6.133


Authors

Wenqi Tang, University of Jyväskylä, https://orcid.org/0009-0002-3023-4978
Pekka Korhonen, University of Jyväskylä, https://orcid.org/0000-0003-2650-3645
Jenni Niku, University of Jyväskylä
Klaus Nordhausen, University of Helsinki, https://orcid.org/0000-0002-3758-8501
Sara Taskinen, University of Jyväskylä, https://orcid.org/0000-0001-9470-7258

DOI:

https://doi.org/10.52933/jdssv.v5i6.133

Keywords:

Community-level modeling, copula, latent variable model, overdispersion, zero-inflation

Abstract

Model-based ordination of ecological community data has recently gained significant popularity among practitioners, largely due to the increased availability and utilization of computational resources. Specifically, generalized linear latent variable models (GLLVMs), a factor-analytic and rank-reduced form of mixed effect models, have proven to be both accurate and computationally efficient. GLLVMs have been implemented for a wide range of response types common to ecological community data, such as presence-absence data, biomass, and overdispersed and/or zero-inflated counts. In this paper, we demonstrate how GLLVMs can be applied in the analysis of high-dimensional compositional count data. These methods are useful, for example, in the analysis of microbiome data, which are typically collected using modern lab-based sampling tools and are inherently compositional due to the finite capacity of sequencing instruments. We use simulation studies to compare the ordination methods based on GLLVMs with algorithmic compositional data analysis methods that rely on log-transformations. Recently developed fast model-based ordination methods that utilize Gaussian copula models are also included in our comparisons. The methods are illustrated with a microbiome data example.

Author Biographies

Wenqi Tang, University of Jyväskylä

Department of Mathematics and Statistics, University of Jyväskylä

PhD student

Pekka Korhonen, University of Jyväskylä

Department of Mathematics and Statistics, University of Jyväskylä

PhD student

Jenni Niku, University of Jyväskylä

Faculty of Sport and Health Sciences, University of Jyväskylä

University Teacher

Klaus Nordhausen, University of Helsinki

Department of Mathematics and Statistics, University of Helsinki

Professor

Sara Taskinen, University of Jyväskylä

Department of Mathematics and Statistics, University of Jyväskylä

Senior Lecturer
