Improved detection of microbiome-disease associations via population structure-aware generalized linear mixed effects models (microSLAM)

Avatar
Poster
Voices Powered byElevenlabs logo
Connected to paperThis paper is a preprint and has not been certified by peer review

Improved detection of microbiome-disease associations via population structure-aware generalized linear mixed effects models (microSLAM)

Authors

Goldman, M.; Zhao, C.; Pollard, K. S.

Abstract

Microbiome association studies typically link host disease or other traits to summary statistics measured in metagenomics data, such as diversity or taxonomic composition. But identifying disease-associated species based on their relative abundance does not provide insight into why these microbes act as disease markers, and it overlooks cases where disease risk is related to specific strains with unique biological functions. To bridge this knowledge gap, we developed microSLAM, a mixed-effects model and an R package that performs association tests that connect host traits to the presence/absence of genes within each microbiome species, while accounting for strain genetic relatedness across hosts. Traits can be quantitative or binary (such as case/control). MicroSLAM is fit in three steps for each species. The first step estimates population structure across hosts. Step two calculates the association between population structure and the trait, enabling detection of species for which a subset of related strains confer risk. To identify specific genes whose presence/absence across diverse strains is associated with the trait, step three models the trait as a function of gene occurrence plus random effects estimated from step two. Applying microSLAM to 710 gut metagenomes from inflammatory bowel disease (IBD) samples, we discovered 49 species whose population structure correlates with IBD. In addition, after controlling for population structure, we found 57 microbial genes that are significantly more common in healthy individuals and 26 that are more common in IBD patients, including a seven-gene operon in Faecalibacterium prausnitzii that is involved in utilization of fructoselysine from the gut environment. Overall, microSLAM detected IBD associations for 45 species that were not detected using relative abundance tests, and it identified specific strains and genes underlying IBD associations for 13 other species. These findings highlight the importance of accounting for within-species genetic variation in microbiome studies.

Follow Us on

0 comments

Add comment