Frontiers in Biostatistics Seminar Series
Tuesday September 13, 2022
1:00PM Eastern Time
Yijuan Hu, Ph.D.
Associate Professor
Department of Biostatistics and Bioinformatics
Rollins School of Public Health
Emory University
Abstract: Studies on the human microbiome have revealed that differences in microbial communities are associated with many human disorders such as inflammatory bowel disease, type II diabetes, and even Alzheimer’s disease and some cancers. The microbiome is a particularly attractive target for establishing new biomarkers for disease diagnosis and prognosis, and for developing low-cost, low-risk interventions. Microbiome data have two unique features. First, they are compositional, i.e., the total number of sequencing reads per sample is an experimental artifact and only the relative abundance of taxa can be measured. Second, they are subject to a wide variety of experimental biases (e.g., in the process of DNA extraction and PCR amplification) that plague most analyses that directly analyze relative abundance data. These features call for analyses that are based on log-ratio transformation of the relative abundance data. Existing methods often ignore experimental biases, do not handle the extensive (50-90%) zero count data adequately, and do not accommodate other complexities in microbiome data (e.g., high-dimensionality, confounding covariates, and continuous covariates of interest). In this talk, we present a new logistic-regression-based method that takes into account all of these features of microbiome data for robust testing of differential abundance. Our simulation studies indicate that our method is the only one that universally controls the FDR while at the same time maintaining good power. We illustrate our method by the analysis of a throat microbiome dataset.