A fast small‐sample kernel independence test for microbiome community‐level association analysis
Summary
To fully understand the role of microbiome in human health and diseases, researchers are increasingly interested in assessing the relationship between microbiome composition and host genomic data. The dimensionality of the data as well as complex relationships between microbiota and host genomics pose considerable challenges for analysis. In this article, we apply a kernel RV coefficient (KRV) test to evaluate the overall association between host gene expression and microbiome composition. The KRV statistic can capture nonlinear correlations and complex relationships among the individual data types and between gene expression and microbiome composition through measuring general dependency. Testing proceeds via a similar route as existing tests of the generalized RV coefficients and allows for rapid p‐value calculation. Strategies to allow adjustment for confounding effects, which is crucial for avoiding misleading results, and to alleviate the problem of selecting the most favorable kernel are considered. Simulation studies show that KRV is useful in testing statistical independence with finite samples given the kernels are appropriately chosen, and can powerfully identify existing associations between microbiome composition and host genomic data while protecting type I error. We apply the KRV to a microbiome study examining the relationship between host transcriptome and microbiome composition within the context of inflammatory bowel disease and are able to derive new biological insights and provide formal inference on prior qualitative observations.
Citing Literature
Number of times cited according to CrossRef: 13
- Daniel C. Posner, Honghuang Lin, James B. Meigs, Eric D. Kolaczyk, Josée Dupuis, Convex combination sequence kernel association test for rare‐variant studies, Genetic Epidemiology, 10.1002/gepi.22287, 44, 4, (352-367), (2020).
- Arun Srinivasan, Lingzhou Xue, Xiang Zhan, Compositional knockoff filter for high‐dimensional regression analysis of microbiome data, Biometrics, 10.1111/biom.13336, 0, 0, (2020).
- Russell T. Shinohara, Haochang Shou, Marco Carone, Robert Schultz, Birkan Tunc, Drew Parker, Melissa Lynne Martin, Ragini Verma, Distance‐based analysis of variance for brain connectivity, Biometrics, 10.1111/biom.13123, 76, 1, (257-269), (2019).
- Kalins Banerjee, Ni Zhao, Arun Srinivasan, Lingzhou Xue, Steven D. Hicks, Frank A. Middleton, Rongling Wu, Xiang Zhan, An Adaptive Multivariate Two-Sample Test With Application to Microbiome Differential Abundance Analysis, Frontiers in Genetics, 10.3389/fgene.2019.00350, 10, (2019).
- Hyunwook Koh, Yutong Li, Xiang Zhan, Jun Chen, Ni Zhao, A Distance-Based Kernel Association Test Based on the Generalized Linear Mixed Model for Correlated Microbiome Studies, Frontiers in Genetics, 10.3389/fgene.2019.00458, 10, (2019).
- Xiang Zhan, Relationship Between MiRKAT and Coefficient of Determination in Similarity Matrix Regression, Processes, 10.3390/pr7020079, 7, 2, (79), (2019).
- Jennifer F. Wu, Amutha Muthusamy, Gabriel A. Al‐Ghalith, Dan Knights, Bin Guo, Baolin Wu, Rory P. Remmel, David P. Schladt, Maria‐Luisa Alegre, William S. Oetting, Pamala A. Jacobson, Ajay K. Israni, Urinary microbiome associated with chronic allograft dysfunction in kidney transplant recipients, Clinical Transplantation, 10.1111/ctr.13436, 32, 12, (2018).
- Xiang Zhan, Lingzhou Xue, Haotian Zheng, Anna Plantinga, Michael C. Wu, Daniel J. Schaid, Ni Zhao, Jun Chen, A small‐sample kernel association test for correlated data with application to microbiome association studies, Genetic Epidemiology, 10.1002/gepi.22160, 42, 8, (772-782), (2018).
- Hyunwook Koh, An adaptive microbiome α-diversity-based association analysis method, Scientific Reports, 10.1038/s41598-018-36355-7, 8, 1, (2018).
- Duo Jiang, Miaoyan Wang, Recent developments in statistical methods for GWAS and high-throughput sequencing association studies of complex traits, Biostatistics & Epidemiology, 10.1080/24709360.2018.1529346, 2, 1, (132-159), (2018).
- M. Rauf Ahmad, A significance test of the RV coefficient in high dimensions, Computational Statistics & Data Analysis, 10.1016/j.csda.2018.10.008, (2018).
- Bin Guo, Baolin Wu, Reader reaction on the fast small‐sample kernel independence test for microbiome community‐level association analysis, Biometrics, 10.1111/biom.12823, 74, 3, (1120-1124), (2017).
- Xiang Zhan, Ni Zhao, Anna Plantinga, Timothy A. Thornton, Karen N. Conneely, Michael P. Epstein, Michael C. Wu, Powerful Genetic Association Analysis for Common or Rare Variants with High-Dimensional Structured Traits, Genetics, 10.1534/genetics.116.199646, 206, 4, (1779-1790), (2017).




