Volume 73, Issue 4
BIOMETRIC PRACTICE

A fast small‐sample kernel independence test for microbiome community‐level association analysis

Xiang Zhan

Corresponding Author

E-mail address: xzhan@fhcrc.org

Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A.

email: xzhan@fhcrc.org

email: mcwu@fhcrc.org

Search for more papers by this author
Anna Plantinga

Department of Biostatistics, University of Washington, Seattle, Washington 98195, U.S.A.

Search for more papers by this author
Ni Zhao

Department of Biostatistics, Johns Hopkins University, Baltimore, Maryland 21205, U.S.A.

Search for more papers by this author
Michael C. Wu

Corresponding Author

E-mail address: mcwu@fhcrc.org

Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A.

email: xzhan@fhcrc.org

email: mcwu@fhcrc.org

Search for more papers by this author
First published: 10 March 2017
Citations: 13

Summary

To fully understand the role of microbiome in human health and diseases, researchers are increasingly interested in assessing the relationship between microbiome composition and host genomic data. The dimensionality of the data as well as complex relationships between microbiota and host genomics pose considerable challenges for analysis. In this article, we apply a kernel RV coefficient (KRV) test to evaluate the overall association between host gene expression and microbiome composition. The KRV statistic can capture nonlinear correlations and complex relationships among the individual data types and between gene expression and microbiome composition through measuring general dependency. Testing proceeds via a similar route as existing tests of the generalized RV coefficients and allows for rapid p‐value calculation. Strategies to allow adjustment for confounding effects, which is crucial for avoiding misleading results, and to alleviate the problem of selecting the most favorable kernel are considered. Simulation studies show that KRV is useful in testing statistical independence with finite samples given the kernels are appropriately chosen, and can powerfully identify existing associations between microbiome composition and host genomic data while protecting type I error. We apply the KRV to a microbiome study examining the relationship between host transcriptome and microbiome composition within the context of inflammatory bowel disease and are able to derive new biological insights and provide formal inference on prior qualitative observations.

Number of times cited according to CrossRef: 13

  • Convex combination sequence kernel association test for rare‐variant studies, Genetic Epidemiology, 10.1002/gepi.22287, 44, 4, (352-367), (2020).
  • Compositional knockoff filter for high‐dimensional regression analysis of microbiome data, Biometrics, 10.1111/biom.13336, 0, 0, (2020).
  • Distance‐based analysis of variance for brain connectivity, Biometrics, 10.1111/biom.13123, 76, 1, (257-269), (2019).
  • An Adaptive Multivariate Two-Sample Test With Application to Microbiome Differential Abundance Analysis, Frontiers in Genetics, 10.3389/fgene.2019.00350, 10, (2019).
  • A Distance-Based Kernel Association Test Based on the Generalized Linear Mixed Model for Correlated Microbiome Studies, Frontiers in Genetics, 10.3389/fgene.2019.00458, 10, (2019).
  • Relationship Between MiRKAT and Coefficient of Determination in Similarity Matrix Regression, Processes, 10.3390/pr7020079, 7, 2, (79), (2019).
  • Urinary microbiome associated with chronic allograft dysfunction in kidney transplant recipients, Clinical Transplantation, 10.1111/ctr.13436, 32, 12, (2018).
  • A small‐sample kernel association test for correlated data with application to microbiome association studies, Genetic Epidemiology, 10.1002/gepi.22160, 42, 8, (772-782), (2018).
  • An adaptive microbiome α-diversity-based association analysis method, Scientific Reports, 10.1038/s41598-018-36355-7, 8, 1, (2018).
  • Recent developments in statistical methods for GWAS and high-throughput sequencing association studies of complex traits, Biostatistics & Epidemiology, 10.1080/24709360.2018.1529346, 2, 1, (132-159), (2018).
  • A significance test of the RV coefficient in high dimensions, Computational Statistics & Data Analysis, 10.1016/j.csda.2018.10.008, (2018).
  • Reader reaction on the fast small‐sample kernel independence test for microbiome community‐level association analysis, Biometrics, 10.1111/biom.12823, 74, 3, (1120-1124), (2017).
  • Powerful Genetic Association Analysis for Common or Rare Variants with High-Dimensional Structured Traits, Genetics, 10.1534/genetics.116.199646, 206, 4, (1779-1790), (2017).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.