Volume 38, Issue 7
RESEARCH ARTICLE

A powerful and data‐adaptive test for rare‐variant–based gene‐environment interaction analysis

Tianzhong Yang

Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas

Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas

Search for more papers by this author
Han Chen

Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas

Center for Precision Health, School of Public Health and School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas

Search for more papers by this author
Hongwei Tang

Department of Gastrointestinal Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas

Search for more papers by this author
Donghui Li

Department of Gastrointestinal Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas

Search for more papers by this author
Peng Wei

Corresponding Author

E-mail address: pwei2@mdanderson.org

Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas

Peng Wei, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, 1400 Pressler St, Pickens Tower, FCT4.6044, Houston, TX 77030.

Email: pwei2@mdanderson.org

Search for more papers by this author
First published: 20 November 2018
Citations: 4

Abstract

As whole‐exome/genome sequencing data become increasingly available in genetic epidemiology research consortia, there is emerging interest in testing the interactions between rare genetic variants and environmental exposures that modify the risk of complex diseases. However, testing rare‐variant–based gene‐by‐environment interactions (GxE) is more challenging than testing the genetic main effects due to the difficulty in correctly estimating the latter under the null hypothesis of no GxE effects and the presence of neutral variants. In response, we have developed a family of powerful and data‐adaptive GxE tests, called “aGE” tests, in the framework of the adaptive powered score test, originally proposed for testing the genetic main effects. Using extensive simulations, we show that aGE tests can control the type I error rate in the presence of a large number of neutral variants or a nonlinear environmental main effect, and the power is more resilient to the inclusion of neutral variants than that of existing methods. We demonstrate the performance of the proposed aGE tests using Pancreatic Cancer Case‐Control Consortium Exome Chip data. An R package “aGE” is available at http://github.com/ytzhong/projects/.

Number of times cited according to CrossRef: 4

  • Composite Kernel Association Test (CKAT) for SNP-set joint assessment of genotype and genotype-by-treatment interaction in Pharmacogenetics studies, Bioinformatics, 10.1093/bioinformatics/btaa125, (2020).
  • An adaptive test for meta‐analysis of rare variant association studies, Genetic Epidemiology, 10.1002/gepi.22273, 44, 1, (104-116), (2019).
  • A genome-wide search for gene-by-obesity interaction loci of dyslipidemia in Koreans shows diverse genetic risk alleles, Journal of Lipid Research, 10.1194/jlr.P119000226, 60, 12, (2090-2101), (2019).
  • Incorporating multiple sets of eQTL weights into gene‐by‐environment interaction analysis identifies novel susceptibility loci for pancreatic cancer, Genetic Epidemiology, 10.1002/gepi.22348, 0, 0, (undefined).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.