Volume 37, Issue 3
RESEARCH ARTICLE

Dissecting gene‐environment interactions: A penalized robust approach accounting for hierarchical structures

Cen Wu

Corresponding Author

E-mail address: wucen@ksu.edu

Department of Statistics, Kansas State University, Manhattan, KS 66506, USA

Correspondence

Cen Wu, Department of Statistics, Kansas State University, Manhattan, KS 66506, USA.

Email: wucen@ksu.edu

Search for more papers by this author
Yu Jiang

Division of Epidemiology, Biostatistics, and Environmental Health, University of Memphis, Memphis, TN 38111, USA

Search for more papers by this author
Jie Ren

Department of Statistics, Kansas State University, Manhattan, KS 66506, USA

Search for more papers by this author
Yuehua Cui

Department of Statistics and Probability, Michigan State University, 619 Red Cedar Rd, East Lansing, MI 48824, USA

Search for more papers by this author
Shuangge Ma

Department of Biostatistics, Yale University, 60 College Street, New Haven, CT 06520, USA

Search for more papers by this author
First published: 16 October 2017
Citations: 11

Abstract

Identification of gene‐environment (G × E) interactions associated with disease phenotypes has posed a great challenge in high‐throughput cancer studies. The existing marginal identification methods have suffered from not being able to accommodate the joint effects of a large number of genetic variants, while some of the joint‐effect methods have been limited by failing to respect the “main effects, interactions” hierarchy, by ignoring data contamination, and by using inefficient selection techniques under complex structural sparsity. In this article, we develop an effective penalization approach to identify important G × E interactions and main effects, which can account for the hierarchical structures of the 2 types of effects. Possible data contamination is accommodated by adopting the least absolute deviation loss function. The advantage of the proposed approach over the alternatives is convincingly demonstrated in both simulation and a case study on lung cancer prognosis with gene expression measurements and clinical covariates under the accelerated failure time model.

Number of times cited according to CrossRef: 11

  • Semiparametric Bayesian variable selection for gene‐environment interactions, Statistics in Medicine, 10.1002/sim.8434, 39, 5, (617-638), (2019).
  • Structured gene‐environment interaction analysis, Biometrics, 10.1111/biom.13139, 76, 1, (23-35), (2019).
  • Identification of gene–environment interactions with marginal penalization, Genetic Epidemiology, 10.1002/gepi.22270, 44, 2, (159-196), (2019).
  • Robust semiparametric gene‐environment interaction analysis using sparse boosting, Statistics in Medicine, 10.1002/sim.8322, 38, 23, (4625-4641), (2019).
  • Robust network‐based regularization and variable selection for high‐dimensional genomic data in cancer prognosis, Genetic Epidemiology, 10.1002/gepi.22194, 43, 3, (276-291), (2019).
  • A Selective Review of Multi-Level Omics Data Integration Using Variable Selection, High-Throughput, 10.3390/ht8010004, 8, 1, (4), (2019).
  • A Simple Approximation to Bias in Gene–Environment Interaction Estimates When a Case Might Not Be the Case, Frontiers in Genetics, 10.3389/fgene.2019.00886, 10, (2019).
  • Penalized Variable Selection for Lipid–Environment Interactions in a Longitudinal Lipidomics Study, Genes, 10.3390/genes10121002, 10, 12, (1002), (2019).
  • Robust identification of gene-environment interactions for prognosis using a quantile partial correlation approach, Genomics, 10.1016/j.ygeno.2018.07.006, (2018).
  • Robust gene–environment interaction analysis using penalized trimmed regression, Journal of Statistical Computation and Simulation, 10.1080/00949655.2018.1523411, 88, 18, (3502-3528), (2018).
  • Robust genetic interaction analysis, Briefings in Bioinformatics, 10.1093/bib/bby033, (2018).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.