A penalized robust semiparametric approach for gene–environment interactions
Abstract
In genetic and genomic studies, gene‐environment (G×E) interactions have important implications. Some of the existing G×E interaction methods are limited by analyzing a small number of G factors at a time, by assuming linear effects of E factors, by assuming no data contamination, and by adopting ineffective selection techniques. In this study, we propose a new approach for identifying important G×E interactions. It jointly models the effects of all E and G factors and their interactions. A partially linear varying coefficient model is adopted to accommodate possible nonlinear effects of E factors. A rank‐based loss function is used to accommodate possible data contamination. Penalization, which has been extensively used with high‐dimensional data, is adopted for selection. The proposed penalized estimation approach can automatically determine if a G factor has an interaction with an E factor, main effect but not interaction, or no effect at all. The proposed approach can be effectively realized using a coordinate descent algorithm. Simulation shows that it has satisfactory performance and outperforms several competing alternatives. The proposed approach is used to analyze a lung cancer study with gene expression measurements and clinical variables. Copyright © 2015 John Wiley & Sons, Ltd.
Citing Literature
Number of times cited according to CrossRef: 12
- Peng Lai, Fangjian Wang, Tingyu Zhu, Qingzhao Zhang, Model identification and selection for single-index varying-coefficient models, Annals of the Institute of Statistical Mathematics, 10.1007/s10463-020-00757-0, (2020).
- Jie Ren, Fei Zhou, Xiaoxi Li, Qi Chen, Hongmei Zhang, Shuangge Ma, Yu Jiang, Cen Wu, Semiparametric Bayesian variable selection for gene‐environment interactions, Statistics in Medicine, 10.1002/sim.8434, 39, 5, (617-638), (2019).
- Jie Ren, Yinhao Du, Shaoyu Li, Shuangge Ma, Yu Jiang, Cen Wu, Robust network‐based regularization and variable selection for high‐dimensional genomic data in cancer prognosis, Genetic Epidemiology, 10.1002/gepi.22194, 43, 3, (276-291), (2019).
- Tim P. Morris, Ian R. White, Michael J. Crowther, Using simulation studies to evaluate statistical methods, Statistics in Medicine, 10.1002/sim.8086, 38, 11, (2074-2102), (2019).
- Yang Li, Rong Li, Cunjie Lin, Yichen Qin, Shuangge Ma, Penalized integrative semiparametric interaction analysis for multiple genetic datasets, Statistics in Medicine, 10.1002/sim.8172, 38, 17, (3221-3242), (2019).
- Mengyun Wu, Shuangge Ma, Robust semiparametric gene‐environment interaction analysis using sparse boosting, Statistics in Medicine, 10.1002/sim.8322, 38, 23, (4625-4641), (2019).
- Cen Wu, Fei Zhou, Jie Ren, Xiaoxi Li, Yu Jiang, Shuangge Ma, A Selective Review of Multi-Level Omics Data Integration Using Variable Selection, High-Throughput, 10.3390/ht8010004, 8, 1, (4), (2019).
- Fei Zhou, Jie Ren, Gengxin Li, Yu Jiang, Xiaoxi Li, Weiqun Wang, Cen Wu, Penalized Variable Selection for Lipid–Environment Interactions in a Longitudinal Lipidomics Study, Genes, 10.3390/genes10121002, 10, 12, (1002), (2019).
- Yaqing Xu, Mengyun Wu, Shuangge Ma, Syed Ejaz Ahmed, Robust gene–environment interaction analysis using penalized trimmed regression, Journal of Statistical Computation and Simulation, 10.1080/00949655.2018.1523411, 88, 18, (3502-3528), (2018).
- Mengyun Wu, Shuangge Ma, Robust genetic interaction analysis, Briefings in Bioinformatics, 10.1093/bib/bby033, (2018).
- Cen Wu, Yu Jiang, Jie Ren, Yuehua Cui, Shuangge Ma, Dissecting gene‐environment interactions: A penalized robust approach accounting for hierarchical structures, Statistics in Medicine, 10.1002/sim.7518, 37, 3, (437-456), (2017).
- Jooyong Shim, Changha Hwang, Sunjoo Jeong, Insuk Sohn, Semivarying coefficient least-squares support vector regression for analyzing high-dimensional gene-environmental data, Journal of Applied Statistics, 10.1080/02664763.2017.1371676, 45, 8, (1370-1381), (2017).




