Omar De la Cruz and Xiaoquan Wen equally contributed to this work.
Gene, region and pathway level analyses in whole-genome studies
Article first published online: 10 DEC 2009
© 2009 Wiley-Liss, Inc.
Volume 34, Issue 3, pages 222–231, April 2010
How to Cite
De la Cruz, O., Wen, X., Ke, B., Song, M. and Nicolae, D. L. (2010), Gene, region and pathway level analyses in whole-genome studies. Genet. Epidemiol., 34: 222–231. doi: 10.1002/gepi.20452
- Issue published online: 15 MAR 2010
- Article first published online: 10 DEC 2009
- NIH. Grant Numbers: R01 HL087665, R01 DK077489, U01 HL084715
- genome-wide association studies;
- pathway analyses;
- association in gene networks;
- combining P-values
In the setting of genome-wide association studies, we propose a method for assigning a measure of significance to pre-defined sets of markers in the genome. The sets can be genes, conserved regions, or groups of genes such as pathways. Using the proposed methods and algorithms, evidence for association between a particular functional unit and a disease status can be obtained not just by the presence of a strong signal from a SNP within it, but also by the combination of several simultaneous weaker signals that are not strongly correlated. This approach has several advantages. First, moderately strong signals from different SNPs are combined to obtain a much stronger signal for the set, therefore increasing power. Second, in combination with methods that provide information on untyped markers, it leads to results that can be readily combined across studies and platforms that might use different SNPs. Third, the results are easy to interpret, since they refer to functional sets of markers that are likely to behave as a unit in their phenotypic effect. Finally, the availability of gene-level P-values for association is the first step in developing methods that integrate information from pathways and networks with genome-wide association data, and these can lead to a better understanding of the complex traits genetic architecture. The power of the approach is investigated in simulated and real datasets. Novel Crohn's disease associations are found using the WTCCC data. Genet. Epidemiol. 34: 222–231, 2010. © 2009 Wiley-Liss, Inc.