Volume 28, Issue 3
Original Article

Evolutionary‐based grouping of haplotypes in association analysis

Jung‐Ying Tzeng

Corresponding Author

E-mail address: jytzeng@stat.ncsu.edu

Department of Statistics and Bioinformatics Research Center, North Carolina State University, Raleigh

Department of Statistics and Bioinformatics Research Center, North Carolina State University, Campus Box 7566, Raleigh NC, 27695===Search for more papers by this author
First published: 09 March 2005
Citations: 35

Abstract

Haplotypes incorporate more information about the underlying polymorphisms than do genotypes for individual SNPs, and are considered as a more informative format of data in association analysis. To model haplotypes requires high degrees of freedom, which could decrease power and limit a model's capacity to incorporate other complex effects, such as gene‐gene interactions. Even within haplotype blocks, high degrees of freedom are still a concern unless one chooses to discard rare haplotypes. To increase the efficiency and power of haplotype analysis, we adapt the evolutionary concepts of cladistic analyses and propose a grouping algorithm to cluster rare haplotypes to the corresponding ancestral haplotypes. The algorithm determines the cluster bases by preserving common haplotypes using a criterion built on the Shannon information content. Each haplotype is then assigned to its appropriate clusters probabilistically according to the cladistic relationship. Through this algorithm, we perform association analysis based on groups of haplotypes. Simulation results indicate power increases for performing tests on the haplotype clusters when compared to tests using original haplotypes or the truncated haplotype distribution. Genet. Epidemiol. © 2005 Wiley‐Liss, Inc.

Number of times cited according to CrossRef: 35

  • Exploiting gene‐environment independence in haplotype‐based inferences for population‐based case‐control studies with complex sampling, Statistics in Medicine, 10.1002/sim.8395, 39, 1, (57-69), (2019).
  • What has GWAS done for HLA and disease associations?, International Journal of Immunogenetics, 10.1111/iji.12332, 44, 5, (195-211), (2017).
  • New Genetic Approaches to AD: Lessons from APOE-TOMM40 Phylogenetics, Current Neurology and Neuroscience Reports, 10.1007/s11910-016-0643-8, 16, 5, (2016).
  • Detecting associations of rare variants with common diseases: collapsing or haplotyping?, Briefings in Bioinformatics, 10.1093/bib/bbu050, 16, 5, (759-768), (2015).
  • Haplotype Kernel Association Test as a Powerful Method to Identify Chromosomal Regions Harboring Uncommon Causal Variants, Genetic Epidemiology, 10.1002/gepi.21740, 37, 6, (560-570), (2013).
  • Bioinformatics and Statistics — Omics Data Analysis for Personalized Medicine —, Japanese Journal of Biometrics, 10.5691/jjb.32.S51, 32, Special_Issue, (S51-S64), (2011).
  • Significance testing in ridge regression for genetic data, BMC Bioinformatics, 10.1186/1471-2105-12-372, 12, 1, (2011).
  • A Bayesian Hierarchical Model for Detecting Haplotype-Haplotype and Haplotype-Environment Interactions in Genetic Association Studies, Human Heredity, 10.1159/000324841, 71, 3, (148-160), (2011).
  • Combining an Evolution-guided Clustering Algorithm and Haplotype-based LRT in Family Association Studies, BMC Genetics, 10.1186/1471-2156-12-48, 12, 1, (48), (2011).
  • Using an Uncertainty-Coding Matrix in Bayesian Regression Models for Haplotype-Specific Risk Detection in Family Association Studies, PLoS ONE, 10.1371/journal.pone.0021890, 6, 7, (e21890), (2011).
  • The Diverse Applications of Cladistic Analysis of Molecular Evolution, with Special Reference to Nested Clade Analysis, International Journal of Molecular Sciences, 10.3390/ijms11010124, 11, 1, (124-139), (2010).
  • A comprehensive approach to haplotype-specific analysis by penalized likelihood, European Journal of Human Genetics, 10.1038/ejhg.2009.118, 18, 1, (95-103), (2009).
  • Haplotype Association Analysis, Handbook on Analyzing Human Genetic Data, 10.1007/978-3-540-69264-5, (241-276), (2009).
  • A Regression‐based Association Test for Case‐control Studies that Uses Inferred Ancestral Haplotype Similarity, Annals of Human Genetics, 10.1111/j.1469-1809.2009.00536.x, 73, 5, (520-526), (2009).
  • On the use of phylogeny‐based tests to detect association between quantitative traits and haplotypes, Genetic Epidemiology, 10.1002/gepi.20425, 33, 8, (729-739), (2009).
  • Generalized linear modeling with regularization for detecting common disease rare haplotype association, Genetic Epidemiology, 10.1002/gepi.20382, 33, 4, (308-316), (2008).
  • Association mapping by generalized linear regression with density‐based haplotype clustering, Genetic Epidemiology, 10.1002/gepi.20352, 33, 1, (16-26), (2008).
  • Haplotype‐Association Analysis, Genetic Dissection of Complex Traits, 10.1016/S0065-2660(07)00414-2, (335-405), (2008).
  • Statistical performance of cladistic strategies for haplotype grouping in pharmacogenetics, Statistics in Medicine, 10.1002/sim.3399, 27, 28, (5816-5833), (2008).
  • CLUMPHAP: a simple tool for performing haplotype‐based association analysis, Genetic Epidemiology, 10.1002/gepi.20327, 32, 6, (539-545), (2008).
  • Gene-Centric Genomewide Association Study via Entropy, Genetics, 10.1534/genetics.107.082370, 179, 1, (637-650), (2008).
  • Haplotype-Based Association Analysis via Variance-Components Score Test, The American Journal of Human Genetics, 10.1086/521558, 81, 5, (927-938), (2007).
  • Sequential haplotype scan methods for association analysis, Genetic Epidemiology, 10.1002/gepi.20228, 31, 6, (553-564), (2007).
  • Association mapping through heuristic evolutionary history reconstruction-application to GAW15 Problem 3, BMC Proceedings, 10.1186/1753-6561-1-S1-S131, 1, S1, (2007).
  • Density-based clustering in haplotype analysis for association mapping, BMC Proceedings, 10.1186/1753-6561-1-S1-S27, 1, S1, (2007).
  • Incorporating Single-Locus Tests into Haplotype Cladistic Analysis in Case-Control Studies, PLoS Genetics, 10.1371/journal.pgen.0030046, 3, 3, (e46), (2007).
  • Genetic Association Mapping via Evolution-Based Clustering of Haplotypes, PLoS Genetics, 10.1371/journal.pgen.0030111, 3, 7, (e111), (2007).
  • TreeDT: Tree Pattern Mining for Gene Mapping, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 10.1109/TCBB.2006.28, 3, 2, (174-185), (2006).
  • Genome Diversity And Adverse Effects of Anticancer Drug -On Methodology for Search for Relevant Genes-, Haigan, 10.2482/haigan.46.253, 46, 3, (253-258), (2006).
  • Regression-Based Association Analysis with Clustered Haplotypes through Use of Genotypes, The American Journal of Human Genetics, 10.1086/500025, 78, 2, (231-242), (2006).
  • Comment, Journal of the American Statistical Association, 10.1198/016214505000000844, 101, 473, (111-114), (2006).
  • Estimation and testing of genotype and haplotype effects in case‐control studies: comparison of weighted regression and multiple imputation procedures, Genetic Epidemiology, 10.1002/gepi.20142, 30, 3, (259-275), (2006).
  • Association Mapping With Single-Feature Polymorphisms, Genetics, 10.1534/genetics.105.052720, 173, 2, (1125-1133), (2006).
  • Catechol-O-methyltransferase haplotypes are associated with psychosis in Alzheimer disease, Molecular Psychiatry, 10.1038/sj.mp.4001709, 10, 11, (1026-1036), (2005).
  • Genome-Wide Association Mapping in Arabidopsis Identifies Previously Known Flowering Time and Pathogen Resistance Genes, PLoS Genetics, 10.1371/journal.pgen.0010060, 1, 5, (e60), (2005).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.