Volume 41, Issue 4
RESEARCH ARTICLE

Gene‐based segregation method for identifying rare variants in family‐based sequencing studies

Dandi Qiao

Corresponding Author

E-mail address: redaq@channing.harvard.edu

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

Correspondence

Dandi Qiao, Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.

Email: redaq@channing.harvard.edu

Search for more papers by this author
Christoph Lange

Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America

Search for more papers by this author
Nan M. Laird

Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America

Search for more papers by this author
Sungho Won

Department of Public Health Science, Seoul National University, Seoul, Republic of Korea

Search for more papers by this author
Craig P. Hersh

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

Search for more papers by this author
Jarrett Morrow

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

Search for more papers by this author
Brian D. Hobbs

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

Search for more papers by this author
Sharon M. Lutz

Department of Biostatistics, Anschutz Medical Campus, University of Colorado, Aurora, Colorado, United States of America

Search for more papers by this author
Ingo Ruczinski

Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America

Search for more papers by this author
Terri H. Beaty

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America

Search for more papers by this author
Edwin K. Silverman

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

Search for more papers by this author
Michael H. Cho

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

University of Washington Center for Mendelian Genomics, Seattle, Washington, United States of America

Search for more papers by this author
First published: 13 February 2017
Citations: 7

Grant sponsor: National Heart, Lung and Blood Institute; Grant numbers: R01 HL113264, X01HL115219, R01 HL084323, P01 HL114501, P01 HL105339, R01 HL075478, R01 HL089856, K01 HL129039, K01 HL125858 and 2UMIHG006493; Grant sponsor: Alpha‐1 Foundation; Grant sponsor: National Human Genome Research Institute

ABSTRACT

Whole‐exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene‐based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region‐based methods. We apply our method to whole‐exome sequencing data from 49 extended pedigrees with severe, early‐onset chronic obstructive pulmonary disease (COPD) in the Boston Early‐Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family‐based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).

Number of times cited according to CrossRef: 7

  • Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates, Translational Psychiatry, 10.1038/s41398-020-0758-1, 10, 1, (2020).
  • Genetic Engineering of Novel Products of Health Significance: Recombinant DNA Technology, Functional Foods and Nutraceuticals, 10.1007/978-3-030-42319-3, (595-611), (2020).
  • Using linkage studies combined with whole‐exome sequencing to identify novel candidate genes for familial colorectal cancer, International Journal of Cancer, 10.1002/ijc.32683, 146, 6, (1568-1577), (2019).
  • Identification of a Novel Candidate Gene for Serrated Polyposis Syndrome Germline Predisposition by Performing Linkage Analysis Combined With Whole-Exome Sequencing, Clinical and Translational Gastroenterology, 10.14309/ctg.0000000000000100, 10, 10, (e00100), (2019).
  • A Frameshift Variant in the CHST9 Gene Identified by Family-Based Whole Genome Sequencing Is Associated with Schizophrenia in Chinese Population, Scientific Reports, 10.1038/s41598-019-49052-w, 9, 1, (2019).
  • Inferring disease risk genes from sequencing data in multiplex pedigrees through sharing of rare variants, Genetic Epidemiology, 10.1002/gepi.22155, 43, 1, (37-49), (2018).
  • Rare coding variant analysis in a large cohort of Ashkenazi Jewish families with inflammatory bowel disease, Human Genetics, 10.1007/s00439-018-1927-7, 137, 9, (723-734), (2018).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.