Volume 43, Issue 6
RESEARCH ARTICLE

Bayesian variable selection using partially observed categorical prior information in fine‐mapping association studies

Abdulaziz A. Alenazi

School of Mathematics and Statistics, University of Sheffield, Sheffield, UK

Department of Mathematics, Northern Border University, Arar, Saudi Arabia

Search for more papers by this author
Angela Cox

Department of Oncology, Sheffield Cancer Research Centre, University of Sheffield Medical School, Sheffield, UK

Search for more papers by this author
Miguel Juarez

School of Mathematics and Statistics, University of Sheffield, Sheffield, UK

Search for more papers by this author
Wei‐Yu Lin

Department of Oncology, Sheffield Cancer Research Centre, University of Sheffield Medical School, Sheffield, UK

Northern Institute for Cancer Research, Medical School, University of Newcastle, Newcastle, UK

Search for more papers by this author
Kevin Walters

Corresponding Author

E-mail address: k.walters@sheffield.ac.uk

School of Mathematics and Statistics, University of Sheffield, Sheffield, UK

Correspondence Kevin Walters, School of Mathematics and Statistics, University of Sheffield, Sheffield S3 7RH, UK.

Email: k.walters@sheffield.ac.uk

Search for more papers by this author
First published: 12 July 2019
Citations: 1

Abstract

Several methods have been proposed to allow functional genomic information to inform prior distributions in Bayesian fine‐mapping case–control association studies. None of these methods allow the inclusion of partially observed functional genomic information. We use functional significance (FS) scores that combine information across multiple bioinformatics sources to inform our effect size prior distributions. These scores are not available for all single‐nucleotide polymorphisms (SNPs) but by partitioning SNPs into naturally occurring FS score groups, we show how missing FS scores can easily be accommodated via finite mixtures of elicited priors. Most current approaches adopt a formal Bayesian variable selection approach and either limit the number of causal SNPs allowed or use approximations to avoid the need to explore the vast parameter space. We focus instead on achieving differential shrinkage of the effect sizes through prior scale mixtures of normals and use marginal posterior probability intervals to select candidate causal SNPs. We show via a simulation study how this approach can improve localisation of the causal SNPs compared to existing mutli‐SNP fine‐mapping methods. We also apply our approach to fine‐mapping a region around the CASP8 gene using the iCOGS consortium breast cancer SNP data.

Number of times cited according to CrossRef: 1

  • Quantifying posterior effect size distribution of susceptibility loci by common summary statistics, Genetic Epidemiology, 10.1002/gepi.22286, 44, 4, (339-351), (2020).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.