Volume 72, Issue 3
BIOMETRIC PRACTICE

Bayesian inference for latent biologic structure with determinantal point processes (DPP)

Yanxun Xu

Corresponding Author

E-mail address: yanxun.xu@jhu.edu

Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, Texas, U.S.A.

Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, Maryland, U.S.A.

email: yanxun.xu@jhu.edu

email: pmueller@math.utexas.edu

email: dtelesca@ucla.edu

Search for more papers by this author
Peter Müller

Corresponding Author

E-mail address: pmueller@math.utexas.edu

Department of Mathematics, The University of Texas at Austin, Austin, Texas, U.S.A.

email: yanxun.xu@jhu.edu

email: pmueller@math.utexas.edu

email: dtelesca@ucla.edu

Search for more papers by this author
Donatello Telesca

Corresponding Author

E-mail address: dtelesca@ucla.edu

Department of Biostatistics, UCLA School of Public Health, Los Angeles, California, U.S.A.

email: yanxun.xu@jhu.edu

email: pmueller@math.utexas.edu

email: dtelesca@ucla.edu

Search for more papers by this author
First published: 12 February 2016
Citations: 6

Summary

We discuss the use of the determinantal point process (DPP) as a prior for latent structure in biomedical applications, where inference often centers on the interpretation of latent features as biologically or clinically meaningful structure. Typical examples include mixture models, when the terms of the mixture are meant to represent clinically meaningful subpopulations (of patients, genes, etc.). Another class of examples are feature allocation models. We propose the DPP prior as a repulsive prior on latent mixture components in the first example, and as prior on feature‐specific parameters in the second case. We argue that the DPP is in general an attractive prior model for latent structure when biologically relevant interpretation of such structure is desired. We illustrate the advantages of DPP prior in three case studies, including inference in mixture models for magnetic resonance images (MRI) and for protein expression, and a feature allocation model for gene expression using data from The Cancer Genome Atlas. An important part of our argument are efficient and straightforward posterior simulation methods. We implement a variation of reversible jump Markov chain Monte Carlo simulation for inference under the DPP prior, using a density with respect to the unit rate Poisson process.

Number of times cited according to CrossRef: 6

  • BAREB: A Bayesian repulsive biclustering model for periodontal data, Statistics in Medicine, 10.1002/sim.8536, 39, 16, (2139-2151), (2020).
  • On a class of repulsive mixture models, TEST, 10.1007/s11749-020-00726-y, (2020).
  • On choosing mixture components via non‐local priors, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12333, 81, 5, (809-837), (2019).
  • undefined, Adjunct Publication of the 26th Conference on User Modeling, Adaptation and Personalization - UMAP '18, 10.1145/3213586.3226223, (127-130), (2018).
  • Density regression using repulsive distributions, Journal of Statistical Computation and Simulation, 10.1080/00949655.2018.1491578, 88, 15, (2931-2947), (2018).
  • Bayesian Repulsive Gaussian Mixture Model, Journal of the American Statistical Association, 10.1080/01621459.2018.1537918, (1-29), (2018).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.