Volume 72, Issue 2
BIOMETRIC METHODOLOGY

A significance test for graph‐constrained estimation

Sen Zhao

Corresponding Author

Department of Biostatistics, University of Washington, Seattle, Washington, U.S.A.

email: senz@u.washington.edu

email: ashojaie@u.washington.edu

Search for more papers by this author
Ali Shojaie

Corresponding Author

Department of Biostatistics, University of Washington, Seattle, Washington, U.S.A.

email: senz@u.washington.edu

email: ashojaie@u.washington.edu

Search for more papers by this author
First published: 22 September 2015
Citations: 5

Summary

Graph‐constrained estimation methods encourage similarities among neighboring covariates presented as nodes of a graph, and can result in more accurate estimates, especially in high‐dimensional settings. Variable selection approaches can then be utilized to select a subset of variables that are associated with the response. However, existing procedures do not provide measures of uncertainty of estimates. Further, the vast majority of existing approaches assume that available graph accurately captures the association among covariates; violations to this assumption could severely hurt the reliability of the resulting estimates. In this article, we present a new inference framework, called the Grace test, which produces coefficient estimates and corresponding p‐values by incorporating the external graph information. We show, both theoretically and via numerical studies, that the proposed method asymptotically controls the type‐I error rate regardless of the choice of the graph. We also show that when the underlying graph is informative, the Grace test is asymptotically more powerful than similar tests that ignore the external information. We study the power properties of the proposed test when the graph is not fully informative and develop a more powerful Grace‐ridge test for such settings. Our numerical studies show that as long as the graph is reasonably informative, the proposed inference procedures deliver improved statistical power over existing methods that ignore external information.

Number of times cited according to CrossRef: 5

  • Prediction of response to anti-cancer drugs becomes robust via network integration of molecular data, Scientific Reports, 10.1038/s41598-019-39019-2, 9, 1, (2019).
  • A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction, Frontiers in Genetics, 10.3389/fgene.2019.00579, 10, (2019).
  • Conditional Regression Based on a Multivariate Zero-Inflated Logistic-Normal Model for Microbiome Relative Abundance Data, Statistics in Biosciences, 10.1007/s12561-018-9219-2, 10, 3, (587-608), (2018).
  • Graph-based sparse linear discriminant analysis for high-dimensional classification, Journal of Multivariate Analysis, 10.1016/j.jmva.2018.12.007, (2018).
  • Prediction of Postpartum Hemorrhage Volume of Pregnant Women Based on GA-SVM Algorithm, ITM Web of Conferences, 10.1051/itmconf/20171101005, 11, (01005), (2017).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.