A significance test for graph‐constrained estimation
Summary
Graph‐constrained estimation methods encourage similarities among neighboring covariates presented as nodes of a graph, and can result in more accurate estimates, especially in high‐dimensional settings. Variable selection approaches can then be utilized to select a subset of variables that are associated with the response. However, existing procedures do not provide measures of uncertainty of estimates. Further, the vast majority of existing approaches assume that available graph accurately captures the association among covariates; violations to this assumption could severely hurt the reliability of the resulting estimates. In this article, we present a new inference framework, called the Grace test, which produces coefficient estimates and corresponding p‐values by incorporating the external graph information. We show, both theoretically and via numerical studies, that the proposed method asymptotically controls the type‐I error rate regardless of the choice of the graph. We also show that when the underlying graph is informative, the Grace test is asymptotically more powerful than similar tests that ignore the external information. We study the power properties of the proposed test when the graph is not fully informative and develop a more powerful Grace‐ridge test for such settings. Our numerical studies show that as long as the graph is reasonably informative, the proposed inference procedures deliver improved statistical power over existing methods that ignore external information.
Citing Literature
Number of times cited according to CrossRef: 5
- Marcela Franco, Ashwini Jeggari, Sylvain Peuget, Franziska Böttger, Galina Selivanova, Andrey Alexeyenko, Prediction of response to anti-cancer drugs becomes robust via network integration of molecular data, Scientific Reports, 10.1038/s41598-019-39019-2, 9, 1, (2019).
- Yi-Hui Zhou, Paul Gallins, A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction, Frontiers in Genetics, 10.3389/fgene.2019.00579, 10, (2019).
- Zhigang Li, Katherine Lee, Margaret R. Karagas, Juliette C. Madan, Anne G. Hoen, A. James O’Malley, Hongzhe Li, Conditional Regression Based on a Multivariate Zero-Inflated Logistic-Normal Model for Microbiome Relative Abundance Data, Statistics in Biosciences, 10.1007/s12561-018-9219-2, 10, 3, (587-608), (2018).
- Jianyu Liu, Guan Yu, Yufeng Liu, Graph-based sparse linear discriminant analysis for high-dimensional classification, Journal of Multivariate Analysis, 10.1016/j.jmva.2018.12.007, (2018).
- Ren-Jun Shuai, Yang He, Ping Chen, Prediction of Postpartum Hemorrhage Volume of Pregnant Women Based on GA-SVM Algorithm, ITM Web of Conferences, 10.1051/itmconf/20171101005, 11, (01005), (2017).




