Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

False Discovery Rate Control for Gaussian Graphical Models via Neighborhood Screening (2401.09979v1)

Published 18 Jan 2024 in stat.ML and cs.LG

Abstract: Gaussian graphical models emerge in a wide range of fields. They model the statistical relationships between variables as a graph, where an edge between two variables indicates conditional dependence. Unfortunately, well-established estimators, such as the graphical lasso or neighborhood selection, are known to be susceptible to a high prevalence of false edge detections. False detections may encourage inaccurate or even incorrect scientific interpretations, with major implications in applications, such as biomedicine or healthcare. In this paper, we introduce a nodewise variable selection approach to graph learning and provably control the false discovery rate of the selected edge set at a self-estimated level. A novel fusion method of the individual neighborhoods outputs an undirected graph estimate. The proposed method is parameter-free and does not require tuning by the user. Benchmarks against competing false discovery rate controlling methods in numerical experiments considering different graph topologies show a significant gain in performance.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (30)
  1. Vincent Bessonneau et al., “Gaussian graphical modeling of the serum exposome and metabolome reveals interactions between environmental chemicals and endogenous metabolites,” Scientific Reports, vol. 11, no. 1, Apr. 2021.
  2. Khalid Iqbal et al., “Gaussian graphical models identify networks of dietary intake in a german adult population,” The Journal of Nutrition, vol. 146, no. 3, pp. 646–652, Mar. 2016.
  3. “A gaussian graphical model approach to climate networks,” Chaos: An Interdisciplinary Journal of Nonlinear Science, vol. 24, no. 2, Apr. 2014.
  4. Agnes Norbury et al., “Social media and smartphone app use predicts maintenance of physical activity during covid-19 enforced isolation in psychiatric outpatients,” Molecular Psychiatry, vol. 26, no. 8, pp. 3920–3930, Dec. 2020.
  5. Nitin Bhushan et al., “Using a gaussian graphical model to explore relationships between items and variables in environmental psychology research,” Frontiers in Psychology, vol. 10, May 2019.
  6. “Graph signal processing: Overview, challenges, and applications,” Proceedings of the IEEE, vol. 106, no. 5, pp. 808–828, 2018.
  7. “Connecting the dots: Identifying network structure via graph signal processing,” IEEE Signal Processing Magazine, vol. 36, no. 3, pp. 16–43, 2019.
  8. “Graph learning under sparsity priors,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, pp. 6523–6527.
  9. Jonathan Mei and José M. F. Moura, “Signal processing on graphs: Performance of graph structure estimation,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, pp. 6165–6169.
  10. “Topology identification and learning over graphs: Accounting for nonlinearities and dynamics,” Proceedings of the IEEE, vol. 106, no. 5, pp. 787–807, 2018.
  11. “Modelling and studying the effect of graph errors in graph signal processing,” Signal Processing, vol. 189, pp. 108256, Dec. 2021.
  12. “The impacts of network topology on disease spread,” Ecological Complexity, vol. 2, no. 3, pp. 287–299, Sept. 2005.
  13. “Overcoming the effects of false positives and threshold bias in graph theoretical analyses of neuroimaging data,” NeuroImage, vol. 118, pp. 313–333, Sept. 2015.
  14. M. Drton and M. D. Perlman, “Model selection for gaussian concentration graphs,” Biometrika, vol. 91, no. 3, pp. 591–602, Sept. 2004.
  15. Ming Yuan and Yi Lin, “Model selection and estimation in the Gaussian graphical model,” Biometrika, vol. 94, no. 1, pp. 19–35, 03 2007.
  16. “Sparse inverse covariance estimation with the graphical lasso,” Biostatistics, vol. 9, no. 3, pp. 432–441, 12 2007.
  17. “High-dimensional graphs and variable selection with the lasso,” The Annals of Statistics, vol. 34, no. 3, pp. 1436–1462, 2006.
  18. “Controlling the false discovery rate: A practical and powerful approach to multiple testing,” Journal of the Royal Statistical Society. Series B (Methodological), vol. 57, no. 1, pp. 289–300, 1995.
  19. Weidong Liu, “Gaussian graphical model estimation with false discovery rate control,” The Annals of Statistics, vol. 41, no. 6, pp. 2948 – 2978, 2013.
  20. “GGM Knockoff Filter: False Discovery Rate Control for Gaussian Graphical Models,” Journal of the Royal Statistical Society Series B: Statistical Methodology, vol. 83, no. 3, pp. 534–558, 07 2021.
  21. “False discovery rates in biological networks,” in Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, Arindam Banerjee and Kenji Fukumizu, Eds. 13–15 Apr 2021, vol. 130 of Proceedings of Machine Learning Research, pp. 163–171, PMLR.
  22. “Controlling the false discovery rate in modeling brain functional connectivity,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, pp. 2105–2108.
  23. “The terminating-random experiments selector: Fast high-dimensional variable selection with false discovery rate control,” 2022, https://doi.org/10.48550/arXiv.2110.06048.
  24. “False discovery rate control for fast screening of large-scale genomics biobanks,” in IEEE Statistical Signal Processing Workshop (SSP), 2023, pp. 666–670.
  25. TRexSelector: T-Rex Selector: High-Dimensional Variable Selection & FDR Control, 2022, R package version 0.0.1.
  26. “Emergence of scaling in random networks,” Science, vol. 286, no. 5439, pp. 509–512, Oct. 1999.
  27. “Probability models for degree distributions of protein interaction networks,” Europhysics Letters (EPL), vol. 71, no. 1, pp. 152–158, July 2005.
  28. “Structural properties of the caenorhabditis elegans neuronal network,” PLoS Computational Biology, vol. 7, no. 2, pp. e1001066, Feb. 2011.
  29. “Collective dynamics of ‘small-world’ networks,” Nature, vol. 393, no. 6684, pp. 440–442, June 1998.
  30. M. E. J. Newman, “Models of the small world,” Journal of Statistical Physics, vol. 101, no. 3/4, pp. 819–841, 2000.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com