Structural perspective on constraint-based learning of Markov networks (2403.08562v1)
Abstract: Markov networks are probabilistic graphical models that employ undirected graphs to depict conditional independence relationships among variables. Our focus lies in constraint-based structure learning, which entails learning the undirected graph from data through the execution of conditional independence tests. We establish theoretical limits concerning two critical aspects of constraint-based learning of Markov networks: the number of tests and the sizes of the conditioning sets. These bounds uncover an exciting interplay between the structural properties of the graph and the amount of tests required to learn a Markov network. The starting point of our work is that the graph parameter maximum pairwise connectivity, $\kappa$, that is, the maximum number of vertex-disjoint paths connecting a pair of vertices in the graph, is responsible for the sizes of independence tests required to learn the graph. On one hand, we show that at least one test with the size of the conditioning set at least $\kappa$ is always necessary. On the other hand, we prove that any graph can be learned by performing tests of size at most $\kappa$. This completely resolves the question of the minimum size of conditioning sets required to learn the graph. When it comes to the number of tests, our upper bound on the sizes of conditioning sets implies that every $n$-vertex graph can be learned by at most $n{\kappa}$ tests with conditioning sets of sizes at most $\kappa$. We show that for any upper bound $q$ on the sizes of the conditioning sets, there exist graphs with $O(n q)$ vertices that require at least $n{\Omega(\kappa)}$ tests to learn. This lower bound holds even when the treewidth and the maximum degree of the graph are at most $\kappa+2$. On the positive side, we prove that every graph of bounded treewidth can be learned by a polynomial number of tests with conditioning sets of sizes at most $2\kappa$.
- Learning factor graphs in polynomial time & sample complexity. In Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005), 2005.
- Some variations on the PC algorithm. In Milan Studený and Jirí Vomlel, editors, Third European Workshop on Probabilistic Graphical Models, 12-15 September 2006, Prague, Czech Republic. Electronic Proceedings, pages 1–8, 2006. URL http://www.utia.cas.cz/files/mtr/pgm06/41_paper.pdf.
- Complexity of finding embeddings in a k𝑘kitalic_k-tree. SIAM J. Alg. Disc. Meth., 8:277–284, 1987.
- Hans L. Bodlaender. Necessary edges in k-chordalisations of graphs. J. Comb. Optim., 7(3):283–290, 2003. doi: 10.1023/A:1027320705349. URL https://doi.org/10.1023/A:1027320705349.
- Efficient principled learning of thin junction trees. In Advances in Neural Information Processing Systems (NIPS), 2007.
- Approximating discrete probability distributions with dependence trees. IEEE Transactions on Information Theory, 14(3):462–467, 1968.
- The dual pc algorithm for structure learning. In Antonio Salmerón and Rumí Rafael, editors, Proceedings of The 11th International Conference on Probabilistic Graphical Models, volume 186 of Proceedings of Machine Learning Research, pages 301–312. PMLR, 2022. URL https://proceedings.mlr.press/v186/giudice22a.html.
- Learning Markov networks: Maximum bounded tree-width graphs. In SODA, 2001.
- Probabilistic Graphical Models: Principles and Techniques. Adaptive computation and machine learning. MIT Press, 2009. ISBN 9780262013192. URL https://books.google.co.in/books?id=7dzpHCHzNQ4C.
- PAC-learning bounded tree-width graphical models. In UAI’04: Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence, pages 410–417, 2004.
- Graph minors. II. Algorithmic aspects of tree-width. J. Algorithms, 7(3):309–322, 1986. doi: 10.1016/0196-6774(86)90023-4. URL https://doi.org/10.1016/0196-6774(86)90023-4.
- Causation, Prediction, and Search, Second Edition. Adaptive computation and machine learning. MIT Press, 2000. ISBN 978-0-262-19440-2.
- Learning bayesian networks with cops and robbers. In Manfred Jaeger and Thomas Dyhre Nielsen, editors, Proceedings of the 10th International Conference on Probabilistic Graphical Models, volume 138 of Proceedings of Machine Learning Research, pages 473–484. PMLR, 23–25 Sep 2020. URL https://proceedings.mlr.press/v138/talvitie20a.html.