Enhancing SMT-based Weighted Model Integration by Structure Awareness (2302.06188v2)
Abstract: The development of efficient exact and approximate algorithms for probabilistic inference is a long-standing goal of artificial intelligence research. Whereas substantial progress has been made in dealing with purely discrete or purely continuous domains, adapting the developed solutions to tackle hybrid domains, characterised by discrete and continuous variables and their relationships, is highly non-trivial. Weighted Model Integration (WMI) recently emerged as a unifying formalism for probabilistic inference in hybrid domains. Despite a considerable amount of recent work, allowing WMI algorithms to scale with the complexity of the hybrid problem is still a challenge. In this paper we highlight some substantial limitations of existing state-of-the-art solutions, and develop an algorithm that combines SMT-based enumeration, an efficient technique in formal verification, with an effective encoding of the problem structure. This allows our algorithm to avoid generating redundant models, resulting in drastic computational savings. Additionally, we show how SMT-based approaches can seamlessly deal with different integration techniques, both exact and approximate, significantly expanding the set of problems that can be tackled by WMI technology. An extensive experimental evaluation on both synthetic and real-world datasets confirms the substantial advantage of the proposed solution over existing alternatives. The application potential of this technology is further showcased on a prototypical task aimed at verifying the fairness of probabilistic programs.
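To make the quantity concrete, the following is a minimal sketch of what a weighted model integral computes on a toy hybrid problem. It is not the algorithm proposed in the paper and uses no SMT solver. The formula, the weight function, and all names (`feasible_interval`, `weight_coeffs`, `toy_wmi`) are hypothetical, chosen only for illustration. The problem has one Boolean variable A and one real variable x, with the formula 0 <= x <= 1 and (A implies x >= 1/2), and weight x when A is true and 1 otherwise.

```python
from fractions import Fraction
from itertools import product


def integrate_poly(coeffs, lo, hi):
    """Exact integral of sum(c_k * x**k) over the interval [lo, hi]."""
    total = Fraction(0)
    for k, c in enumerate(coeffs):
        upper = Fraction(hi) ** (k + 1)
        lower = Fraction(lo) ** (k + 1)
        total += Fraction(c) * (upper - lower) / (k + 1)
    return total


def feasible_interval(a):
    """Region of x satisfying the toy formula for a fixed truth value of A.

    Hypothetical formula: 0 <= x <= 1 and (A -> x >= 1/2).
    """
    if a:
        return Fraction(1, 2), Fraction(1)
    return Fraction(0), Fraction(1)


def weight_coeffs(a):
    """Polynomial weight in x as a coefficient list (hypothetical choice)."""
    return [0, 1] if a else [1]  # w = x if A holds, else w = 1


def toy_wmi():
    """Sum, over Boolean assignments, of the weight integrated over x."""
    total = Fraction(0)
    for (a,) in product([False, True], repeat=1):
        lo, hi = feasible_interval(a)
        total += integrate_poly(weight_coeffs(a), lo, hi)
    return total


if __name__ == "__main__":
    print(toy_wmi())
```

The sketch enumerates every Boolean assignment explicitly, which is exactly what becomes infeasible as problems grow. The abstract's point is that structure-aware SMT-based enumeration avoids generating redundant assignments before the integration step.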