Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Expected Shapley-Like Scores of Boolean Functions: Complexity and Applications to Probabilistic Databases (2401.06493v2)

Published 12 Jan 2024 in cs.DB, cs.AI, and cs.CC

Abstract: Shapley values, originating in game theory and increasingly prominent in explainable AI, have been proposed to assess the contribution of facts in query answering over databases, along with other similar power indices such as Banzhaf values. In this work we adapt these Shapley-like scores to probabilistic settings, the objective being to compute their expected value. We show that the computations of expected Shapley values and of the expected values of Boolean functions are interreducible in polynomial time, thus obtaining the same tractability landscape. We investigate the specific tractable case where Boolean functions are represented as deterministic decomposable circuits, designing a polynomial-time algorithm for this setting. We present applications to probabilistic databases through database provenance, and an effective implementation of this algorithm within the ProvSQL system, which experimentally validates its feasibility over a standard benchmark.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (33)
  1. Foundations of Databases. Addison-Wesley.
  2. Banzhaf Values for Facts in Query Answering. arXiv preprint arXiv:2308.05588 (2023).
  3. Antoine Amarilli. 2023. Uniform Reliability for Unbounded Homomorphism-Closed Graph Queries. In ICDT (LIPIcs, Vol. 255). Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 14:1–14:17. https://arxiv.org/abs/2209.11177
  4. Connecting knowledge compilation classes and width parameters. Theory of Computing Systems 64 (2020), 861–914.
  5. The Tractability of SHAP-Score-Based Explanations for Classification over Deterministic and Decomposable Boolean Circuits. In AAAI. AAAI Press, 6670–6678.
  6. On the Complexity of SHAP-Score-Based Explanations: Tractability via Knowledge Compilation and Non-Approximability Results. J. Mach. Learn. Res. 24, 63 (2023), 1–58.
  7. Database Theory. Work in progress, latest version at https://github.com/pdm-book/community.
  8. John F Banzhaf III. 1964. Weighted voting doesn’t work: A mathematical analysis. Rutgers L. Rev. 19 (1964), 317.
  9. The Shapley value in database management. ACM Sigmod Record 52, 2 (2023), 6–17.
  10. Nilesh Dalvi and Dan Suciu. 2013. The dichotomy of probabilistic inference for unions of conjunctive queries. Journal of the ACM (JACM) 59, 6 (2013), 1–87. https://homes.cs.washington.edu/~suciu/jacm-dichotomy.pdf
  11. John Deegan and Edward W Packel. 1978. A new index of power for simple n-person games. International Journal of Game Theory 7 (1978), 113–123.
  12. On the Tractability of SHAP Explanations. In AAAI. AAAI Press, 6505–6513.
  13. Computing the Shapley value of facts in query answering. In SIGMOD Conference. ACM, 1570–1583.
  14. Manfred J Holler and Edward W Packel. 1983. Power, luck and the right index. Zeitschrift für Nationalökonomie 43 (1983), 21–29.
  15. Random generation of combinatorial structures from a uniform distribution. TCS 43 (1986), 169–188.
  16. Ron J Johnston. 1977. National sovereignty and national power in European institutions. Environment and Planning A 9, 5 (1977), 569–577.
  17. Ronald John Johnston. 1978. On the measurement of power: Some reactions to Laver. Environment and Planning A 10, 8 (1978), 907–914.
  18. From Shapley Value to Model Counting and Back. arXiv preprint arXiv:2306.14211 (2023).
  19. Improved feature importance computation for tree models based on the Banzhaf value. In Uncertainty in Artificial Intelligence, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2022, 1-5 August 2022, Eindhoven, The Netherlands (Proceedings of Machine Learning Research, Vol. 180), James Cussens and Kun Zhang (Eds.). PMLR, 969–979. https://proceedings.mlr.press/v180/karczmarz22a.html
  20. Werner Kirsch and Jessica Langner. 2010. Power indices and minimal winning coalitions. Social Choice and Welfare 34, 1 (2010), 33–46. http://www.jstor.org/stable/41108037
  21. Jean-Marie Lagniez and Pierre Marquis. 2017. An Improved Decision-DNNF Compiler. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, Melbourne, Australia, August 19-25, 2017, Carles Sierra (Ed.). ijcai.org, 667–673. https://doi.org/10.24963/IJCAI.2017/93
  22. Annick Laruelle. 1999. On the choice of a power index. Technical Report. Instituto Valenciano de Investigaciones Económicas.
  23. The Shapley value of tuples in query answering. Logical Methods in Computer Science 17 (2021).
  24. The Shapley value of tuples in query answering. In ICDT, Vol. 155. Schloss Dagstuhl, 20:1–20:19. https://arxiv.org/abs/1904.08679
  25. Mikaël Monet. 2020. Solving a Special Case of the Intensional vs Extensional Conjecture in Probabilistic Databases. In Proceedings of PODS. 149–163.
  26. J Scott Provan and Michael O Ball. 1983. The complexity of counting cuts and of computing the probability that a graph is connected. SIAM J. Comput. 12, 4 (1983), 777–788. https://epubs.siam.org/doi/abs/10.1137/0212053
  27. The impact of negation on the complexity of the Shapley value in conjunctive queries. In Proceedings of PODS. 285–297. https://arxiv.org/abs/1912.12610
  28. Pierre Senellart. 2017. Provenance and Probabilities in Relational Databases: From Theory to Practice. SIGMOD Record 46, 4 (Dec. 2017).
  29. ProvSQL: Provenance and Probability Management in PostgreSQL. Proc. VLDB Endow. 11, 12 (2018), 2034–2037. http://www.vldb.org/pvldb/vol11/p2034-senellart.pdf
  30. Lloyd S Shapley et al. 1953. A value for n-person games. (1953).
  31. Probabilistic Databases. Morgan & Claypool Publishers.
  32. G Tseitin. 1968. On the complexity of derivation in propositional calculus. Studies in Constrained Mathematics and Mathematical Logic (1968).
  33. On the tractability of SHAP explanations. Journal of Artificial Intelligence Research 74 (2022), 851–886.
Citations (2)

Summary

We haven't generated a summary for this paper yet.