Hoeffding decomposition of black-box models with dependent inputs (2310.06567v3)
Abstract: Performing an additive decomposition of arbitrary functions of random elements is paramount for global sensitivity analysis and, therefore, the interpretation of black-box models. The well-known seminal work of Hoeffding characterized the summands in such a decomposition in the particular case of mutually independent inputs. Going beyond the framework of independent inputs has been an ongoing challenge in the literature. Existing solutions have so far required constraining assumptions or suffer from a lack of interpretability. In this paper, we generalize Hoeffding's decomposition for dependent inputs under very mild conditions. For that purpose, we propose a novel framework to handle dependencies based on probability theory, functional analysis, and combinatorics. It allows for characterizing two reasonable assumptions on the dependence structure of the inputs: non-perfect functional dependence and non-degenerate stochastic dependence. We then show that any square-integrable, real-valued function of random elements respecting these two assumptions can be uniquely additively decomposed and offer a characterization of the summands using oblique projections. We then introduce and discuss the theoretical properties and practical benefits of the sensitivity indices that ensue from this decomposition. Finally, the decomposition is analytically illustrated on bivariate functions of Bernoulli inputs.
- An invitation to operator theory. Number v. 50 in Graduate studies in mathematics. American Mathematical Society, Providence, R.I, 2002.
- S. N. Afriat. Orthogonal and oblique projectors and the characteristics of pairs of vector spaces. Mathematical Proceedings of the Cambridge Philosophical Society, 53(4):800–816, 1957.
- S. Axler. Linear Algebra Done Right. Undergraduate Texts in Mathematics. Springer International Publishing, Cham, 2015.
- Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58:82–115, 2020.
- W. Bryc. Conditional expectation with respect to dependent sigma-fields. In Proceedings of VII conference on Probability Theory, pages 409–411, 1984.
- W. Bryc. Conditional Moment Representations for Dependent Random Variables. Electronic Journal of Probability, 1:1–14, 1996. Publisher: Institute of Mathematical Statistics and Bernoulli Society.
- Generalized Hoeffding-Sobol decomposition for dependent variables - application to sensitivity analysis. Electronic Journal of Statistics, 6:2420–2448, 2012.
- J.B. Conway. A Course in Functional Analysis, volume 96 of Graduate Texts in Mathematics. Springer, New York, NY, 2007.
- A classification of projectors. In Topological Algebras, their Applications, and Related Topics, pages 145–160, Bedlewo, Poland, 2005. Institute of Mathematics Polish Academy of Sciences.
- Basics and Trends in Sensitivity Analysis: Theory and Practice in R. Society for Industrial and Applied Mathematics, Philadelphia, PA, 2021.
- J. Dauxois and G. M. Nkiet. Canonical analysis of two euclidean subspaces and its applications. Linear Algebra and its Applications, 264:355–388, 1997.
- Canonical analysis relative to a closed subspace. Linear Algebra and its Applications, 388:119–145, 2004.
- Introduction to Lattices and Order. Cambridge University Press, Cambridge, 2 edition, 2002.
- Uncertainty in Industrial Practice. John Wiley & Sons, Ltd, Chichester, UK, 2008.
- Remarks on the Maximum Correlation Coefficient. Bernoulli, 7(2):343–350, 2001.
- F. Deutsch. The Angle Between Subspaces of a Hilbert Space. In S. P. Singh, editor, Approximation Theory, Wavelets and Applications, NATO Science Series, pages 107–130. Springer Netherlands, Dordrecht, 1995.
- J. Dixmier. étude sur les variétés et les opérateurs de julia, avec quelques applications. Bulletin de la Société Mathématique de France, 77:11–101, 1949.
- P. Doukhan. Mixing, volume 85 of Lecture Notes in Statistics. Springer, New York, NY, 1994.
- I. Feshchenko. When is the sum of closed subspaces of a Hilbert space closed?, 2020. https://arxiv.org/abs/2012.08688.
- I. S. Feshchenko. On closeness of the sum of n subspaces of a Hilbert space. Ukrainian Mathematical Journal, 63(10):1566–1622, 2012.
- K. Friedrichs. On Certain Inequalities and Characteristic Value Problems for Analytic Functions and For Functions of Two Variables. Transactions of the American Mathematical Society, 41(3):321–364, 1937. Publisher: American Mathematical Society.
- A. Galántai. Projectors and Projection Methods. Springer US, Boston, MA, 2004.
- H. Gebelein. Das statistische Problem der Korrelation als Variations- und Eigenwertproblem und sein Zusammenhang mit der Ausgleichsrechnung. ZAMM - Zeitschrift für Angewandte Mathematik und Mechanik, 21(6):364–379, 1941.
- Handbook of uncertainty quantification. Springer Reference. Springer Nature, Cham, Switzerland, 2017.
- J. Hart and P. A. Gremaud. An approximation theoretic perspective of Sobol’ indices with dependent variables. International Journal for Uncertainty Quantification, 8(6), 2018.
- W. Hoeffding. A Class of Statistics with Asymptotically Normal Distribution. The Annals of Mathematical Statistics, 19(3):293–325, 1948.
- G. Hooker. Generalized Functional ANOVA Diagnostics for High-Dimensional Functions of Dependent Variables. Journal of Computational and Graphical Statistics, 16(3):709–732, 2007.
- On the coalitional decomposition of parameters of interest. Comptes Rendus. Mathématique, 361:1653–1662, 2023.
- H. Joe. Multivariate Models and Multivariate Dependence Concepts. Chapman and Hall/CRC, New York, 1997.
- O. Kallenberg. Foundations of Modern Probability, volume 99 of Probability Theory and Stochastic Modelling. Springer International Publishing, Cham, 2021.
- R. A. Koyak. On Measuring Internal Dependence in a Set of Random Variables. The Annals of Statistics, 15(3):1215–1228, 1987.
- Combinatorics: the Rota way. Cambridge University Press, New York, 2012. OCLC: 1226672593.
- On decompositions of multivariate functions. Mathematics of Computation, 79(270):953–966, 2009.
- S.L. Lauritzen. Graphical Models. Oxford Statistical Science Series. Oxford University Press, Oxford, New York, 1996.
- R. Lebrun and A. Dutfoy. Do Rosenblatt and Nataf isoprobabilistic transformations really differ? Probabilistic Engineering Mechanics, 24(4):577–584, 2009.
- R. Lebrun and A. Dutfoy. A generalization of the Nataf transformation to distributions with elliptical copula. Probabilistic Engineering Mechanics, 24(2):172–178, 2009.
- Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses. The Annals of Statistics, 41(6), 2013.
- A Unified Approach to Interpreting Model Predictions. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017.
- P. Malliavin. Integration and Probability, volume 157 of Graduate Texts in Mathematics. Springer, New York, NY, 1995.
- Non-parametric methods for global sensitivity analysis of model output with dependent inputs. Environmental Modelling & Software, 72:173–183, 2015.
- Sympy: symbolic computing in python. PeerJ Computer Science, 3:e103, 2017.
- Giovanni Peccati. Hoeffding-ANOVA decompositions for symmetric statistics of exchangeable observations. The Annals of Probability, 32(3):1796–1829, 2004. Publisher: Institute of Mathematical Statistics.
- J. Pelamatti and V. Chabridon. Sensitivity Analysis in the Presence of Hierarchical Variables. In Programme and abstracts of the 23th Annual Conference of the European Network for Business and Industrial Statistics (ENBIS), volume 1, page 84, Valencia, 2023. Department of Applied Statistics and Operational Research, and Quality, Universitat Politecnica de Valencia.
- H. Rabitz and O. Aliş. General foundations of high‐dimensional model representations. Journal of Mathematical Chemistry, 25(2):197–233, 1999.
- A note on topological direct sum of subspaces. Funct. Anal. Approx. Comput, 10(1):9–20, 2018.
- The Future of Sensitivity Analysis: An essential discipline for systems modeling and policy support. Environmental Modelling & Software, 137:104954, 2021.
- A. Rényi. On measures of dependence. Acta Mathematica Academiae Scientiarum Hungarica, 10(3):441–451, 1959.
- G-C. Rota. On the foundations of combinatorial theory I. Theory of Möbius Functions. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, 2(4):340–368, 1964.
- W. Rudin. Functional analysis. International series in pure and applied mathematics. McGraw-Hill, Boston, Mass., 2. ed., [nachdr.] edition, 1996.
- L. S. Shapley. Notes on the n-Person Game – II: The Value of an n-Person Game. Researach Memorandum ATI 210720, RAND Corporation, Santa Monica, California, 1951.
- Z. Sidák. On Relations Between Strict-Sense and Wide-Sense Conditional Expectations. Theory of Probability & Its Applications, 2(2):267–272, 1957.
- I.M Sobol’. Global sensitivity indices for nonlinear mathematical models and their monte carlo estimates. Mathematics and Computers in Simulation, 55(1):271–280, 2001.
- C. J. Stone. The Use of Polynomial Splines and Their Tensor Products in Multivariate Function Estimation. The Annals of Statistics, 22(1):118–171, 1994. Publisher: Institute of Mathematical Statistics.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.