Papers
Topics
Authors
Recent
Search
2000 character limit reached

Scenario theory for multi-criteria data-driven decision making

Published 1 Apr 2026 in stat.ML, cs.LG, eess.SY, and math.OC | (2604.00553v1)

Abstract: The scenario approach provides a powerful data-driven framework for designing solutions under uncertainty with rigorous probabilistic robustness guarantees. Existing theory, however, primarily addresses assessing robustness with respect to a single appropriateness criterion for the solution based on a dataset, whereas many practical applications - including multi-agent decision problems - require the simultaneous consideration of multiple criteria and the assessment of their robustness based on multiple datasets, one per criterion. This paper develops a general scenario theory for multi-criteria data-driven decision making. A central innovation lies in the collective treatment of the risks associated with violations of individual criteria, which yields substantially more accurate robustness certificates than those derived from a naive application of standard results. In turn, this approach enables a sharper quantification of the robustness level with which all criteria are simultaneously satisfied. The proposed framework applies broadly to multi-criteria data-driven decision problems, providing a principled, scalable, and theoretically grounded methodology for design under uncertainty.

Summary

  • The paper generalizes the scenario approach to rigorously quantify both individual and joint risks in multi-criteria settings using data-driven complexity measures.
  • It introduces collective risk certification methods that sharply reduce conservatism and scale logarithmically with an increasing number of criteria.
  • It provides a-priori risk bounds and efficient computational schemes applicable to robust multi-agent optimization, federated learning, and control under uncertainty.

Scenario Theory for Multi-Criteria Data-Driven Decision Making

Introduction

The paper "Scenario theory for multi-criteria data-driven decision making" (2604.00553) advances the theory and practice of data-driven decision-making under uncertainty in multi-criteria settings. It generalizes the scenario approach, originally focused on single-criterion robustness analysis, to rigorously address the simultaneous satisfaction of multiple appropriateness criteria, each associated with its own dataset. This generalization is critical in modern applications such as multi-agent systems, control under uncertainty, and federated learning, where decentralized data and diverse objectives demand precise quantification of both individual and collective risks.

Problem Formulation and Scenario-Based Decision Scheme

The authors formalize a scenario-based framework where a decision variable z∈Zz \in \mathcal{Z} is to be chosen under uncertainty δ∼P\delta \sim P, with mm appropriateness criteria {Zi(δ)}i=1m\{Z_i(\delta)\}_{i=1}^m and independent datasets {Di}i=1m\{D_i\}_{i=1}^m, each containing NiN_i i.i.d. realizations of uncertainty relevant to criterion ii. A central abstraction is the decision mapping MM which, given the labeled datasets, outputs a chosen design zNz_N. The class of admissible MM is characterized by permutation invariance and two consistency conditions (stability and responsiveness), ensuring that the decision's response to added data matches the scenario approach philosophy.

The framework subsumes robust multi-agent optimization, federated learning, and constrained control problems, encompassing both convex and nonconvex formulations. For instance, robust scenario optimization, which seeks δ∼P\delta \sim P0 such that δ∼P\delta \sim P1 and minimizes a cost δ∼P\delta \sim P2, is a canonical instance of this mapping.

Individual and Joint Risk Quantification

Individual risk with respect to criterion δ∼P\delta \sim P3 is defined as δ∼P\delta \sim P4—the probability that δ∼P\delta \sim P5 fails criterion δ∼P\delta \sim P6 under new uncertainty. Joint risk is δ∼P\delta \sim P7, i.e., the probability of violating at least one criterion.

The primary technical challenge is to estimate, using only the observed datasets, the risks associated with the scenario-based decision δ∼P\delta \sim P8, both individually and jointly, with finite-sample probabilistic guarantees.

Multi-Criteria Complexity and Support Scenarios

A critical concept is the extension of support scenarios and scenario complexity to the multi-criteria context. A scenario is a support scenario if its removal changes the decision outcome. The multi-indexed vector δ∼P\delta \sim P9 counts the number of support scenarios per criterion, encapsulating the combinatorial complexity of the decision.

This observable is central because risk bounds are parameterized by the observed complexity rather than simply the data size, resulting in tighter, data-dependent certificates.

Theoretical Contributions

Independent vs. Collective Certification

The paper precisely characterizes individual risk bounds in two ways:

  • Independent: Applying single-criterion risk bounds to each criterion separately, as in prior extensions (e.g., [falsone_ScenariobasedApproachMultiagent_2020]), yields a "box" region in risk space, but is provably conservative due to ignoring coupling among risks.
  • Collective (Main Contribution): The authors develop a novel dual formulation using generalized moment approaches and the observed complexity vector, allowing a collective characterization of multi-criteria risks. This results in risk regions mm0 that exclude certain "worst-case" risk configurations that are in fact impossible given the joint data and consistency-induced coupling of the scenario mapping.

The dual constraints and region construction allow significant flexibility (allocation strategies for dual variables) and admit efficient computational schemes in certain parameter regimes (notably, under diagonal allocations).

Numerical analyses demonstrate sharply reduced conservatism of the collective risk certificates, especially for the joint (union) risk. For instance, in benchmark cases, the improvement can be over an order of magnitude (see Table 1 in the original text).

Joint Risk Bounds and Scalability

The sub-additivity property mm1 enables straightforward, but loose, joint risk upper bounds. By leveraging the geometry of the collective regions mm2, the paper constructs data-dependent, scenario-complexity-tight joint risk upper bounds. These are shown to be scalable in the number of criteria: as mm3 increases, the collective certificates grow only logarithmically (not linearly), and under certain conditions, the joint risk bound converges to a constant as mm4.

A-priori Certificates

Recognizing the practical necessity of dataset sizing, the authors provide a-priori risk bounds (i.e., before data is collected) given an upper bound mm5 on total scenario complexity. These allow prediction of required sample sizes mm6 per criterion to achieve a target joint risk bound at a prescribed confidence, using (almost explicit) formulas for scenario-rich regimes.

Computational Aspects

While the region-based collective analysis is more complex than the product-box independent approach, the diagonal allocation scheme yields almost explicit hyperbolic region descriptions. The authors provide code (and closed formulas for key cases) for computing upper bounds via root-finding on single-variable (in sample-homogeneous cases) or multi-variable problems.

Implications and Future Directions

Practical implications include:

  • More efficient and less conservative scenario certification for distributed and multi-agent systems, allowing smaller datasets or improved out-of-sample risk profiles.
  • Scalable risk guarantees for federated or large-scale systems with hundreds of criteria.
  • Direct design rules for "scenario-based" control and optimization under multiple risk tolerances.

Theoretical implications center on the deeper understanding of dependence/coupling induced by multi-criteria scenario mappings, extending the role of support complexity as the fundamental observable for information-theoretic scenario quantification. The approach provides a unifying bridge between scenario optimization, statistical learning theory, and robust multi-agent control.

Future work points to the extension of these results to degenerate mappings and nonconvex designs without the non-degeneracy assumption, as started in [garatti_NonconvexScenarioOptimization_2025], and further algorithmic improvements for region computation in the high-dimensional limit.

Conclusion

This work presents a significant advancement in scenario theory for multi-criteria and multi-agent settings by introducing collective risk certification methods based on data-driven complexity observables. The resulting certificates dramatically reduce the conservatism of prior independent approaches, exhibit favorable scalability with the number of criteria, and are readily applicable to practical sizing and certification problems in large-scale data-driven systems. This framework sets the stage for reliable, efficient, and theoretically sound robust design in decentralized and multi-objective decision-making under uncertainty.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We're still in the process of identifying open problems mentioned in this paper. Please check back in a few minutes.

Collections

Sign up for free to add this paper to one or more collections.