Papers
Topics
Authors
Recent
Search
2000 character limit reached

Empirical Decision Problems: An Observables Framework

Updated 4 July 2026
  • Empirical Decision Problems are decision models that replace explicit states with observed act–consequence protocols to infer choice functions directly from behavioral data.
  • They establish consistency and convergence through empirical analogs of classical optimality, using methods like VC bounds and permutation tests to ensure robust statistical inference.
  • The framework extends classical decision theory by linking normative criteria with observable outcomes, enabling applications in dynamic programming, empirical Bayes, and generative AI prompt analysis.

Searching arXiv for papers on empirical decision problems and closely related frameworks. Empirical decision problems are decision models in which the primitive data are not states of the world, lotteries over consequences, or a fully specified consequence function, but a protocol of observed act–consequence pairs. In the formulation of Jansen et al., an empirical decision problem (EDP) is the pair (A,π)(\mathcal A,\pi), where A\mathcal A is a finite set of action descriptions and π\pi is a finite protocol recording observed pairs of actions and consequences; the framework “completely overcomes the need to specify the states of the world explicitly” and studies optimality through empirical choice functions defined directly on subprotocols (Jansen et al., 5 Dec 2025). Within this line of work, the central technical questions are how to define empirical analogs of optimality, when empirical choice functions consistently recover population choice sets, how such choice rules can be statistically tested, and how robustness can be preserved when i.i.d. assumptions are relaxed.

1. Formal definition and primitives

The basic ingredients of an EDP are deliberately sparse. One fixes a finite set A\mathcal A of action descriptions and a possibly infinite set C\mathcal C of observable consequences. For a vector z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n, a z\mathbf z-protocol is a finite sequence

π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,

in which exactly zkz_k of the pairs have the act akAa_k\in\mathcal A. The set of subprotocols using only a subset A\mathcal A0 is denoted by A\mathcal A1. An empirical decision problem is then simply the pair A\mathcal A2 (Jansen et al., 5 Dec 2025).

The corresponding decision object is an empirical choice function (ECF). Writing A\mathcal A3, an ECF is any map

A\mathcal A4

such that for every subprotocol A\mathcal A5,

A\mathcal A6

Thus the output of the choice rule is always a subset of the actions actually represented in the relevant subprotocol (Jansen et al., 5 Dec 2025).

This formalization is intentionally different from classical setups in which states, consequences, and act mappings are specified ex ante. In an EDP, the observable protocol itself is primitive. A plausible implication is that the framework is designed for settings in which the modeler is unwilling or unable to commit to a “closed world” description before observing behavior or outcomes.

2. Empirical optimality without explicit states

The EDP literature introduces empirical analogs of classical optimality by identifying each action description A\mathcal A7 with a latent random variable A\mathcal A8 taking values in A\mathcal A9. A population choice function π\pi0 is fixed by a binary criterion function π\pi1, and the exclusion of an action from a choice set is expressed pairwise: π\pi2 The empirical analogue replaces the latent distributions by empirical distributions π\pi3 computed from the protocol (Jansen et al., 5 Dec 2025).

Under the stated regularity conditions, the empirical choice function is a consistent approximator of the population choice set. The required assumptions are that π\pi4 depends on distributions only and is uniformly continuous in a pseudometric π\pi5, that the class of sets π\pi6 has finite VC-dimension, and that one adds a regularization term of order π\pi7. The resulting rule excludes π\pi8 whenever there exists π\pi9 such that

A\mathcal A0

For each fixed A\mathcal A1, as A\mathcal A2, one then has almost sure convergence

A\mathcal A3

This is the core consistency theorem of the framework (Jansen et al., 5 Dec 2025).

The same work also establishes basic existence results. Under very mild conditions, including transitivity and antisymmetry of A\mathcal A4, ECFs always exist; examples include selecting the protocol’s empirical expected-utility maximizers or its empirical first-order-stochastic-dominance-undominated options (Jansen et al., 5 Dec 2025). This matters because the theory is not restricted to one specific optimality criterion. Instead, it defines a statistical interface through which different normative criteria can be estimated from observed act–consequence data.

3. Statistical testing, convergence rates, and robustness

The inferential program around EDPs has three components: consistent statistical estimation of choice sets, consistent statistical testing of choice functions with robustness guarantees, and direct inference for empirical choice functions using credal sets (Jansen et al., 5 Dec 2025).

For estimation, convergence rates are driven by VC bounds. For each pair A\mathcal A5 and each A\mathcal A6,

A\mathcal A7

where A\mathcal A8 is the VC-dimension and A\mathcal A9. The rate statement is important because it links empirical decision problems to standard empirical-process machinery rather than treating protocol-based choice as purely descriptive (Jansen et al., 5 Dec 2025).

For testing, the null and alternative are formulated directly in terms of membership in the population choice set: C\mathcal C0 Under pairwise ordering, the problem decomposes into two-sample tests. A permutation or bootstrap test is built on the statistic

C\mathcal C1

Under exchangeability it has exact level C\mathcal C2, and under the paper’s Assumptions 1–4—including i.i.d. sampling, pairwise ordering, inclusion of the null “C\mathcal C3” in distributional equivalence, and a least-favorable-distribution structure—it remains asymptotically valid for the one-sided null C\mathcal C4 and is consistent against the directed alternative (Jansen et al., 5 Dec 2025).

Robustness is handled through contamination models. For each action block, one forms the credal set

C\mathcal C5

Replacing empirical distributions in the test statistic by worst-case elements of these sets yields a robust permutation test that remains asymptotically level C\mathcal C6 when each protocol block lies in a C\mathcal C7-contamination model around an i.i.d. sample (Jansen et al., 5 Dec 2025).

These three layers give EDPs an unusual profile: the framework is state-free at the level of primitives, but statistically disciplined at the level of inference.

4. Relation to classical decision theory and identification

Empirical decision problems are best understood against adjacent attempts to relax the classical requirement that a decision problem be specified through an explicit state space. In constructive decision theory, the primitive objects of choice are syntactic programs formed from tests and actions, such as conditional expressions of the form “if C\mathcal C8 then C\mathcal C9 else z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n0.” Blume, Easley, and Halpern show that if preferences over such programs satisfy axioms including statewise cancellation and graph-closedness, then there exist a state space z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n1, an outcome space z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n2, an interpretation of tests and choices, a probability on z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n3, and a utility function on z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n4 yielding an SEU representation; crucially, states and outcomes are not part of the initial description of the problem (0906.4316). EDPs share this refusal to take states as primitive, but they replace preference axioms on syntactic programs with inference from observed act–consequence protocols.

A second neighboring line studies what can be recovered from rankings of information structures. Strzalecki and Stewart define a static decision problem as a triple z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n5 with finite z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n6, action set z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n7, bounded von Neumann–Morgenstern utility z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n8, and value-of-belief function

z=(z1,,zn)Nn\mathbf z=(z_1,\dots,z_n)\in\mathbb N^n9

They show that finitely many ordinal comparisons of experiment values identify the finite set of undominated actions, up to relabeling and duplication, together with the partition of the belief simplex on which each action is optimal; finitely many additional cardinal comparisons identify the piecewise-affine value function z\mathbf z0 up to an action-independent payoff (Whitmeyer, 2024). This suggests a complementary perspective: one can sometimes reconstruct the “shape” of a hidden decision problem from finite empirical comparisons even when the underlying utility description is not directly observed.

A third adjacent literature concerns identification problems under missing data. Dominitz and Manski formulate empirical decision making through a choice set z\mathbf z1, states of nature z\mathbf z2, welfare function z\mathbf z3, and identified set z\mathbf z4 of states observationally equivalent to observed data z\mathbf z5. They emphasize maximin and minimax-regret criteria when z\mathbf z6 remains large, and show that for bounded outcomes z\mathbf z7 the agnostic identification region for z\mathbf z8 has width equal to the non-response fraction z\mathbf z9 unless additional assumptions such as MAR are imposed (Dominitz et al., 15 Sep 2025). Relative to that framework, EDPs move even further toward observables-only primitives, but the shared concern with irreducible ambiguity and robust inference is evident.

5. Applications and computational realizations

The proof-of-concept application in the EDP paper compares prompting strategies in a generative-AI setting. The actions are three prompts: π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,0 “Explain GO in exactly 20 words.”, π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,1 “Explain GO in exactly 20 words, please.”, and π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,2 “Tell me what GO is in exactly 20 words. No more, no less. Just do it.” The protocol uses π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,3, π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,4, and π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,5 prompt repetitions. Each response is evaluated by Perplexity (lower is better) and Coherence (higher is better), and π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,6 is taken as the class of all pairwise-monotone functions on π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,7. Under empirical FSD, all three prompts are pairwise incomparable, so

π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,8

At level π=((ai1,ci1),,(aiN,ciN)),N=k=1nzk,\pi=\bigl((a_{i_1},c_{i_1}),\dots,(a_{i_N},c_{i_N})\bigr), \qquad N=\sum_{k=1}^n z_k,9, permutation tests for zkz_k0 against zkz_k1 and zkz_k2 produce approximate p-values zkz_k3 and zkz_k4, and robustness analysis indicates that the decision to retain zkz_k5 remains significant up to roughly zkz_k6 contamination (Jansen et al., 5 Dec 2025).

Beyond protocol-based EDPs, several neighboring literatures instantiate empirical decision making in more structured domains. In finite statistical decision problems with a finite family zkz_k7 of decision rules and risk zkz_k8, the minimax problem

zkz_k9

is a convex program over the simplex, and mirror subgradient descent with negative-entropy mirror map—equivalently, the Hedge algorithm—returns an akAa_k\in\mathcal A0-minimax mixed rule in akAa_k\in\mathcal A1 oracle calls (Fernández et al., 9 Sep 2025).

In compound decision problems and empirical Bayes, one observes akAa_k\in\mathcal A2 and evaluates separable rules through the compound risk

akAa_k\in\mathcal A3

The empirical-Bayes program estimates the unknown mixing distribution akAa_k\in\mathcal A4 by NPMLE and plugs the estimate into the Bayes rule; in the Kiefer–Wolfowitz formulation, the NPMLE can be taken to be discrete with at most akAa_k\in\mathcal A5 atoms (Koenker et al., 2024). Related regret analyses show sharp lower bounds for normal and Poisson models and establish, for example, that in the Poisson case the optimal regret scales as akAa_k\in\mathcal A6 for compact support and akAa_k\in\mathcal A7 for subexponential priors, both attained by Robbins’s estimator (Polyanskiy et al., 2021).

In dynamic settings, empirical dynamic programming replaces exact expectations in the Bellman operator by sample averages. For a discounted MDP, the empirical Bellman operator

akAa_k\in\mathcal A8

defines empirical value iteration and empirical policy iteration, and convergence is analyzed through probabilistic fixed points and a stochastic-dominance argument on discretized error bins (Haskell et al., 2013). In logic-based settings, probabilistic answer set programming under credal semantics encodes decision atoms, probabilistic facts, and utility attributes, and Azzolini et al. solve for maximin and maximax strategies via a three-layer Algebraic Model Counting construction compiled into an X/D-first sd-DNNF circuit (Azzolini et al., 2024).

6. Limits, misconceptions, and open directions

A common misconception is that abandoning explicit states removes the need for probabilistic assumptions. The current EDP theory does not support that claim. Its inferential guarantees rely on randomness conditions such as exchangeability or i.i.d. sampling; truly adversarial or deterministic protocols lie outside the scope of the present results, and learning dependencies beyond small-contamination models remains an open research direction (Jansen et al., 5 Dec 2025).

A second misconception is that larger datasets or more powerful ML systems automatically dissolve empirical ambiguity. The missing-data literature makes the opposite point: identification problems may not diminish as sample size grows, because the identified set akAa_k\in\mathcal A9 can remain large even with infinite data. In that setting, standard ML defaults such as case deletion, simple imputation, and model-based imputation implicitly rely on MCAR- or MAR-type assumptions rather than resolving the underlying identification problem (Dominitz et al., 15 Sep 2025). This suggests that EDPs should be viewed not as a rejection of rigor, but as part of a broader effort to align decision analysis with what is actually observable and inferentially defensible.

The remaining open directions are already explicit in the current literature. For EDPs, extending direct-inference empirical choice functions to more general forms of imprecision, including belief functions and intervals, is presented as a promising avenue; for adjacent empirical decision frameworks, open issues include handling non-i.i.d. dependence, sharpening constants and computational guarantees, and integrating robust decision criteria with richer empirical protocols (Jansen et al., 5 Dec 2025). The unifying theme is that empirical decision problems replace idealized model primitives with observable structure, while still demanding formal guarantees about consistency, testing, and robustness.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Empirical Decision Problems.