Cayley-Moser Optimal Stopping
- The Cayley-Moser optimal stopping problem is a sequential decision-making model that uses irreversible accept/reject rules based on dynamic thresholds to maximize expected rewards.
- In the discrete model, a recursive threshold (A_m) is derived via dynamic programming and backward induction to efficiently decide when to stop.
- The continuous-time formulation with Poisson arrivals uses an ODE for threshold evaluation, yielding closed-form sale-price and stopping-time distributions under various offer distributions.
The Cayley-Moser Optimal Stopping Problem is a canonical sequential decision-making framework in which a decision-maker observes a finite or infinite sequence of random offers and faces the constraint of irreversible acceptance or rejection at each stage. The goal is to maximize the expected reward by selecting an optimal stopping time. This paradigm, in both discrete- and continuous-time versions, provides a model for settings such as hiring, asset sales, and online search, and stands out due to the full-information regime, with known value distributions, in contrast to the classical “best-choice” or Secretary problem.
1. Classical Cayley-Moser Problem: Model and Solution Structure
In its classical discrete-time formulation, the decision-maker is presented with a fixed number of candidates or offer values , independently sampled from the distribution. The decision-maker sequentially observes each , and must decide to accept (stop) or reject irreversibly. The selected payoff is , with the stopping time. The policy must be adapted, with no recall of rejected offers.
The objective is:
The optimal policy is derived via dynamic programming and backward induction. Letting denote the maximal expected reward with applicants remaining:
The “indifference” threshold or aspiration level at stage is set by : accept if , otherwise continue. The recursion, known as the Cayley–Moser recursion, is:
At step (with to go), the cutoff is . This produces a deterministic threshold sequence and myopic “accept-if-above-cutoff” policy (Demers, 2018).
2. Statistical Properties and Duration Analysis
The distribution of the stopping time (the index of the chosen offer) is explicitly evaluated as:
This formula enables computation of moments and other statistics of the search duration, capturing the process duration as a function of the threshold sequence. The expected stopping time is:
Exact closed forms for general are unavailable, but the recursion allows efficient evaluation (Demers, 2018).
For large , Gilbert–Mosteller’s asymptotic approximation applies:
The left-skewed, triangular distribution of contrasts markedly with the heavy right-tail profile found in the Secretary problem. Numerical values for small illustrate the trend (table below):
| N | ||||
|---|---|---|---|---|
| 10 | 3.81 | 0.381 | 3 | 0.300 |
| 50 | 19.9 | 0.398 | 15 | 0.300 |
| 100 | 33.8 | 0.338 | 29 | 0.290 |
Key observations:
- Expected stopping time scales as .
- Median is around , below the Secretary-problem threshold of .
- Candidates are typically selected much earlier than the final period.
3. Comparison to the Classical Secretary and Sultan’s Dowry Problems
The Cayley–Moser problem differs crucially from the classic Secretary (best-choice) problem. In the latter, only ordinal information is available; the decision-maker observes relative ranks, not values. The optimal threshold is , with expected interviews about . In the Cayley–Moser regime, where full information is exploited, the expected search time is dramatically shorter, as aspiration levels are continually updated to the evolving maximum-a-posteriori estimate of the attainable future reward.
Moreover, the distributional form of the stopping time is approximately triangular for Cayley–Moser, versus a heavy-tailed (geometric-like) distribution for the Secretary problem (Demers, 2018). This demonstrates the effect of full information: not only is the stopping policy more efficient on average, but also the process duration is more concentrated and predictable.
4. Continuous-Time Cayley-Moser with Poissonian Arrivals
A Poissonian arrival model generalizes the Cayley–Moser problem to continuous time, providing analytic tractability and novel insight. Here, offers arrive according to a Poisson process of rate over a known horizon . Each offer is i.i.d. from a known , and upon expiration the seller may settle for a “salvage” value with mean (Katriel, 4 Nov 2025).
The value function , representing the maximum expected sale price at time before deadline and observing offer , satisfies:
with , where is the next arrival time. The optimal policy is accept if .
This induces the Volterra integral equation:
which can be differentiated to the ODE:
where
This ODE can be reduced by quadrature:
implying for
Explicit solutions are attainable for specific distributions:
- Uniform : , .
- Exponential : , .
- Pareto : , .
A key distinction from the discrete case is the analytic tractability of the threshold function , as the continuous-time model leads to a solvable ODE, in contrast to the discrete nonlinear threshold recursion.
5. Distributional Forms in the Continuous-Time Problem
The explicit continuous-time formulation enables closed-form calculation of both the sale-price and stopping-time distributions:
- The sale-price distribution satisfies the ODE:
with solution
- The stopping-time CDF is
The conditional density is given by:
For exponential offers and , on , indicating a uniform distribution of stopping times up to the horizon in this setting.
6. Interpretations, Economic Insights, and Regime Comparisons
In both discrete- and continuous-time models, the threshold function ( or ) is both the reservation price and the conditional expected reward under optimal stopping. For fixed horizon , the threshold is monotonically increasing in available time: with greater time to sell, more selectivity is possible.
In the continuous model, increasing Poisson arrival rate effectively increases the “number of opportunities” , flattening the reservation curve and lowering selectivity for fixed remaining time. This continuous-time model allows for explicit analysis in situations with non-uniform offer distributions, salvage options, and time-dependent opportunity structure (Katriel, 4 Nov 2025).
As , the continuous-time threshold matches the asymptotic of the discrete regime, corroborating the limit behavior across formulations. In both settings, explicit forms for the full distribution of search duration and realized reward are available in the Poissonian case, contrasting with the exclusively asymptotic results obtainable in the original finite discrete process (Katriel, 4 Nov 2025).
A salient insight: compared to the Secretary problem, the Cayley–Moser regime exploits value information for more efficient and earlier selection, both in expectation and in the concentration of process duration. The informativeness of the offers—rather than solely their rank—permits substantially shorter search and higher efficiency in optimal stopping.