Papers
Topics
Authors
Recent
Search
2000 character limit reached

Cayley-Moser Optimal Stopping

Updated 9 November 2025
  • The Cayley-Moser optimal stopping problem is a sequential decision-making model that uses irreversible accept/reject rules based on dynamic thresholds to maximize expected rewards.
  • In the discrete model, a recursive threshold (A_m) is derived via dynamic programming and backward induction to efficiently decide when to stop.
  • The continuous-time formulation with Poisson arrivals uses an ODE for threshold evaluation, yielding closed-form sale-price and stopping-time distributions under various offer distributions.

The Cayley-Moser Optimal Stopping Problem is a canonical sequential decision-making framework in which a decision-maker observes a finite or infinite sequence of random offers and faces the constraint of irreversible acceptance or rejection at each stage. The goal is to maximize the expected reward by selecting an optimal stopping time. This paradigm, in both discrete- and continuous-time versions, provides a model for settings such as hiring, asset sales, and online search, and stands out due to the full-information regime, with known value distributions, in contrast to the classical “best-choice” or Secretary problem.

1. Classical Cayley-Moser Problem: Model and Solution Structure

In its classical discrete-time formulation, the decision-maker is presented with a fixed number NN of candidates or offer values {Xi}i=1N\{X_i\}_{i=1}^N, independently sampled from the Uniform[0,1]\mathrm{Uniform}[0,1] distribution. The decision-maker sequentially observes each XiX_i, and must decide to accept (stop) or reject irreversibly. The selected payoff is XτX_\tau, with τ\tau the stopping time. The policy must be adapted, with no recall of rejected offers.

The objective is:

maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].

The optimal policy is derived via dynamic programming and backward induction. Letting VmV_m denote the maximal expected reward with mm applicants remaining:

V0=0,Vm=E[max{X,Vm1}]=01max{x,Vm1}dx=Vm12+12.V_0 = 0, \quad V_m = \mathbb{E}\left[\max\{X, V_{m-1}\}\right] = \int_0^1 \max\{x, V_{m-1}\} dx = \frac{V_{m-1}^2 + 1}{2}.

The “indifference” threshold or aspiration level {Xi}i=1N\{X_i\}_{i=1}^N0 at stage {Xi}i=1N\{X_i\}_{i=1}^N1 is set by {Xi}i=1N\{X_i\}_{i=1}^N2: accept {Xi}i=1N\{X_i\}_{i=1}^N3 if {Xi}i=1N\{X_i\}_{i=1}^N4, otherwise continue. The recursion, known as the Cayley–Moser recursion, is:

{Xi}i=1N\{X_i\}_{i=1}^N5

At step {Xi}i=1N\{X_i\}_{i=1}^N6 (with {Xi}i=1N\{X_i\}_{i=1}^N7 to go), the cutoff is {Xi}i=1N\{X_i\}_{i=1}^N8. This produces a deterministic threshold sequence and myopic “accept-if-above-cutoff” policy (Demers, 2018).

2. Statistical Properties and Duration Analysis

The distribution of the stopping time {Xi}i=1N\{X_i\}_{i=1}^N9 (the index of the chosen offer) is explicitly evaluated as:

Uniform[0,1]\mathrm{Uniform}[0,1]0

This formula enables computation of moments and other statistics of the search duration, capturing the process duration as a function of the threshold sequence. The expected stopping time is:

Uniform[0,1]\mathrm{Uniform}[0,1]1

Exact closed forms for general Uniform[0,1]\mathrm{Uniform}[0,1]2 are unavailable, but the recursion allows efficient Uniform[0,1]\mathrm{Uniform}[0,1]3 evaluation (Demers, 2018).

For large Uniform[0,1]\mathrm{Uniform}[0,1]4, Gilbert–Mosteller’s asymptotic approximation applies:

Uniform[0,1]\mathrm{Uniform}[0,1]5

Uniform[0,1]\mathrm{Uniform}[0,1]6

Uniform[0,1]\mathrm{Uniform}[0,1]7

The left-skewed, triangular distribution of Uniform[0,1]\mathrm{Uniform}[0,1]8 contrasts markedly with the heavy right-tail profile found in the Secretary problem. Numerical values for small Uniform[0,1]\mathrm{Uniform}[0,1]9 illustrate the trend (table below):

N XiX_i0 XiX_i1 XiX_i2 XiX_i3
10 3.81 0.381 3 0.300
50 19.9 0.398 15 0.300
100 33.8 0.338 29 0.290

Key observations:

  • Expected stopping time scales as XiX_i4.
  • Median is around XiX_i5, below the Secretary-problem threshold of XiX_i6.
  • Candidates are typically selected much earlier than the final period.

3. Comparison to the Classical Secretary and Sultan’s Dowry Problems

The Cayley–Moser problem differs crucially from the classic Secretary (best-choice) problem. In the latter, only ordinal information is available; the decision-maker observes relative ranks, not values. The optimal threshold is XiX_i7, with expected interviews about XiX_i8. In the Cayley–Moser regime, where full information XiX_i9 is exploited, the expected search time is dramatically shorter, as aspiration levels are continually updated to the evolving maximum-a-posteriori estimate of the attainable future reward.

Moreover, the distributional form of the stopping time XτX_\tau0 is approximately triangular for Cayley–Moser, versus a heavy-tailed (geometric-like) distribution for the Secretary problem (Demers, 2018). This demonstrates the effect of full information: not only is the stopping policy more efficient on average, but also the process duration is more concentrated and predictable.

4. Continuous-Time Cayley-Moser with Poissonian Arrivals

A Poissonian arrival model generalizes the Cayley–Moser problem to continuous time, providing analytic tractability and novel insight. Here, offers arrive according to a Poisson process of rate XτX_\tau1 over a known horizon XτX_\tau2. Each offer XτX_\tau3 is i.i.d. from a known XτX_\tau4, and upon expiration the seller may settle for a “salvage” value XτX_\tau5 with mean XτX_\tau6 (Katriel, 4 Nov 2025).

The value function XτX_\tau7, representing the maximum expected sale price at time XτX_\tau8 before deadline and observing offer XτX_\tau9, satisfies:

τ\tau0

with τ\tau1, where τ\tau2 is the next arrival time. The optimal policy is accept τ\tau3 if τ\tau4.

This induces the Volterra integral equation:

τ\tau5

which can be differentiated to the ODE:

τ\tau6

where

τ\tau7

This ODE can be reduced by quadrature:

τ\tau8

implying τ\tau9 for

maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].0

Explicit solutions are attainable for specific distributions:

  • Uniform maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].1: maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].2, maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].3.
  • Exponential maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].4: maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].5, maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].6.
  • Pareto maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].7: maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].8, maxτ{1,,N}E[Xτ].\max_{\tau\in\{1,\dots,N\}} \mathbb{E}\left[X_\tau\right].9.

A key distinction from the discrete case is the analytic tractability of the threshold function VmV_m0, as the continuous-time model leads to a solvable ODE, in contrast to the discrete nonlinear threshold recursion.

5. Distributional Forms in the Continuous-Time Problem

The explicit continuous-time formulation enables closed-form calculation of both the sale-price and stopping-time distributions:

  • The sale-price distribution VmV_m1 satisfies the ODE:

VmV_m2

with solution

VmV_m3

  • The stopping-time CDF VmV_m4 is

VmV_m5

The conditional density VmV_m6 is given by:

VmV_m7

For exponential offers and VmV_m8, VmV_m9 on mm0, indicating a uniform distribution of stopping times up to the horizon in this setting.

6. Interpretations, Economic Insights, and Regime Comparisons

In both discrete- and continuous-time models, the threshold function (mm1 or mm2) is both the reservation price and the conditional expected reward under optimal stopping. For fixed horizon mm3, the threshold is monotonically increasing in available time: with greater time to sell, more selectivity is possible.

In the continuous model, increasing Poisson arrival rate mm4 effectively increases the “number of opportunities” mm5, flattening the reservation curve and lowering selectivity for fixed remaining time. This continuous-time model allows for explicit analysis in situations with non-uniform offer distributions, salvage options, and time-dependent opportunity structure (Katriel, 4 Nov 2025).

As mm6, the continuous-time threshold matches the asymptotic of the discrete regime, corroborating the limit behavior across formulations. In both settings, explicit forms for the full distribution of search duration and realized reward are available in the Poissonian case, contrasting with the exclusively asymptotic results obtainable in the original finite discrete process (Katriel, 4 Nov 2025).

A salient insight: compared to the Secretary problem, the Cayley–Moser regime exploits value information for more efficient and earlier selection, both in expectation and in the concentration of process duration. The informativeness of the offers—rather than solely their rank—permits substantially shorter search and higher efficiency in optimal stopping.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Cayley-Moser Optimal Stopping Problem.