Papers
Topics
Authors
Recent
2000 character limit reached

Multi-Secretary Problem

Updated 5 December 2025
  • Multi-secretary problem is an extension of the classical secretary problem that permits up to k irrevocable selections from sequentially arriving candidates to maximize total value or success probability.
  • Adaptive online policies like the Budget-Ratio and simulated-greedy algorithms dynamically adjust thresholds and achieve near-optimal performance with uniformly bounded regret compared to the offline optimum.
  • Extensions of this framework apply to matroid selection, multi-winner elections, and online fairness, influencing practical applications such as hiring, resource allocation, and participatory budgeting.

The multi-secretary problem, also known as the k-choice secretary problem, extends the classical optimal stopping setting introduced by Cayley and Moser to the case where a decision maker can select up to kk of nn items that arrive sequentially in random or controlled order. Each item is revealed irrevocably and only relative or realized quality information up to the present time is available. The objective is to maximize a set function—typically total value or the probability of capturing the best elements—subject to an online, capacity-constrained selection process.

1. Formal Models and Fundamental Variants

The canonical multi-secretary problem involves nn independent random variables X1,,XnX_1, \dots, X_n observed sequentially, each from a known distribution with finite support {a1<a2<<am}\{a_1 < a_2 < \cdots < a_m\}. At each time tt, the algorithm may irrevocably select XtX_t, subject to a total budget kk of allowed selections. The expected online reward is

$\ALG_n(\pi) = \mathbb{E}\Bigl[\sum_{t=1}^n X_t \sigma_t \Bigr]$

for some policy π\pi with σt{0,1}\sigma_t \in \{0,1\}, tσtk\sum_t \sigma_t \leq k. The comparison benchmark is the offline optimum

$\OPT_n = \mathbb{E}\Bigl[\sum_{j=1}^m a_j S_j^n \Bigr],$

where SjnS_j^n counts the number of selections among the kk largest XtX_t having value aja_j (Arlotto et al., 2017).

Variant formulations include:

  • The uniform matroid or linear kk-secretary problem, where XtX_t are arbitrary weights in adversarial or random order (Ma et al., 2011).
  • The (J,K)(J,K)-secretary problem (multi-choice, multi-best), with the objective to select JJ elements maximizing count or value among the KK true best (Chan et al., 2013).
  • Models with multiple items per rank or parallel (multi-queue) settings (Pinsky, 2022, Sun et al., 2014).
  • Non-linear objectives and fairness constraints, notably in online social choice (Papasotiropoulos et al., 28 Nov 2025).

2. Optimal and Near-Optimal Online Policies

A foundational result is that, in the setting with independent values drawn from a known common finite-support law, adaptivity in selection is critical for minimizing regret relative to the offline optimum. The Budget-Ratio (BR) policy of Arlotto & Gurvich achieves a uniformly bounded regret across all nn and kk by dynamically tuning selection thresholds based on the current residual budget per remaining period. Specifically, the policy sets Rt=Kt/(nt)R_t = K_t/(n-t) at time tt and determines acceptance thresholds {τj}\{\tau_j\} based on the cumulative distribution. The next arrival is accepted if its value exceeds the type dictated by RtR_t (Arlotto et al., 2017).

For non-adaptive policies, the best achievable regret is Ω(n)\Omega(\sqrt{n}), fundamentally due to concentration bounds on binomial selection variability. Adaptive schemes such as BR, or multiple-phase thresholding in the kk-secretary context, maintain a near-perfect tracking of offline allocation except for O(1)O(1) expected mistakes (Arlotto et al., 2017).

For random-order kk-secretary (linear weight) problems, the simulated-greedy algorithm achieves a constant (specifically, 9.6) competitive ratio, using an initial sample to set a threshold and then accepting any subsequent value exceeding this until the quota is filled (Ma et al., 2011). Asymptotically, the optimal policy for kk\to\infty approaches the offline optimum, with error decaying as O(ln2k/k2)O(\ln^2 k / k^2) (Sun et al., 2014).

Parallel and free-order settings utilize multiple threshold structures or dimensionally reduced orderings; entropy-optimal order selection can yield 1O(logk/k)1-O(\sqrt{\log k/k}) competitive ratios using Θ(loglogn)\Theta(\log\log n) bits of randomness (Hajiaghayi et al., 2022).

3. Structural and Asymptotic Characterizations

The continuous LP framework provides an exact characterization of the asymptotics of the JJ-choice KK-best secretary problem. In infinite and large-nn limits, optimal policies correspond to (J,K)(J,K)-threshold algorithms: each quota unlocks at critical times to select up to JJ kk-potentials, with thresholds determined by complementary slackness of the LP. When K=1K=1, the thresholds are explicitly rational and enable precise calculation of the limiting probability of successfully capturing the top JJ items (Chan et al., 2013).

For kk-multiplicity per rank, skipping the first cknc k n items and selecting the first item at least as good as the best seen so far gives, in the limit, success probabilities pk(c)p_k(c) that converge to 1 rapidly as kk increases—e.g., p2()0.70p_2(\cdot)\approx 0.70, p3()0.85p_3(\cdot)\approx 0.85, and p7()>0.99p_7(\cdot)>0.99 (Pinsky, 2022).

In multi-threshold positional strategies for selection, successive selection thresholds are defined recursively, and the optimal thresholds for ss selections in the nn\to\infty limit coincide with the classical Gilbert–Mosteller (Dowry problem) values, with the "right-hand-based" property: the later-stage thresholds are independent of total initial selection allowance (Liu et al., 2023).

4. Extensions: Parallels, Generalizations, and Online Fairness

Parallel and shared-quota multi-secretary problems involve the distribution of candidates among QQ queues or the division of the selection budget across multiple simultaneous streams. Linear-programming duality characterizes optimal threshold structures, and the Adaptive Observation–Selection Protocol achieves tight or near-tight competitive ratios, with closed-form expressions available for special cases such as (Q,J,K)=(2,2,2)(Q,J,K)=(2,2,2) (Sun et al., 2014).

The Stable Secretaries variant models online matching with stability constraints, quantifying the fraction of agents not in blocking pairs. Under random arrival, a constant fraction Ω(1)\Omega(1) of agents can avoid blocking, but the fraction of stable pairs is O(1/n)O(1/\sqrt{n}) and can be as low as O(1/n)O(1/n) under adversarial input (Babichenko et al., 2017).

Recent work explicitly links the multi-secretary problem to online multi-winner elections under cardinal preferences and fairness (e.g., Extended Justified Representation). Impossibility theorems establish that EJR and its relaxations are unattainable deterministically online, but sample-then-select hybrid algorithms (Online MES, BOS) achieve high-probability EJR-like guarantees under random order, and Greedy Budgeting ensures basic justified representation (Papasotiropoulos et al., 28 Nov 2025).

5. Principal Results and Algorithmic Insights

The following table summarizes key competitive guarantees and regret rates for main models:

Model / Policy Achievable Regret / Competitive Ratio Reference
Adaptive Budget-Ratio (BR) Regret uniformly bounded in (n,k)(n,k) (Arlotto et al., 2017)
Non-adaptive (static) policy Regret Ω(n)\Omega(\sqrt{n}) (Arlotto et al., 2017)
Simulated-greedy (random order) 9.6-competitive, uniform matroid (Ma et al., 2011)
LP-threshold (infinite items) Optimal rational thresholds, large nn (Chan et al., 2013)
Parallel deterministic protocol Case (2,2,2)(2,2,2): comp. ratio 0.372\approx 0.372 (Sun et al., 2014)
Free-order, randomness-efficient Ratio 1O(logk/k)1-O(\sqrt{\log k/k}), entropy-opt. (Hajiaghayi et al., 2022)
kk-multiplicity per rank Success 1\rightarrow 1 rapidly for kk\to\infty (Pinsky, 2022)
Online fairness (Random order) (1/e)kp\geq(1/e)^{k-p}-EJR for pp slots (Papasotiropoulos et al., 28 Nov 2025)

Algorithmically, all near-optimal policies—across classical, parallel, free-order, and fairness-augmented domains—share a thresholding structure, dynamically or statically tuned via offline sample statistics or adaptive control over budget exhaustion.

6. Open Directions and Challenges

Despite substantial progress, several research directions remain:

  • Online fairness and proportionality constraints in adversarial or unknown-horizon settings appear inherently limited (no deterministic EJR possible online) (Papasotiropoulos et al., 28 Nov 2025).
  • Extensions to general submodular objectives within matroid constraints retain only constant-factor guarantees, and closing these gaps for broad classes of valuation functions remains unresolved (Ma et al., 2011).
  • Asymptotic and finite-n analyses for more general arrival processes (non-uniform, adversarial with limited power) and richer informational settings (feedback, delayed response) are underdeveloped.
  • The optimal use of randomness (entropy-minimized strategies) and deterministic derandomizations for multi-secretary and related prophet models continue to be active research areas (Hajiaghayi et al., 2022).

7. Connections to Broader Theoretical and Applied Domains

The multi-secretary problem is deeply intertwined with the theory of matroid secretary problems, prophet inequalities, and online allocation under feasibility and fairness constraints. It provides a tractable mathematical foundation for real-world online selection tasks, such as online hiring, resource allocation in participatory budgeting, public goods selection under initiative processes, and more generally, the study of irrevocable online decision-making under uncertainty with combinatorial structure.

References: (Arlotto et al., 2017, Ma et al., 2011, Chan et al., 2013, Sun et al., 2014, Pinsky, 2022, Liu et al., 2023, Hajiaghayi et al., 2022, Papasotiropoulos et al., 28 Nov 2025, Babichenko et al., 2017)

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Multi-Secretary Problem.