Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 63 tok/s
Gemini 2.5 Pro 49 tok/s Pro
GPT-5 Medium 11 tok/s Pro
GPT-5 High 10 tok/s Pro
GPT-4o 83 tok/s Pro
Kimi K2 139 tok/s Pro
GPT OSS 120B 438 tok/s Pro
Claude Sonnet 4 38 tok/s Pro
2000 character limit reached

A Stackelberg Game based on the Secretary Problem: Optimal Response is History Dependent (2409.04153v1)

Published 6 Sep 2024 in cs.GT and math.OC

Abstract: This article considers a problem arising from a two-player game based on the classical secretary problem. First, Player 1 selects one object from a sequence as in the secretary problem. All of the other objects are then presented to Player 2 in the same order as in the original sequence. The goal of both players is to select the best object. The optimal response of Player 2 is adapted to the optimal strategy in the secretary problem. This means that when Player 2 observes an object that is the best seen so far, it can be inferred whether Player 1 selected one of the earlier objects in the original sequence. However, Player 2 cannot compare the current object with the one selected by Player 1. Hence, this game defines an auxiliary problem in which Player 2 has incomplete information on the relative rank of an object. It is shown that the optimal strategy of Player 2 is based on both the number of objects to have appeared and the probability that the current object is better than the object chosen by Player 1 (if Player 1 chose an earlier object in the sequence). However, this probability is dependent on the previously observed objects. A lower bound on the optimal expected reward in the auxiliary problem is defined by limiting the memory of Player 2. An upper bound is derived by giving Player 2 additional information at appropriate times. The methods used illustrate approaches that can be used to approximate the optimal reward in a stopping problem when there is incomplete information on the ranks of objects and/or the optimal strategy is history dependent, as in the Robbins' problem

Summary

We haven't generated a summary for this paper yet.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (1)

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.