A Fast Algorithm for the Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit (2306.09202v3)

Published 15 Jun 2023 in cs.LG

Abstract: We study the real-valued combinatorial pure exploration problem in the stochastic multi-armed bandit (R-CPE-MAB), focusing on the case where the size of the action set is polynomial in the number of arms. In this regime, the R-CPE-MAB can be seen as a special case of the so-called transductive linear bandits. We introduce the combinatorial gap-based exploration (CombGapE) algorithm, whose sample complexity upper bound matches the lower bound up to a problem-dependent constant factor. We numerically show that CombGapE significantly outperforms existing methods on both synthetic and real-world datasets.
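
To make the setting concrete, below is a minimal sketch of a generic gap-based pure-exploration loop for combinatorial bandits with real-valued action vectors, written under the assumption of 1-sub-Gaussian arm rewards and an explicitly enumerated action set. It is not the paper's CombGapE algorithm: the confidence widths, stopping rule, and arm-selection heuristic are illustrative choices, and the names (`gap_based_cpe`, `sample_arm`) are hypothetical.

```python
import numpy as np

def gap_based_cpe(sample_arm, actions, delta=0.05, max_rounds=100_000):
    """Illustrative gap-based pure-exploration loop (not CombGapE itself).

    sample_arm(i) -> one noisy observation of arm i's reward.
    actions       -> (K, d) array; each row is the real-valued weight
                     vector of one candidate action over the d arms.
    Returns the index of the empirically best action at stopping time.
    """
    actions = np.asarray(actions, dtype=float)
    K, d = actions.shape
    counts = np.zeros(d)
    sums = np.zeros(d)

    # Initialisation: pull every arm once.
    for i in range(d):
        sums[i] += sample_arm(i)
        counts[i] += 1.0

    for t in range(d, max_rounds):
        mu_hat = sums / counts
        values = actions @ mu_hat              # empirical value of each action
        best = int(np.argmax(values))

        # Confidence width of each action's value, assuming 1-sub-Gaussian noise.
        beta = np.sqrt(2.0 * np.log(4.0 * K * t * t / delta))
        widths = beta * np.sqrt((actions ** 2 / counts).sum(axis=1))

        # Challenger whose gap to the empirical leader is least resolved.
        gaps = values[best] - values
        gaps[best] = np.inf
        challenger = int(np.argmin(gaps - widths))

        # Stop once the leader beats every challenger with confidence.
        if gaps[challenger] > widths[best] + widths[challenger]:
            return best

        # Pull the arm contributing the most uncertainty to the critical gap.
        diff = np.abs(actions[best] - actions[challenger])
        arm = int(np.argmax(diff / np.sqrt(counts)))
        sums[arm] += sample_arm(arm)
        counts[arm] += 1.0

    # Budget exhausted: return the current empirical best.
    return int(np.argmax(actions @ (sums / counts)))
```

For example, with `actions` encoding all size-2 subsets of three arms as 0/1 rows, the loop keeps sampling individual arms until one subset's estimated total reward exceeds every other subset's by more than the combined confidence widths, and then returns that subset's index.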
