
Online Resource Allocation under Horizon Uncertainty (2206.13606v3)

Published 27 Jun 2022 in cs.DS, cs.LG, and math.OC

Abstract: We study stochastic online resource allocation: a decision maker needs to allocate limited resources to stochastically-generated sequentially-arriving requests in order to maximize reward. At each time step, requests are drawn independently from a distribution that is unknown to the decision maker. Online resource allocation and its special cases have been studied extensively in the past, but prior results crucially and universally rely on the strong assumption that the total number of requests (the horizon) is known to the decision maker in advance. In many applications, such as revenue management and online advertising, the number of requests can vary widely because of fluctuations in demand or user traffic intensity. In this work, we develop online algorithms that are robust to horizon uncertainty. In sharp contrast to the known-horizon setting, no algorithm can achieve even a constant asymptotic competitive ratio that is independent of the horizon uncertainty. We introduce a novel generalization of dual mirror descent which allows the decision maker to specify a schedule of time-varying target consumption rates, and prove corresponding performance guarantees. We go on to give a fast algorithm for computing a schedule of target consumption rates that leads to near-optimal performance in the unknown-horizon setting. In particular, our competitive ratio attains the optimal rate of growth (up to logarithmic factors) as the horizon uncertainty grows large. Finally, we also provide a way to incorporate machine-learned predictions about the horizon which interpolates between the known and unknown horizon settings.
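To make the idea of a schedule of target consumption rates concrete, here is a minimal sketch of a dual-descent allocation rule for a single resource. It is an illustration under stated assumptions, not the paper's algorithm: it uses a plain Euclidean (subgradient) update as a simple stand-in for dual mirror descent, and the function name `dual_descent_allocation`, the `(reward, cost)` request format, and the `step_size` parameter are all hypothetical choices made for this example.

```python
def dual_descent_allocation(requests, budget, target_rates, step_size=0.1):
    """Accept or reject requests online using a dual price on one resource.

    requests:     list of (reward, cost) pairs, revealed one at a time
                  (hypothetical input format chosen for this sketch)
    budget:       total amount of the resource available
    target_rates: target_rates[t] is the desired consumption at step t;
                  a constant list corresponds to a fixed per-step target,
                  while a varying list is a time-varying schedule
    step_size:    step size of the dual update (assumed tuning parameter)
    """
    mu = 0.0                # dual variable: current price of the resource
    remaining = budget
    total_reward = 0.0
    decisions = []

    for (reward, cost), target in zip(requests, target_rates):
        # Accept only if the reward beats the priced cost and budget allows.
        accept = reward > mu * cost and cost <= remaining
        spent = cost if accept else 0.0
        if accept:
            remaining -= cost
            total_reward += reward
        decisions.append(accept)

        # Euclidean dual update: raise the price when consumption exceeds
        # the target for this step, lower it otherwise, and keep it >= 0.
        mu = max(0.0, mu + step_size * (spent - target))

    return total_reward, remaining, decisions


if __name__ == "__main__":
    # Five identical requests, a budget of 2, and a constant target rate.
    demo_requests = [(1.0, 1.0)] * 5
    reward, left, accepted = dual_descent_allocation(
        demo_requests, budget=2.0, target_rates=[0.4] * 5, step_size=1.0
    )
    print(reward, left, accepted)
```

In this sketch the schedule is simply passed in as a list. How to choose that schedule when the horizon is unknown is the question the paper addresses, and the sketch makes no attempt to reproduce that computation.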
