Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
166 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Bandit Sequential Posted Pricing via Half-Concavity (2312.12794v2)

Published 20 Dec 2023 in cs.LG, cs.DS, and cs.GT

Abstract: Sequential posted pricing auctions are popular because of their simplicity in practice and their tractability in theory. A usual assumption in their study is that the Bayesian prior distributions of the buyers are known to the seller, while in reality these priors can only be accessed from historical data. To overcome this assumption, we study sequential posted pricing in the bandit learning model, where the seller interacts with $n$ buyers over $T$ rounds: In each round the seller posts $n$ prices for the $n$ buyers and the first buyer with a valuation higher than the price takes the item. The only feedback that the seller receives in each round is the revenue. Our main results obtain nearly-optimal regret bounds for single-item sequential posted pricing in the bandit learning model. In particular, we achieve an $\tilde{O}(\mathsf{poly}(n)\sqrt{T})$ regret for buyers with (Myerson's) regular distributions and an $\tilde{O}(\mathsf{poly}(n)T{{2}/{3}})$ regret for buyers with general distributions, both of which are tight in the number of rounds $T$. Our result for regular distributions was previously not known even for the single-buyer setting and relies on a new half-concavity property of the revenue function in the value space. For $n$ sequential buyers, our technique is to run a generalized single-buyer algorithm for all the buyers and to carefully bound the regret from the sub-optimal pricing of the suffix buyers.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. Reducing mechanism design to algorithm design via machine learning. J. Comput. Syst. Sci., 74(8):1245–1270, 2008.
  2. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 5(1):1–122, 2012.
  3. How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design. In STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 919–932. ACM, 2021.
  4. Posted prices vs. negotiations: an asymptotic analysis. EC, 10:1386790–1386801, 2008.
  5. Bandits with knapsacks. J. ACM, 65(3):13:1–13:55, 2018.
  6. A general theory of sample complexity for multi-item profit maximization. In Proceedings of the 2018 ACM Conference on Economics and Computation, pages 173–174. ACM, 2018.
  7. Dynamic pricing with finitely many unknown valuations. In Algorithmic Learning Theory, pages 247–273. PMLR, 2019.
  8. Prediction, learning, and games. Cambridge university press, 2006.
  9. Learning multi-item auctions with (or without) samples. In Chris Umans, editor, 58th IEEE Annual Symposium on Foundations of Computer Science, FOCS, pages 516–527. IEEE Computer Society, 2017.
  10. On the complexity of optimal lottery pricing and randomized mechanisms. In IEEE 56th Annual Symposium on Foundations of Computer Science, FOCS 2015, Berkeley, CA, USA, 17-20 October, 2015, pages 1464–1479, 2015.
  11. From pricing to prophets, and back! Oper. Res. Lett., 47(1):25–29, 2019.
  12. Regret minimization for reserve prices in second-price auctions. IEEE Trans. Inf. Theory, 61(1):549–564, 2015.
  13. Algorithmic Pricing via Virtual Valuations. In the 8th ACM Conference on Electronic Commerce (EC), 2007.
  14. Multi-parameter mechanism design and sequential posted pricing. In Proceedings of the 42nd ACM Symposium on Theory of Computing, STOC, pages 311–320, 2010.
  15. The power of randomness in bayesian optimal mechanism design. Games and Economic Behavior, 91:297–317, 2015.
  16. Computing simple mechanisms: Lift-and-round over marginal reduced forms. In 54th Annual ACM SIGACT Symposium on Theory of Computing, STOC, pages 704–717, 2022.
  17. Unimodal bandits: Regret lower bounds and optimal algorithms. In Proceedings of the 31th International Conference on Machine Learning, ICML 2014, Beijing, China, 21-26 June 2014, volume 32 of JMLR Workshop and Conference Proceedings, pages 521–529. JMLR.org, 2014.
  18. The sample complexity of revenue maximization. In David B. Shmoys, editor, Symposium on Theory of Computing, STOC 2014, New York, NY, USA, May 31 - June 03, 2014, pages 243–252. ACM, 2014.
  19. The sample complexity of auctions with side information. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC, pages 426–439. ACM, 2016.
  20. Revenue maximization with a single sample. Games Econ. Behav., 91:318–333, 2015.
  21. Auctions versus posted prices in online markets. Journal of Political Economy, 126(1):178–215, 2018.
  22. Generalizing complex hypotheses on product distributions: Auctions, prophet inequalities, and pandora’s problem. In Conference on Learning Theory, COLT, pages 2248–2288, 2021.
  23. Settling the sample complexity of single-parameter revenue maximization. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, STOC, pages 662–673. ACM, 2019.
  24. Bandit algorithms for prophet inequality and pandora’s box. In Proceedings of the thirty-sixth annual ACM-SIAM symposium on Discrete Algorithms, SODA, 2024.
  25. Efficient empirical revenue maximization in single-parameter auction environments. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC, pages 856–868. ACM, 2017.
  26. The sample complexity of up-to-ϵitalic-ϵ\epsilonitalic_ϵ multi-dimensional revenue maximization. J. ACM, 68(3):15:1–15:28, 2021.
  27. Jason D Hartline. Mechanism design and approximation. Book draft. October, 122(1), 2013.
  28. Making the most of your samples. SIAM J. Comput., 47(3):651–674, 2018.
  29. Adversarial bandits with knapsacks. J. ACM, 69(6):40:1–40:47, 2022.
  30. Learning reserve prices in second-price auctions. In 14th Innovations in Theoretical Computer Science Conference, ITCS, 2023.
  31. The value of knowing a demand curve: Bounds on regret for online posted-price auctions. In 44th Symposium on Foundations of Computer Science, FOCS, pages 594–605. IEEE Computer Society, 2003.
  32. Bandit algorithms. Cambridge University Press, 2020.
  33. Pricing query complexity of revenue maximization. In Proceedings of the thirty-fourth annual ACM-SIAM symposium on Discrete Algorithms, 2023.
  34. Brendan Lucier. An economic view of prophet inequalities. SIGecom Exch., 16(1):24–47, 2017.
  35. Lipschitz bandits: Regret lower bounds and optimal algorithms. CoRR, abs/1405.4758, 2014.
  36. Learning with comparison feedback: Online estimation of sample statistics. In Vitaly Feldman, Katrina Ligett, and Sivan Sabato, editors, Algorithmic Learning Theory, 16-19 March 2021, Virtual Conference, Worldwide, volume 132 of Proceedings of Machine Learning Research, pages 983–1001. PMLR, 2021.
  37. Learning simple auctions. In Proceedings of the 29th Conference on Learning Theory, COLT, volume 49 of JMLR Workshop and Conference Proceedings, pages 1298–1318. JMLR.org, 2016.
  38. Roger B Myerson. Optimal auction design. Mathematics of operations research, 6(1):58–73, 1981.
  39. Online learning via offline greedy algorithms: Applications in market design and optimization. In EC ’21: The 22nd ACM Conference on Economics and Computation, pages 737–738. ACM, 2021.
  40. Non-stochastic cdf estimation using threshold queries. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 3551–3572. SIAM, 2023.
  41. Tim Roughgarden. Twenty lectures on algorithmic game theory. 2017.
  42. Ironing in the dark. In Proceedings of the 2016 ACM Conference on Economics and Computation, EC, pages 1–18. ACM, 2016.
  43. Aleksandrs Slivkins. Introduction to multi-armed bandits. Foundations and Trends in Machine Learning, 12(1-2):1–286, 2019.
Citations (2)

Summary

We haven't generated a summary for this paper yet.