Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

New Perspectives in Online Contract Design (2403.07143v2)

Published 11 Mar 2024 in cs.GT and cs.LG

Abstract: This work studies the repeated principal-agent problem from an online learning perspective. The principal's goal is to learn the optimal contract that maximizes her utility through repeated interactions, without prior knowledge of the agent's type (i.e., the agent's cost and production functions). This work contains three technical results. First, learning linear contracts with binary outcomes is equivalent to dynamic pricing with an unknown demand curve. Second, learning an approximately optimal contract with identical agents can be accomplished with a polynomial sample complexity scheme. Third, learning the optimal contract with heterogeneous agents can be reduced to Lipschitz bandits under mild regularity conditions. The technical results demonstrate that the one-dimensional effort model, the default model for principal-agent problems in economics which seems largely ignored in recent works from the computer science community, may possibly be the more suitable choice when studying contract design from a learning perspective.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Learning prices for repeated auctions with strategic buyers. In Neural Information Processing Systems.
  2. Contract theory. MIT press.
  3. Learning approximately optimal contracts. In International Symposium on Algorithmic Game Theory, pages 331–346. Springer.
  4. Conlon, J. R. (2009). Two new conditions supporting the first-order approach to multisignal principal–agent problems. Econometrica, 77(1):249–278.
  5. Combinatorial contracts. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS), pages 815–826. IEEE.
  6. Multi-agent contracts. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, pages 1311–1324.
  7. Simple versus optimal contracts. In Proceedings of the 2019 ACM Conference on Economics and Computation, pages 369–387.
  8. The complexity of contracts. SIAM Journal on Computing, 50(1):211–254.
  9. An analysis of the principal-agent problem. In Foundations of Insurance Economics: Readings in Economics and Finance, pages 302–340. Springer.
  10. Learning in stackelberg games with non-myopic agents. Proceedings of the 23rd ACM Conference on Economics and Computation.
  11. Adaptive contract design for crowdsourcing markets: Bandit algorithms for repeated principal-agent problems. In Proceedings of the fifteenth ACM conference on Economics and computation, pages 359–376.
  12. Holmström, B. (1979). Moral hazard and observability. The Bell journal of economics, pages 74–91.
  13. Holmstrom, B. (1982). Moral hazard in teams. The Bell journal of economics, pages 324–340.
  14. Holmström, B. (1999). Managerial incentive problems: A dynamic perspective. The review of Economic studies, 66(1):169–182.
  15. Innes, R. D. (1990). Limited liability and incentive contracting with ex-ante action choices. Journal of economic theory, 52(1):45–67.
  16. Jewitt, I. (1988). Justifying the first-order approach to principal-agent problems. Econometrica: Journal of the Econometric Society, pages 1177–1190.
  17. Moral hazard with bounded payments. Journal of Economic Theory, 143(1):59–82.
  18. Information space conditions for the first-order approach in agency problems. Journal of Economic Theory, 160:243–279.
  19. The value of knowing a demand curve: Bounds on regret for online posted-price auctions. In 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings., pages 594–605. IEEE.
  20. Bandits and experts in metric spaces. Journal of the ACM (JACM), 66:1 – 77.
  21. Efficient convex optimization with membership oracles. In Conference On Learning Theory, pages 1292–1294. PMLR.
  22. Learning and approximating the optimal strategy to commit to. In Algorithmic Game Theory.
  23. Adaptivity to smoothness in x-armed bandits. In Conference on Learning Theory, pages 1463–1492. PMLR.
  24. Mirrlees, J. A. (1999). The theory of moral hazard and unobservable behaviour: Part i. The Review of Economic Studies, 66(1):3–21.
  25. Revenue optimization against strategic buyers. In Neural Information Processing Systems.
  26. Learning optimal strategies to commit to. In AAAI Conference on Artificial Intelligence.
  27. Adaptive discretization for adversarial lipschitz bandits. In Annual Conference Computational Learning Theory.
  28. Rogerson, W. P. (1985). The first-order approach to principal-agent problems. Econometrica: Journal of the Econometric Society, pages 1357–1367.
  29. The sample complexity of online contract design. Proceedings of the 24th ACM Conference on Economics and Computation.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (1)
  1. Shiliang Zuo (9 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.