Truthful Aggregation of LLMs with an Application to Online Advertising (2405.05905v5)
Abstract: The next frontier of online advertising is revenue generation from LLM-generated content. We consider a setting where advertisers aim to influence the responses of an LLM to align with their interests, while platforms seek to maximize advertiser value and ensure user satisfaction. The challenge is that advertisers' preferences generally conflict with those of the user, and advertisers may misreport their preferences. To address this, we introduce MOSAIC, an auction mechanism that ensures that truthful reporting is a dominant strategy for advertisers and that aligns the utility of each advertiser with their contribution to social welfare. Importantly, the mechanism operates without LLM fine-tuning or access to model weights and provably converges to the output of the optimally fine-tuned LLM as computational resources increase. Additionally, it can incorporate contextual information about advertisers, which significantly improves social welfare. Through experiments with a publicly available LLM, we show that MOSAIC leads to high advertiser value and platform revenue with low computational overhead. While our motivating application is online advertising, our mechanism can be applied in any setting with monetary transfers, making it a general-purpose solution for truthfully aggregating the preferences of self-interested agents over LLM-generated replies.
- A general theoretical paradigm to understand learning from human preferences. arXiv preprint arXiv:2310.12036, 2023.
- Constitutional ai: Harmlessness from ai feedback. arXiv preprint arXiv:2212.08073, 2022.
- Edward Clarke. Multipart pricing of public goods. Public Choice, 11(1):17–33, 1971. URL https://EconPapers.repec.org/RePEc:kap:pubcho:v:11:y:1971:i:1:p:17-33.
- Social Choice for AI Alignment: Dealing with Diverse Human Feedback, April 2024.
- Auctions with llm summaries, 2024.
- Mechanism design for large language models, 2023.
- Internet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords. American Economic Review, 97(1):242–259, March 2007. doi: 10.1257/aer.97.1.242. URL https://www.aeaweb.org/articles?id=10.1257/aer.97.1.242.
- Online Advertisements with LLMs: Opportunities and Challenges, April 2024.
- Generative Social Choice, November 2023.
- General truthfulness characterizations via convex analysis. Games and Economic Behavior, 130:636–662, 2021. ISSN 0899-8256. doi: https://doi.org/10.1016/j.geb.2021.09.010. URL https://www.sciencedirect.com/science/article/pii/S0899825621001330.
- Theodore Groves. Incentives in teams. Econometrica, 41(4):617–631, 1973. ISSN 00129682, 14680262. URL http://www.jstor.org/stable/1914085.
- Algorithmic Persuasion Through Simulation, April 2024.
- Truth revelation in approximately efficient combinatorial auctions. Journal of the ACM (JACM), 49(5):577–602, 2002.
- Q-probe: A lightweight approach to reward maximization for language models, 2024.
- Roger B Myerson. Optimal Auction Design. Mathematics of Operations Research, 6(1), February 1981.
- N. Nisan and A. Ronen. Computationally Feasible VCG Mechanisms. Journal of Artificial Intelligence Research, 29:19–47, May 2007. ISSN 1076-9757. doi: 10.1613/jair.2046.
- Algorithmic mechanism design. In Proceedings of the thirty-first annual ACM symposium on Theory of computing, pages 129–140, 1999.
- Economics and computation: a design approach, 2024.
- Direct preference optimization: Your language model is secretly a reward model, 2023.
- Jean-Charles Rochet. A necessary and sufficient condition for rationalizability in a quasi-linear context. Journal of Mathematical Economics, 16(2):191–200, 1987. ISSN 0304-4068. doi: https://doi.org/10.1016/0304-4068(87)90007-3. URL https://www.sciencedirect.com/science/article/pii/0304406887900073.
- Code llama: Open foundation models for code, 2024.
- Llama 2: Open foundation and fine-tuned chat models, 2023.
- Hal R. Varian. Position auctions. International Journal of Industrial Organization, 25(6):1163–1178, December 2007. URL https://ideas.repec.org/a/eee/indorg/v25y2007i6p1163-1178.html.
- William Vickrey. Counterspeculation, auctions, and competitive sealed tenders. The Journal of Finance, 16(1):8–37, 1961. doi: https://doi.org/10.1111/j.1540-6261.1961.tb02789.x. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1540-6261.1961.tb02789.x.
- Fine-tuning language models from human preferences, 2020.