
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method (2402.15813v3)

Published 24 Feb 2024 in cs.CL and cs.GT

Abstract: Bargaining is an important and unique part of negotiation between humans. As LLM-driven agents learn to negotiate and act like real humans, how to evaluate agents' bargaining abilities remains an open problem. For the first time, we formally describe the Bargaining task as an asymmetric incomplete information game, defining the gains of the Buyer and the Seller over multiple bargaining processes. This formulation allows us to quantitatively assess an agent's performance in the Bargaining task. We collected a real product price dataset, AmazonHistoryPrice, and evaluated the bargaining abilities of various LLM agents. We find that playing the Buyer is much harder than playing the Seller, and that increasing model size cannot effectively improve the Buyer's performance. To address this challenge, we propose a novel approach called OG-Narrator, which integrates a deterministic Offer Generator to control the price range of the Buyer's offers and an LLM Narrator to create natural language sentences for the generated offers. Experimental results show that OG-Narrator improves the Buyer's deal rates from 26.67% to 88.88% and yields a tenfold increase in profits on all baselines, including a model that has not been aligned.
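
The split between a deterministic Offer Generator and an LLM Narrator can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the linear price ramp, the names `OfferGenerator`, `narrate`, and `buyer_turn`, the parameters `start_fraction` and `max_rounds`, and the template string that stands in for an actual LLM call. The point is only that the price is fixed by code before any text is generated.

```python
from dataclasses import dataclass


@dataclass
class OfferGenerator:
    """Deterministic price schedule for the Buyer (hypothetical linear ramp)."""

    budget: float                 # most the Buyer will pay (assumed known to the Buyer)
    start_fraction: float = 0.5   # assumed opening offer, as a fraction of the budget
    max_rounds: int = 10          # assumed number of bargaining rounds

    def offer(self, round_index: int) -> float:
        """Return the offer for a 0-based round, rising linearly to the budget."""
        if self.max_rounds < 2:
            return round(self.budget, 2)
        # Clamp the round so late rounds never exceed the budget.
        step = min(max(round_index, 0), self.max_rounds - 1)
        low = self.budget * self.start_fraction
        price = low + (self.budget - low) * step / (self.max_rounds - 1)
        return round(price, 2)


def narrate(price: float, product: str) -> str:
    """Stand-in for the LLM Narrator: a fixed template instead of a model call."""
    return f"I can offer ${price:.2f} for the {product}."


def buyer_turn(generator: OfferGenerator, round_index: int, product: str):
    """Pick the price deterministically, then wrap it in natural language."""
    price = generator.offer(round_index)
    return price, narrate(price, product)


if __name__ == "__main__":
    gen = OfferGenerator(budget=100.0)
    for i in range(3):
        print(buyer_turn(gen, i, "desk lamp"))
```

In a real system the `narrate` function would prompt a language model with the chosen price; because the number is computed beforehand, the model cannot drift outside the intended price range.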

Authors (7)
  1. Tian Xia (66 papers)
  2. Zhiwei He (42 papers)
  3. Tong Ren (6 papers)
  4. Yibo Miao (24 papers)
  5. Zhuosheng Zhang (125 papers)
  6. Yang Yang (884 papers)
  7. Rui Wang (997 papers)
