
Towards Explainable Strategy Templates using NLP Transformers (2311.14061v1)

Published 23 Nov 2023 in cs.AI

Abstract: This paper bridges the gap between mathematical heuristic strategies learned from Deep Reinforcement Learning (DRL) in automated agent negotiation and comprehensible natural language explanations. Our aim is to make these strategies more accessible to non-experts. By leveraging traditional NLP techniques and Transformer-based LLMs, we outline how parts of DRL strategies, themselves built from components of strategy templates, can be transformed into user-friendly, human-like English narratives. To achieve this, we present a top-level algorithm that involves parsing mathematical expressions of strategy templates, semantically interpreting variables and structures, generating rule-based primary explanations, and utilizing a Generative Pre-trained Transformer (GPT) model to refine and contextualize these explanations. An example, including subsequent customization for varied audiences and meticulous validation processes, illustrates the applicability and potential of this approach.
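
The first three steps named in the abstract (parsing an expression, interpreting its variables, and producing a rule-based primary explanation) might look roughly like the sketch below. It is only an illustration under stated assumptions, not the paper's algorithm: the condition string, the `GLOSSARY` entries, and the function names are all hypothetical, and Python's standard `ast` module stands in for whatever parser the authors use. The GPT refinement step is not shown.

```python
import ast

# Hypothetical glossary: maps template variable names to plain-English meanings.
# These names and descriptions are assumptions made for illustration only.
GLOSSARY = {
    "t": "the elapsed negotiation time",
    "u": "the utility of the opponent's latest offer",
    "c1": "a learned time threshold",
    "c2": "a learned utility threshold",
}

# Phrases for the comparison operators this sketch understands.
OPERATORS = {
    ast.Lt: "is less than",
    ast.LtE: "is at most",
    ast.Gt: "is greater than",
    ast.GtE: "is at least",
}


def describe(node):
    """Recursively turn a parsed condition into an English phrase."""
    if isinstance(node, ast.BoolOp):
        # Note: nested mixtures of and/or lose their grouping in this sketch.
        joiner = " and " if isinstance(node.op, ast.And) else " or "
        return joiner.join(describe(value) for value in node.values)
    if isinstance(node, ast.Compare) and len(node.ops) == 1:
        left = describe(node.left)
        right = describe(node.comparators[0])
        return f"{left} {OPERATORS[type(node.ops[0])]} {right}"
    if isinstance(node, ast.Name):
        # Semantic interpretation step: look the variable up in the glossary.
        return GLOSSARY.get(node.id, node.id)
    if isinstance(node, ast.Constant):
        return str(node.value)
    raise ValueError(f"unsupported expression element: {type(node).__name__}")


def primary_explanation(condition, action):
    """Build a rule-based sentence from a condition string and an action phrase."""
    tree = ast.parse(condition, mode="eval")  # parsing step
    return f"If {describe(tree.body)}, then {action}."


if __name__ == "__main__":
    print(primary_explanation("t < c1 and u >= c2", "accept the offer"))
```

In a pipeline of the kind the abstract describes, a sentence produced this way would then be passed to a language model for refinement and audience-specific rewording; that stage depends on an external model and is left out here.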

