Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning (2405.02654v2)

Published 4 May 2024 in cs.MA, cs.AI, and cs.GT

Abstract: The significance of network structures in promoting group cooperation within social dilemmas has been widely recognized. Prior studies attribute this facilitation to the assortment of strategies driven by spatial interactions. Although reinforcement learning has been employed to investigate the impact of dynamic interaction on the evolution of cooperation, there remains a lack of understanding about how agents develop neighbour selection behaviours and the formation of strategic assortment within an explicit interaction structure. To address this, our study introduces a computational framework based on multi-agent reinforcement learning in the spatial Prisoner's Dilemma game. This framework allows agents to select dilemma strategies and interacting neighbours based on their long-term experiences, differing from existing research that relies on preset social norms or external incentives. By modelling each agent using two distinct Q-networks, we disentangle the coevolutionary dynamics between cooperation and interaction. The results indicate that long-term experience enables agents to develop the ability to identify non-cooperative neighbours and exhibit a preference for interaction with cooperative ones. This emergent self-organizing behaviour leads to the clustering of agents with similar strategies, thereby increasing network reciprocity and enhancing group cooperation.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (44)
  1. Emergence of norms in interactions with complex rewards. In Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, pages 2280–2282, 2023.
  2. Partner selection for the emergence of cooperation in multi-agent systems using reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 7047–7054, 2020.
  3. Knowing the past improves cooperation in the future. Scientific reports, 9(1):262, 2019.
  4. A review of cooperation in multi-agent learning. arXiv preprint arXiv:2312.05162, 2023.
  5. Social influence as intrinsic motivation for multi-agent deep reinforcement learning. In International conference on machine learning, pages 3040–3049, 2019.
  6. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  7. Intuitive honesty versus dishonesty: Meta-analytic evidence. Perspectives on Psychological Science, 14(5):778–796, 2019.
  8. Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents. Proceedings of the National Academy of Sciences, 119(3):e2106028118, 2022.
  9. Negotiation and honesty in artificial intelligence methods for the board game of diplomacy. Nature Communications, 13(1):7214, 2022.
  10. Multi-agent reinforcement learning in sequential social dilemmas. In Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, pages 464–473, 2017.
  11. Changing the intensity of interaction based on individual behavior in the iterated prisoner’s dilemma game. IEEE Transactions on Evolutionary Computation, 21(4):506–517, 2016.
  12. Michael L Littman. Markov games as a framework for multi-agent reinforcement learning. In Machine learning proceedings 1994, pages 157–163. Elsevier, 1994.
  13. Multi-agent actor-critic for mixed cooperative-competitive environments. In Proceedings of the 31st International Conference on Neural Information Processing Systems, pages 6382–6393, 2017.
  14. Scaffolding cooperation in human groups with deep reinforcement learning. Nature Human Behaviour, 7(10):1787–1796, 2023.
  15. Janet Metcalfe. Learning from errors. Annual review of psychology, 68:465–489, 2017.
  16. Human-level control through deep reinforcement learning. nature, 518(7540):529–533, 2015.
  17. Experimental economics: Rethinking the rules. Princeton University Press, 2009.
  18. The spatial dilemmas of evolution. International Journal of Bifurcation and Chaos, 3(01):35–78, 1993.
  19. Martin A Nowak. Five rules for the evolution of cooperation. science, 314(5805):1560–1563, 2006.
  20. Cooperation in alternating interactions with memory constraints. Nature Communications, 13(1):737, 2022.
  21. Evolutionary dynamics of group interactions on structured populations: a review. Journal of the royal society interface, 10(80):20120997, 2013.
  22. Statistical physics of human cooperation. Physics Reports, 687:1–51, 2017.
  23. A multi-agent reinforcement learning model of common-pool resource appropriation. In Proceedings of the 31st International Conference on Neural Information Processing Systems, pages 3646–3655, 2017.
  24. Human cooperation. Trends in cognitive sciences, 17(8):413–425, 2013.
  25. Dynamic social networks promote cooperation in experiments with humans. Proceedings of the National Academy of Sciences, 108(48):19193–19198, 2011.
  26. Prisoner’s dilemma: A study in conflict and cooperation. University of Michigan Press, 1965.
  27. Reputation-based interaction promotes cooperation with reinforcement learning. IEEE Transactions on Evolutionary Computation, 2023.
  28. Evolutionary dynamics in the spatial public goods game with tolerance-based expulsion and cooperation. Chaos, Solitons & Fractals, 151:111241, 2021.
  29. Cooperation prevails when individuals adjust their social ties. PLoS computational biology, 2(10):e140, 2006.
  30. Prioritized experience replay. arXiv preprint arXiv:1511.05952, 2015.
  31. Social learning promotes institutions for governing the commons. Nature, 466(7308):861–863, 2010.
  32. Karl Sigmund. The calculus of selfishness. Princeton University Press, 2010.
  33. Evolution of cooperation with asymmetric social interactions. Proceedings of the National Academy of Sciences, 119(1):e2113468118, 2022.
  34. Reputation-based partner choice is an effective alternative to indirect reciprocity in solving social dilemmas. Evolution and Human Behavior, 34(3):201–206, 2013.
  35. Evolutionary prisoner’s dilemma game on a square lattice. Physical Review E, 58:69–73, 1998.
  36. Blocking defector invasion by focusing on the most successful partner. Applied Mathematics and Computation, 385:125430, 2020.
  37. Jun Tanimoto. Difference of reciprocity effect in two coevolutionary models of presumed two-player and multiplayer games. Physical Review E, 87(6):062136, 2013.
  38. Modeling moral choices in social dilemmas with multi-agent reinforcement learning. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, pages 317–325, 2023.
  39. Human strategy updating in evolutionary games. Proceedings of the National Academy of Sciences, 107(7):2962–2966, 2010.
  40. Deconstructing cooperation and ostracism via multi-agent reinforcement learning. arXiv preprint arXiv:2310.04623, 2023.
  41. A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings. Collective Intelligence, 2(2):26339137231162025, 2023.
  42. Insight into the so-called spatial reciprocity. Physical Review E, 88(4):042145, 2013.
  43. Universal scaling for the dilemma strength in evolutionary games. Physics of Life Reviews, 14:1–30, 2015.
  44. Resolving social dilemmas with minimal reward transfer. arXiv preprint arXiv:2310.12928, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Tianyu Ren (10 papers)
  2. Xiao-jun Zeng (21 papers)