Papers
Topics
Authors
Recent
2000 character limit reached

Learning Models of Adversarial Agent Behavior under Partial Observability (2306.11168v2)

Published 19 Jun 2023 in cs.LG, cs.AI, and cs.MA

Abstract: The need for opponent modeling and tracking arises in several real-world scenarios, such as professional sports, video game design, and drug-trafficking interdiction. In this work, we present Graph based Adversarial Modeling with Mutal Information (GrAMMI) for modeling the behavior of an adversarial opponent agent. GrAMMI is a novel graph neural network (GNN) based approach that uses mutual information maximization as an auxiliary objective to predict the current and future states of an adversarial opponent with partial observability. To evaluate GrAMMI, we design two large-scale, pursuit-evasion domains inspired by real-world scenarios, where a team of heterogeneous agents is tasked with tracking and interdicting a single adversarial agent, and the adversarial agent must evade detection while achieving its own objectives. With the mutual information formulation, GrAMMI outperforms all baselines in both domains and achieves 31.68% higher log-likelihood on average for future adversarial state predictions across both domains.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. Learning from suboptimal demonstration via self-supervised reward regression. In Conference on robot learning, pages 1262–1277. PMLR, 2021.
  2. Mixture kalman filters. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 62(3):493–508, 2000.
  3. Infogan: Interpretable representation learning by information maximizing generative adversarial nets, 2016.
  4. A recurrent latent variable model for sequential data. Advances in neural information processing systems, 28, 2015.
  5. Target tracking by particle filtering in binary sensor networks. IEEE Transactions on signal processing, 56(6):2229–2238, 2008.
  6. Learning policy representations in multiagent systems. In International conference on machine learning, pages 1802–1811. PMLR, 2018.
  7. Multi-modal imitation learning from unstructured demonstrations using generative adversarial nets. Advances in neural information processing systems, 30, 2017.
  8. Opponent modeling in deep reinforcement learning. In International conference on machine learning, pages 1804–1813. PMLR, 2016.
  9. Propagating state uncertainty through trajectory forecasting. In 2022 International Conference on Robotics and Automation (ICRA), pages 2351–2358. IEEE, 2022.
  10. Target tracking algorithm based on adaptive strong tracking particle filter. IET Science, Measurement & Technology, 10(7):704–710, 2016.
  11. Multiple hypothesis tracking revisited. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), December 2015.
  12. Unscented kalman filters for multiple target tracking with symmetric measurement equations. IEEE Transactions on Automatic Control, 54(2):370–375, 2009.
  13. Infogail: Interpretable imitation learning from visual demonstrations, 2017.
  14. A survey of opponent modeling in adversarial domains. Journal of Artificial Intelligence Research, 73:277–327, 2022.
  15. Interpretable and personalized apprenticeship scheduling: Learning interpretable scheduling policies from heterogeneous user demonstrations. Advances in Neural Information Processing Systems, 33:6417–6428, 2020.
  16. Variational autoencoders for opponent modeling in multi-agent systems. arXiv preprint arXiv:2001.10829, 2020.
  17. Modeling others using oneself in multi-agent reinforcement learning. In International conference on machine learning, pages 4257–4266. PMLR, 2018.
  18. G Mallikarjuna Rao and Ch Satyanarayana. Visual object target tracking using particle filter: a survey. International Journal of Image, Graphics and Signal Processing, 5(6):1250, 2013.
  19. X. Rong Li and V.P. Jilkov. Survey of maneuvering target tracking. part i. dynamic models. IEEE Transactions on Aerospace and Electronic Systems, 39(4):1333–1364, 2003.
  20. Mind meld: Personalized meta-learning for robot-centric imitation learning. In 2022 17th ACM/IEEE International Conference on Human-Robot Interaction (HRI), pages 157–165. IEEE, 2022.
  21. Learning when to communicate at scale in multiagent cooperative and competitive tasks. arXiv preprint arXiv:1812.09755, 2018.
  22. Variational imitation learning with diverse-quality demonstrations. In Proceedings of the 37th International Conference on Machine Learning, pages 9407–9417, 2020.
  23. Imitation learning from imperfect demonstration. In International Conference on Machine Learning, pages 6818–6827. PMLR, 2019.
  24. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems, 32(1):4–24, 2020.
  25. Preserving structure in model-free tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(4):756–769, 2014.
Citations (5)

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.